HLOB -- Information Persistence and Structure in Limit Order Books

Authors: Antonio Briola, Silvia Bartolucci, Tomaso Aste

Abstract: We introduce a novel large-scale deep learning model for Limit Order Book mid-price changes forecasting, and we name it `HLOB'. This architecture (i) exploits the information encoded by an Information Filtering Network, namely the Triangulated Maximally Filtered Graph, to unveil deeper and non-trivial dependency structures among volume levels; and (ii) guarantees deterministic design choices to handle the complexity of the underlying system by drawing inspiration from the groundbreaking class of Homological Convolutional Neural Networks. We test our model against 9 state-of-the-art deep learning alternatives on 3 real-world Limit Order Book datasets, each including 15 stocks traded on the NASDAQ exchange, and we systematically characterize the scenarios where HLOB outperforms state-of-the-art architectures. Our approach sheds new light on the spatial distribution of information in Limit Order Books and on its degradation over increasing prediction horizons, narrowing the gap between microstructural modeling and deep learning-based forecasting in high-frequency financial markets.

Submitted 4 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

Comments: 34 pages, 7 figures, 7 tables, 3 equations

arXiv:2403.09267 [pdf, other]

Deep Limit Order Book Forecasting

Authors: Antonio Briola, Silvia Bartolucci, Tomaso Aste

Abstract: We exploit cutting-edge deep learning methodologies to explore the predictability of high-frequency Limit Order Book mid-price changes for a heterogeneous set of stocks traded on the NASDAQ exchange. In so doing, we release `LOBFrame', an open-source code base to efficiently process large-scale Limit Order Book data and quantitatively assess state-of-the-art deep learning models' forecasting capabilities. Our results are twofold. We demonstrate that the stocks' microstructural characteristics influence the efficacy of deep learning methods and that their high forecasting power does not necessarily correspond to actionable trading signals. We argue that traditional machine learning metrics fail to adequately assess the quality of forecasts in the Limit Order Book context. As an alternative, we propose an innovative operational framework that evaluates predictions' practicality by focusing on the probability of accurately forecasting complete transactions. This work offers academics and practitioners an avenue to make informed and robust decisions on the application of deep learning techniques, their scope and limitations, effectively exploiting emergent statistical properties of the Limit Order Book.

Submitted 4 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

Comments: 43 pages, 14 figures, 12 tables

arXiv:2403.07070 [pdf, ps, other]

Retail Central Bank Digital Currency: Motivations, Opportunities, and Mistakes

Authors: Geoffrey Goodell, Hazem Danny Al-Nakib, Tomaso Aste

Abstract: Nations around the world are conducting research into the design of central bank digital currency (CBDC), a new, digital form of money that would be issued by central banks alongside cash and central bank reserves. Retail CBDC would be used by individuals and businesses as a form of money suitable for routine commerce. An important motivating factor in the development of retail CBDC is the decline of the popularity of central bank money for retail purchases and the increasing use of digital money created by the private sector for such purposes. The debate about how retail CBDC would be designed and implemented has led to many proposals, which have sparked considerable debate about business models, regulatory frameworks, and the socio-technical role of money in general. Here, we present a critical analysis of the existing proposals. We examine their motivations and themes, as well as their underlying assumptions. We also offer a reflection of the opportunity that retail CBDC represents and suggest a way forward in furtherance of the public interest.

Submitted 4 April, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

Comments: 31 pages, 1 figure

arXiv:2310.13572 [pdf, other]

Unraveling the Enigma of Double Descent: An In-depth Analysis through the Lens of Learned Feature Space

Authors: Yufei Gu, Xiaoqing Zheng, Tomaso Aste

Abstract: Double descent presents a counter-intuitive aspect within the machine learning domain, and researchers have observed its manifestation in various models and tasks. While some theoretical explanations have been proposed for this phenomenon in specific contexts, an accepted theory to account for its occurrence in deep learning remains yet to be established. In this study, we revisit the phenomenon of double descent and demonstrate that its occurrence is strongly influenced by the presence of noisy data. Through conducting a comprehensive analysis of the feature space of learned representations, we unveil that double descent arises in imperfect models trained with noisy data. We argue that double descent is a consequence of the model first learning the noisy data until interpolation and then adding implicit regularization via over-parameterization acquiring therefore capability to separate the information from the noise.

Submitted 25 April, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

arXiv:2308.13816 [pdf, other]

Homological Convolutional Neural Networks

Authors: Antonio Briola, Yuanrong Wang, Silvia Bartolucci, Tomaso Aste

Abstract: Deep learning methods have demonstrated outstanding performances on classification and regression tasks on homogeneous data types (e.g., image, audio, and text data). However, tabular data still pose a challenge, with classic machine learning approaches being often computationally cheaper and equally effective than increasingly complex deep learning architectures. The challenge arises from the fact that, in tabular data, the correlation among features is weaker than the one from spatial or semantic relationships in images or natural language, and the dependency structures need to be modeled without any prior information. In this work, we propose a novel deep learning architecture that exploits the data structural organization through topologically constrained network representations to gain relational information from sparse tabular inputs. The resulting model leverages the power of convolution and is centered on a limited number of concepts from network topology to guarantee: (i) a data-centric and deterministic building pipeline; (ii) a high level of interpretability over the inference process; and (iii) an adequate room for scalability. We test our model on 18 benchmark datasets against 5 classic machine learning and 3 deep learning models, demonstrating that our approach reaches state-of-the-art performances on these challenging datasets. The code to reproduce all our experiments is provided at https://github.com/FinancialComputingUCL/HomologicalCNN.

Submitted 14 November, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

Comments: 26 pages, 5 figures, 11 tables, 1 equation, 1 algorithm

arXiv:2306.15337 [pdf, other]

Homological Neural Networks: A Sparse Architecture for Multivariate Complexity

Authors: Yuanrong Wang, Antonio Briola, Tomaso Aste

Abstract: The rapid progress of Artificial Intelligence research came with the development of increasingly complex deep learning models, leading to growing challenges in terms of computational complexity, energy efficiency and interpretability. In this study, we apply advanced network-based information filtering techniques to design a novel deep neural network unit characterized by a sparse higher-order graphical architecture built over the homological structure of underlying data. We demonstrate its effectiveness in two application domains which are traditionally challenging for deep learning: tabular data and time series regression problems. Results demonstrate the advantages of this novel design which can tie or overcome the results of state-of-the-art machine learning and deep learning models using only a fraction of parameters.

Submitted 27 June, 2023; originally announced June 2023.

arXiv:2302.09543 [pdf, ps, other]

Topological Feature Selection

Authors: Antonio Briola, Tomaso Aste

Abstract: In this paper, we introduce a novel unsupervised, graph-based filter feature selection technique which exploits the power of topologically constrained network representations. We model dependency structures among features using a family of chordal graphs (the Triangulated Maximally Filtered Graph), and we maximise the likelihood of features' relevance by studying their relative position inside the network. Such an approach presents three aspects that are particularly satisfactory compared to its alternatives: (i) it is highly tunable and easily adaptable to the nature of input data; (ii) it is fully explainable, maintaining, at the same time, a remarkable level of simplicity; (iii) it is computationally cheaper compared to its alternatives. We test our algorithm on 16 benchmark datasets from different applicative domains showing that it outperforms or matches the current state-of-the-art under heterogeneous evaluation conditions.

Submitted 1 July, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

Comments: Accepted at the 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML) at the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA. 2023. 23 pages, 2 figures, 13 tables

arXiv:2208.12614 [pdf, other]

Regime-based Implied Stochastic Volatility Model for Crypto Option Pricing

Authors: Danial Saef, Yuanrong Wang, Tomaso Aste

Abstract: The increasing adoption of Digital Assets (DAs), such as Bitcoin (BTC), raises the need for accurate option pricing models. Yet, existing methodologies fail to cope with the volatile nature of the emerging DAs. Many models have been proposed to address the unorthodox market dynamics and frequent disruptions in the microstructure caused by the non-stationarity, and peculiar statistics, in DA markets. However, they are either prone to the curse of dimensionality, as additional complexity is required to employ traditional theories, or they overfit historical patterns that may never repeat. Instead, we leverage recent advances in market regime (MR) clustering with the Implied Stochastic Volatility Model (ISVM). Time-regime clustering is a temporal clustering method that clusters the historic evolution of a market into different volatility periods accounting for non-stationarity. ISVM can incorporate investor expectations in each of the sentiment-driven periods by using implied volatility (IV) data. In this paper, we applied this integrated time-regime clustering and ISVM method (termed MR-ISVM) to high-frequency data on BTC options at the popular trading platform Deribit. We demonstrate that MR-ISVM contributes to overcome the burden of complex adaption to jumps in higher order characteristics of option pricing models. This allows us to price the market based on the expectations of its participants in an adaptive fashion.

Submitted 27 September, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

ACM Class: G.3

arXiv:2207.13914 [pdf, other]

doi 10.1016/j.frl.2022.103358

Anatomy of a Stablecoin's failure: the Terra-Luna case

Authors: Antonio Briola, David Vidal-Tomás, Yuanrong Wang, Tomaso Aste

Abstract: We quantitatively describe the main events that led to the Terra project's failure in May 2022. We first review, in a systematic way, news from heterogeneous social media sources; we discuss the fragility of the Terra project and its vicious dependence on the Anchor protocol. We hence identify the crash's trigger events, analysing hourly and transaction data for Bitcoin, Luna, and TerraUSD. Finally, using state-of-the-art techniques from network science, we study the evolution of dependency structures for 61 highly capitalised cryptocurrencies during the down-market and we also highlight the absence of herding behaviour analysing cross-sectional absolute deviation of returns.

Submitted 25 September, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

Comments: 17 pages, 7 figures, 6 tables, 1 appendix

arXiv:2203.03991 [pdf, other]

Sparsification and Filtering for Spatial-temporal GNN in Multivariate Time-series

Authors: Yuanrong Wang, Tomaso Aste

Abstract: We propose an end-to-end architecture for multivariate time-series prediction that integrates a spatial-temporal graph neural network with a matrix filtering module. This module generates filtered (inverse) correlation graphs from multivariate time series before inputting them into a GNN. In contrast with existing sparsification methods adopted in graph neural networks, our model explicitly leverages time-series filtering to overcome the low signal-to-noise ratio typical of complex systems data. We present a set of experiments, where we predict future sales from a synthetic time-series sales dataset. The proposed spatial-temporal graph neural network displays superior performances with respect to baseline approaches, with no graphical information, and with fully connected, disconnected graphs and unfiltered graphs.

Submitted 8 March, 2022; originally announced March 2022.

Comments: 7 pages, 1 figure, 3 tables

arXiv:2101.07107 [pdf, other]

Deep Reinforcement Learning for Active High Frequency Trading

Authors: Antonio Briola, Jeremy Turiel, Riccardo Marcaccioli, Alvaro Cauderan, Tomaso Aste

Abstract: We introduce the first end-to-end Deep Reinforcement Learning (DRL) based framework for active high frequency trading in the stock market. We train DRL agents to trade one unit of Intel Corporation stock by employing the Proximal Policy Optimization algorithm. The training is performed on three contiguous months of high frequency Limit Order Book data, of which the last month constitutes the validation data. In order to maximise the signal to noise ratio in the training data, we compose the latter by only selecting training samples with largest price changes. The test is then carried out on the following month of data. Hyperparameters are tuned using the Sequential Model Based Optimization technique. We consider three different state characterizations, which differ in their LOB-based meta-features. Analysing the agents' performances on test data, we argue that the agents are able to create a dynamic representation of the underlying environment. They identify occasional regularities present in the data and exploit them to create long-term profitable trading strategies. Indeed, agents learn trading strategies able to produce stable positive returns in spite of the highly stochastic and non-stationary environment.

Submitted 19 August, 2023; v1 submitted 18 January, 2021; originally announced January 2021.

Comments: 9 pages, 4 figures

arXiv:2007.07319 [pdf, other]

Deep Learning modeling of Limit Order Book: a comparative perspective

Authors: Antonio Briola, Jeremy Turiel, Tomaso Aste

Abstract: The present work addresses theoretical and practical questions in the domain of Deep Learning for High Frequency Trading. State-of-the-art models such as Random models, Logistic Regressions, LSTMs, LSTMs equipped with an Attention mask, CNN-LSTMs and MLPs are reviewed and compared on the same tasks, feature space and dataset, and then clustered according to pairwise similarity and performance metrics. The underlying dimensions of the modeling techniques are hence investigated to understand whether these are intrinsic to the Limit Order Book's dynamics. We observe that the Multilayer Perceptron performs comparably to or better than state-of-the-art CNN-LSTM architectures indicating that dynamic spatial and temporal dimensions are a good approximation of the LOB's dynamics, but not necessarily the true underlying dimensions.

Submitted 18 October, 2020; v1 submitted 12 July, 2020; originally announced July 2020.

Comments: 16 pages, 4 figures, 9 tables

arXiv:2005.04692 [pdf, ps, other]

Topological regularization with information filtering networks

Authors: Tomaso Aste

Abstract: A methodology to perform topological regularization via information filtering network is introduced. This methodology can be directly applied to covariance selection problem providing an instrument for sparse probabilistic modeling with both linear and non-linear multivariate probability distributions such as the elliptical and generalized hyperbolic families. It can also be directly implemented for $L_0$-norm regularized multicollinear regression. In this paper, I describe in detail an application to sparse modeling with multivariate Student-t. A specific $L_0$-norm regularized expectation-maximization likelihood maximization procedure is proposed for this sparse Student-t case. Examples with real data from stock prices log-returns and from artificially generated data demonstrate the applicability, performances, and potentials of this methodology.

Submitted 30 October, 2021; v1 submitted 10 May, 2020; originally announced May 2020.

Comments: 17 pages, 4 figures, 1 table

arXiv:2004.04605 [pdf, ps, other]

The cost of Bitcoin mining has never really increased

Authors: Yo-Der Song, Tomaso Aste

Abstract: The Bitcoin network is burning a large amount of energy for mining. In this paper we estimate the lower bound for the global energy cost for a period of ten years from 2010, taking into account changing oil costs, improvements in hashing technologies and hashing activity. Despite a ten-billion-fold increase in hashing activity and a ten-million-fold increase in total energy consumption, we find the cost relative to the volume of transactions has neither increased nor decreased since 2010. This is consistent with the perspective that, in order to keep the Blockchain system secure from double spending attacks, the proof of work must cost a sizable fraction of the value that can be transferred through the network. We estimate that in the Bitcoin network this fraction is of the order of 1%.

Submitted 18 May, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

Comments: 16 pages, 6 figures

arXiv:2004.04125 [pdf]

Wisdom of Crowds Detects COVID-19 Severity Ahead of Officially Available Data

Authors: Jeremy Turiel, Delmiro Fernandez-Reyes, Tomaso Aste

Abstract: During the unfolding of a crisis, it is crucial to determine its severity, yet access to reliable data is challenging. We investigate the relation between geolocated Tweet Intensity of initial COVID-19 related tweets at the beginning of the pandemic across Italian, Spanish and USA regions and mortality in the region a month later. We find significant proportionality between early social media reaction and the cumulative number of COVID-19 deaths almost a month later. Our findings suggest that "the crowds" perceived the risk correctly. This is one of the few examples where the "wisdom of crowds" can be quantified and applied in practice. This can be used to create real-time alert systems that could be of help for crisis-management and intervention, especially in developing countries. Such systems could contribute to inform fast-response policy making at early stages of a crisis.

Submitted 22 June, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

Comments: 14 pages, 3 figures, 3 tables

arXiv:1906.04619 [pdf, other]

doi 10.1038/s41467-019-13130-4

Achieving competitive advantage in academia through early career coauthorship with top scientists

Authors: Weihua Li, Tomaso Aste, Fabio Caccioli, Giacomo Livan

Abstract: We quantify the long term impact that the coauthorship with established top-cited scientists has on the career of junior researchers in four different scientific disciplines. Through matched pair analysis, we find that junior researchers who coauthor work with top scientists enjoy a persistent competitive advantage throughout the rest of their careers with respect to peers with similar early career profiles. Such a competitive advantage materialises as a higher probability of repeatedly coauthoring work with top-cited scientists, and, ultimately, as a higher probability of becoming one. Notably, we find that the coauthorship with a top scientist has the strongest impact on the careers of junior researchers affiliated with less prestigious institutions. As a consequence, we argue that such institutions may hold vast amounts of untapped potential, which may be realised by improving access to top scientists.

Submitted 11 June, 2019; originally announced June 2019.

Comments: 17 pages, 7 figures, 2 tables

Journal ref: Nature Communications 10, 5170 (2019)

arXiv:1905.02266 [pdf, other]

Learning Clique Forests

Authors: Guido Previde Massara, Tomaso Aste

Abstract: We propose a topological learning algorithm for the estimation of the conditional dependency structure of large sets of random variables from sparse and noisy data. The algorithm, named Maximally Filtered Clique Forest (MFCF), produces a clique forest and an associated Markov Random Field (MRF) by generalising Prim's minimum spanning tree algorithm. To the best of our knowledge, the MFCF presents three elements of novelty with respect to existing structure learning approaches. The first is the repeated application of a local topological move, the clique expansion, that preserves the decomposability of the underlying graph. Through this move the decomposability and calculation of scores is performed incrementally at the variable (rather than edge) level, and this provides better computational performance and an intuitive application of multivariate statistical tests. The second is the capability to accommodate a variety of score functions and, while this paper is focused on multivariate normal distributions, it can be directly generalised to different types of statistics. Finally, the third is the variable range of allowed clique sizes which is an adjustable topological constraint that acts as a topological penalizer providing a way to tackle sparsity at $l_0$ semi-norm level; this allows a clean decoupling of structure learning and parameter estimation. The MFCF produces a representation of the clique forest, together with a perfect ordering of the cliques and a perfect elimination ordering for the vertices. As an example we propose an application to covariance selection models and we show that the MFCF outperforms the Graphical Lasso for a number of classes of matrices.

Submitted 16 May, 2021; v1 submitted 6 May, 2019; originally announced May 2019.

Comments: 47 pages, 26 figures

arXiv:1902.08769 [pdf, ps, other]

doi 10.3389/fbloc.2019.00017

A Decentralised Digital Identity Architecture

Authors: Geoff Goodell, Tomaso Aste

Abstract: Current architectures to validate, certify, and manage identity are based on centralised, top-down approaches that rely on trusted authorities and third-party operators. We approach the problem of digital identity starting from a human rights perspective, with a primary focus on identity systems in the developed world. We assert that individual persons must be allowed to manage their personal information in a multitude of different ways in different contexts and that to do so, each individual must be able to create multiple unrelated identities. Therefore, we first define a set of fundamental constraints that digital identity systems must satisfy to preserve and promote privacy as required for individual autonomy. With these constraints in mind, we then propose a decentralised, standards-based approach, using a combination of distributed ledger technology and thoughtful regulation, to facilitate many-to-many relationships among providers of key services. Our proposal for digital identity differs from others in its approach to trust in that we do not seek to bind credentials to each other or to a mutually trusted authority to achieve strong non-transferability. Because the system does not implicitly encourage its users to maintain a single aggregated identity that can potentially be constrained or reconstructed against their interests, individuals and organisations are free to embrace the system and share in its benefits.

Submitted 26 October, 2019; v1 submitted 23 February, 2019; originally announced February 2019.

Comments: 30 pages, 10 figures, 3 tables

arXiv:1811.12240 [pdf, other]

doi 10.3389/fbloc.2019.00004

Can Cryptocurrencies Preserve Privacy and Comply with Regulations?

Authors: Geoff Goodell, Tomaso Aste

Abstract: Cryptocurrencies offer an alternative to traditional methods of electronic value exchange, promising anonymous, cash-like electronic transfers, but in practice they fall short for several key reasons. We consider the false choice between total surveillance, as represented by banking as currently implemented by institutions, and impenetrable lawlessness, as represented by privacy-enhancing cryptocurrencies as currently deployed. We identify a range of alternatives between those two extremes, and we consider two potential compromise approaches that offer both the auditability required for regulators and the anonymity required for users.

Submitted 7 May, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

Comments: 20 pages, 10 figures, 3 tables

arXiv:1808.03781 [pdf, other]

Reciprocity and success in academic careers

Authors: Weihua Li, Tomaso Aste, Fabio Caccioli, Giacomo Livan

Abstract: The growing importance of citation-based bibliometric indicators in shaping the prospects of academic careers incentivizes scientists to boost the numbers of citations they receive. Whereas the exploitation of self-citations has been extensively documented, the impact of reciprocated citations has not yet been studied. We study reciprocity in a citation network of authors, and compare it with the average reciprocity computed in an ensemble of null network models. We show that obtaining citations through reciprocity correlates negatively with a successful career in the long term. Nevertheless, at the aggregate level we show evidence of a steady increase in reciprocity over the years, largely fuelled by the exchange of citations between coauthors. Our results characterize the structure of author networks in a time of increasing emphasis on citation-based indicators, and we discuss their implications towards a fairer assessment of academic impact.

Submitted 11 August, 2018; originally announced August 2018.

Comments: 19 pages, 14 figures

arXiv:1807.05836 [pdf, other]

doi 10.1080/14697688.2019.1622313

Forecasting market states

Authors: Pier Francesco Procacci, Tomaso Aste

Abstract: We propose a novel methodology to define, analyze and forecast market states. In our approach market states are identified by a reference sparse precision matrix and a vector of expectation values. In our procedure, each multivariate observation is associated with a given market state accordingly to a minimization of a penalized Mahalanobis distance. The procedure is made computationally very efficient and can be used with a large number of assets. We demonstrate that this procedure is successful at clustering different states of the markets in an unsupervised manner. In particular, we describe an experiment with one hundred log-returns and two states in which the methodology automatically associates states prevalently to pre- and post- crisis periods with one state gathering periods with average positive returns and the other state periods with average negative returns, therefore discovering spontaneously the common classification of `bull' and `bear' markets. In another experiment, with again one hundred log-returns and two states, we demonstrate that this procedure can be efficiently used to forecast off-sample future market states with significant prediction accuracy. This methodology opens the way to a range of applications in risk management and trading strategies in the context where the correlation structure plays a central role.

Submitted 27 May, 2019; v1 submitted 13 July, 2018; originally announced July 2018.

Comments: 13 pages, 5 figures

Journal ref: Quantitative Finance 19 (2019) 1491-1498

arXiv:1708.06586 [pdf, other]

doi 10.1016/j.physa.2018.02.108

Dynamic correlations at different time-scales with Empirical Mode Decomposition

Authors: Noemi Nava, T. Di Matteo, Tomaso Aste

Abstract: The Empirical Mode Decomposition (EMD) provides a tool to characterize time series in terms of its implicit components oscillating at different time-scales. We apply this decomposition to intraday time series of the following three financial indices: the S&P 500 (USA), the IPC (Mexico) and the VIX (volatility index USA), obtaining time-varying multidimensional cross-correlations at different time-scales. The correlations computed over a rolling window are compared across the three indices, across the components at different time-scales, at different lags and over time. We uncover a rich heterogeneity of interactions which depends on the time-scale and has important lead-lag relations which can have practical use for portfolio management, risk estimation and investments.

Submitted 22 August, 2017; originally announced August 2017.

Comments: 19 pages, 11 figures

arXiv:1704.01414 [pdf, other]

Blockchain Inefficiency in the Bitcoin Peers Network

Authors: Giuseppe Pappalardo, T. Di Matteo, Guido Caldarelli, Tomaso Aste

Abstract: We investigate Bitcoin network monitoring the dynamics of blocks and transactions. We unveil that 43% of the transactions are still not included in the Blockchain after 1h from the first time they were seen in the network and 20% of the transactions are still not included in the Blockchain after 30 days, revealing therefore great inefficiency in the Bitcoin system. However, we observe that most of these `forgotten' transactions have low values and in terms of transferred value the system is less inefficient with 93% of the transactions value being included into the Blockchain within 3h. The fact that a sizeable fraction of transactions is not processed timely casts serious doubts on the usability of the Bitcoin Blockchain for reliable time-stamping purposes and calls for a debate about the right systems of incentives which a peer-to-peer unintermediated system should introduce to promote efficient transaction recording.

Submitted 5 April, 2017; originally announced April 2017.

Comments: 15 pages, 8 figures, 3 tables

arXiv:1606.04872 [pdf, other]

The multiplex dependency structure of financial markets

Authors: Nicolò Musmeci, Vincenzo Nicosia, Tomaso Aste, Tiziana Di Matteo, Vito Latora

Abstract: We propose here a multiplex network approach to investigate simultaneously different types of dependency in complex data sets. In particular, we consider multiplex networks made of four layers corresponding respectively to linear, non-linear, tail, and partial correlations among a set of financial time series. We construct the sparse graph on each layer using a standard network filtering procedure, and we then analyse the structural properties of the obtained multiplex networks. The study of the time evolution of the multiplex constructed from financial data uncovers important changes in intrinsically multiplex properties of the network, and such changes are associated with periods of financial stress. We observe that some features are unique to the multiplex structure and would not be visible otherwise by the separate analysis of the single-layer networks corresponding to each dependency measure.

Submitted 15 June, 2016; originally announced June 2016.

Comments: 12 pages, 5 figures

arXiv:1606.02597 [pdf, other]

doi 10.1038/s41598-017-03481-7

Excess reciprocity distorts reputation in online social networks

Authors: Giacomo Livan, Fabio Caccioli, Tomaso Aste

Abstract: The peer-to-peer (P2P) economy relies on establishing trust in distributed networked systems, where the reliability of a user is assessed through digital peer-review processes that aggregate ratings into reputation scores. Here we present evidence of a network effect which biases digital reputation, revealing that P2P networks display exceedingly high levels of reciprocity. In fact, these are much higher than those compatible with a null assumption that preserves the empirically observed level of agreement between all pairs of nodes, and rather close to the highest levels structurally compatible with the networks' reputation landscape. This indicates that the crowdsourcing process underpinning digital reputation can be significantly distorted by the attempt of users to mutually boost reputation, or to retaliate, through the exchange of ratings. We uncover that the least active users are predominantly responsible for such reciprocity-induced bias, and that this fact can be exploited to obtain more reliable reputation estimates. Our findings are robust across different P2P platforms, including both cases where ratings are used to vote on the content produced by users and to vote on user profiles.

Submitted 2 May, 2017; v1 submitted 8 June, 2016; originally announced June 2016.

Comments: 22 pages, 13 figures

Journal ref: Nature Scientific Reports 7, Article number: 3551 (2017)

arXiv:1602.07349 [pdf, other]

doi 10.1103/PhysRevE.94.062306

Parsimonious modeling with Information Filtering Networks

Authors: Wolfram Barfuss, Guido Previde Massara, T. Di Matteo, Tomaso Aste

Abstract: We introduce a methodology to construct parsimonious probabilistic models. This method makes use of Information Filtering Networks to produce a robust estimate of the global sparse inverse covariance from a simple sum of local inverse covariances computed on small sub-parts of the network. Being based on local and low-dimensional inversions, this method is computationally very efficient and statistically robust even for the estimation of inverse covariance of high-dimensional, noisy and short time-series. Applied to financial data our method results computationally more efficient than state-of-the-art methodologies such as Glasso producing, in a fraction of the computation time, models that can have equivalent or better performances but with a sparser inference structure. We also discuss performances with sparse factor models where we notice that relative performances decrease with the number of factors. The local nature of this approach allows us to perform computations in parallel and provides a tool for dynamical adaptation by partial updating when the properties of some variables change without the need of recomputing the whole model. This makes this approach particularly suitable to handle big datasets with large numbers of variables. Examples of practical application for forecasting, stress testing and risk allocation in financial systems are also provided.

Submitted 23 November, 2016; v1 submitted 23 February, 2016; originally announced February 2016.

Comments: 17 pages, 10 figures, 3 tables

Journal ref: Phys. Rev. E 94, 062306 (2016)

arXiv:1601.04535 [pdf, other]

A nonlinear impact: evidences of causal effects of social media on market prices

Authors: Thársis T. P. Souza, Tomaso Aste

Abstract: Online social networks offer a new way to investigate financial markets' dynamics by enabling the large-scale analysis of investors' collective behavior. We provide empirical evidence that suggests social media and stock markets have a nonlinear causal relationship. We take advantage of an extensive data set composed of social media messages related to DJIA index components. By using information-theoretic measures to cope for possible nonlinear causal coupling between social media and stock markets systems, we point out stunning differences in the results with respect to linear coupling. Two main conclusions are drawn: First, social media significant causality on stocks' returns are purely nonlinear in most cases; Second, social media dominates the directional coupling with stock market, an effect not observable within linear modeling. Results also serve as empirical guidance on model adequacy in the investigation of sociotechnical and financial systems.

Submitted 1 March, 2016; v1 submitted 18 January, 2016; originally announced January 2016.

Comments: 17 pages, 4 figures

arXiv:1508.03981 [pdf, other]

In Quest of Significance: Identifying Types of Twitter Sentiment Events that Predict Spikes in Sales

Authors: Olga Kolchyna, Thársis T. P. Souza, Tomaso Aste, Philip C. Treleaven

Abstract: We study the power of Twitter events to predict consumer sales events by analysing sales for 75 companies from the retail sector and over 150 million tweets mentioning those companies along with their sentiment. We suggest an approach for events identification on Twitter extending existing methodologies of event study. We also propose a robust method for clustering Twitter events into different types based on their shape, which captures the varying dynamics of information propagation through the social network. We provide empirical evidence that through events differentiation based on their shape we can clearly identify types of Twitter events that have a more significant power to predict spikes in sales than the aggregated Twitter signal.

Submitted 17 August, 2015; originally announced August 2015.

arXiv:1507.00955 [pdf, other]

Twitter Sentiment Analysis: Lexicon Method, Machine Learning Method and Their Combination

Authors: Olga Kolchyna, Tharsis T. P. Souza, Philip Treleaven, Tomaso Aste

Abstract: This paper covers the two approaches for sentiment analysis: i) lexicon based method; ii) machine learning method. We describe several techniques to implement these approaches and discuss how they can be adopted for sentiment classification of Twitter messages. We present a comparative study of different lexicon combinations and show that enhancing sentiment lexicons with emoticons, abbreviations and social-media slang expressions increases the accuracy of lexicon-based classification for Twitter. We discuss the importance of feature generation and feature selection processes for machine learning sentiment classification. To quantify the performance of the main sentiment analysis methods over Twitter we run these algorithms on a benchmark Twitter dataset from the SemEval-2013 competition, task 2-B. The results show that machine learning method based on SVM and Naive Bayes classifiers outperforms the lexicon method. We present a new ensemble method that uses a lexicon based sentiment score as input feature for the machine learning approach. The combined method proved to produce more precise classifications. We also show that employing a cost-sensitive classifier for highly unbalanced datasets yields an improvement of sentiment classification performance up to 7%.

Submitted 18 September, 2015; v1 submitted 3 July, 2015; originally announced July 2015.

Comments: 32 pages, 5 figures

Journal ref: Handbook of Sentiment Analysis in Finance. Mitra, G. and Yu, X. (Eds.). (2016). ISBN 1910571571

arXiv:1507.00784 [pdf, other]

Twitter Sentiment Analysis Applied to Finance: A Case Study in the Retail Industry

Authors: Thársis Tuani Pinto Souza, Olga Kolchyna, Philip C. Treleaven, Tomaso Aste

Abstract: This paper presents a financial analysis over Twitter sentiment analytics extracted from listed retail brands. We investigate whether there is statistically-significant information between the Twitter sentiment and volume, and stock returns and volatility. Traditional newswires are also considered as a proxy for the market sentiment for comparative purpose. The results suggest that social media is indeed a valuable source in the analysis of the financial dynamics in the retail sector even when compared to mainstream news such as the Wall Street Journal and Dow Jones Newswires.

Submitted 11 July, 2015; v1 submitted 2 July, 2015; originally announced July 2015.

Comments: 23 pages, 5 figures, 9 tables

Journal ref: Handbook of Sentiment Analysis in Finance. Mitra, G. and Yu, X. (Eds.). (2016). ISBN 1910571571

arXiv:1505.02445 [pdf, other]

Network Filtering for Big Data: Triangulated Maximally Filtered Graph

Authors: Guido Previde Massara, T. Di Matteo, Tomaso Aste

Abstract: We propose a network-filtering method, the Triangulated Maximally Filtered Graph (TMFG), that provides an approximate solution to the Weighted Maximal Planar Graph problem. The underlying idea of TMFG consists in building a triangulation that maximizes a score function associated with the amount of information retained by the network. TMFG uses as weights any arbitrary similarity measure to arrange data into a meaningful network structure that can be used for clustering, community detection and modeling. The method is fast, adaptable and scalable to very large datasets, it allows online updating and learning as new data can be inserted and deleted with combinations of local and non-local moves. TMFG permits readjustments of the network in consequence of changes in the strength of the similarity measure. The method is based on local topological moves and can therefore take advantage of parallel and GPUs computing. We discuss how this network-filtering method can be used intuitively and efficiently for big data studies and its significance from an information-theoretic perspective.

Submitted 25 August, 2015; v1 submitted 10 May, 2015; originally announced May 2015.

Comments: 16 pages, 7 Figures, 2 Tables

arXiv:1406.0496 [pdf, other]

doi 10.1371/journal.pone.0116201

Relation between Financial Market Structure and the Real Economy: Comparison between Clustering Methods

Authors: Nicolo Musmeci, Tomaso Aste, Tiziana Di Matteo

Abstract: We quantify the amount of information filtered by different hierarchical clustering methods on correlations between stock returns comparing it with the underlying industrial activity structure. Specifically, we apply, for the first time to financial data, a novel hierarchical clustering approach, the Directed Bubble Hierarchical Tree and we compare it with other methods including the Linkage and k-medoids. In particular, by taking the industrial sector classification of stocks as a benchmark partition, we evaluate how the different methods retrieve this classification. The results show that the Directed Bubble Hierarchical Tree can outperform other methods, being able to retrieve more information with fewer clusters. Moreover, we show that the economic information is hidden at different levels of the hierarchical structures depending on the clustering method. The dynamical analysis on a rolling window also reveals that the different methods show different degrees of sensitivity to events affecting financial markets, like crises. These results can be of interest for all the applications of clustering methods to portfolio optimization and risk hedging.

Submitted 21 January, 2015; v1 submitted 2 June, 2014; originally announced June 2014.

Comments: 31 pages, 17 figures

Journal ref: Journal of Network Theory in Finance, VOLUME 4, NUMBER 2 (2018)

arXiv:1306.0924 [pdf]

doi 10.1371/journal.pone.0084912

Graph theory enables drug repurposing. How a mathematical model can drive the discovery of hidden Mechanisms of Action

Authors: Ruggero Gramatica, T. Di Matteo, Stefano Giorgetti, Massimo Barbiani, Dorian Bevec, Tomaso Aste

Abstract: We introduced a methodology to efficiently exploit natural-language expressed biomedical knowledge for repurposing existing drugs towards diseases for which they were not initially intended. Leveraging on developments in Computational Linguistics and Graph Theory, a methodology is defined to build a graph representation of knowledge, which is automatically analysed to discover hidden relations between any drug and any disease: these relations are specific paths among the biomedical entities of the graph, representing possible Modes of Action for any given pharmacological compound. These paths are ranked according to their relevance, exploiting a measure induced by a stochastic process defined on the graph. Here we show, providing real-world examples, how the method successfully retrieves known pathophysiological Mode of Actions and finds new ones by meaningfully selecting and aggregating contributions from known bio-molecular interactions. Applications of this methodology are presented, and prove the efficacy of the method for selecting drugs as treatment options for rare diseases.

Submitted 4 June, 2013; originally announced June 2013.

Comments: 8 pages, 7 figures

Journal ref: PLoS ONE 9 (2013) e84912

arXiv:1110.4477 [pdf, other]

doi 10.1371/journal.pone.0031929

Hierarchical information clustering by means of topologically embedded graphs

Authors: Won-Min Song, T. Di Matteo, Tomaso Aste

Abstract: We introduce a graph-theoretic approach to extract clusters and hierarchies in complex data-sets in an unsupervised and deterministic manner, without the use of any prior information. This is achieved by building topologically embedded networks containing the subset of most significant links and analyzing the network structure. For a planar embedding, this method provides both the intra-cluster hierarchy, which describes the way clusters are composed, and the inter-cluster hierarchy which describes how clusters gather together. We discuss performance, robustness and reliability of this method by first investigating several artificial data-sets, finding that it can outperform significantly other established approaches. Then we show that our method can successfully differentiate meaningful clusters and hierarchies in a variety of real data-sets. In particular, we find that the application to gene expression patterns of lymphoma samples uncovers biologically significant groups of genes which play key-roles in diagnosis, prognosis and treatment of some of the most relevant human lymphoid malignancies.

Submitted 20 October, 2011; originally announced October 2011.

Comments: 33 Pages, 18 Figures, 5 Tables

Journal ref: PLoS ONE 7 (2012) e31929

Showing 1–34 of 34 results for author: Aste, T