Skip to main content

Showing 1–34 of 34 results for author: Aste, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18938  [pdf, other

    q-fin.TR cs.LG

    HLOB -- Information Persistence and Structure in Limit Order Books

    Authors: Antonio Briola, Silvia Bartolucci, Tomaso Aste

    Abstract: We introduce a novel large-scale deep learning model for Limit Order Book mid-price changes forecasting, and we name it `HLOB'. This architecture (i) exploits the information encoded by an Information Filtering Network, namely the Triangulated Maximally Filtered Graph, to unveil deeper and non-trivial dependency structures among volume levels; and (ii) guarantees deterministic design choices to ha… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: 34 pages, 7 figures, 7 tables, 3 equations

  2. arXiv:2403.09267  [pdf, other

    q-fin.TR cs.LG

    Deep Limit Order Book Forecasting

    Authors: Antonio Briola, Silvia Bartolucci, Tomaso Aste

    Abstract: We exploit cutting-edge deep learning methodologies to explore the predictability of high-frequency Limit Order Book mid-price changes for a heterogeneous set of stocks traded on the NASDAQ exchange. In so doing, we release `LOBFrame', an open-source code base to efficiently process large-scale Limit Order Book data and quantitatively assess state-of-the-art deep learning models' forecasting capab… ▽ More

    Submitted 4 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 43 pages, 14 figures, 12 Tables

  3. arXiv:2403.07070  [pdf, ps, other

    cs.CY

    Retail Central Bank Digital Currency: Motivations, Opportunities, and Mistakes

    Authors: Geoffrey Goodell, Hazem Danny Al-Nakib, Tomaso Aste

    Abstract: Nations around the world are conducting research into the design of central bank digital currency (CBDC), a new, digital form of money that would be issued by central banks alongside cash and central bank reserves. Retail CBDC would be used by individuals and businesses as form of money suitable for routine commerce. An important motivating factor in the development of retail CBDC is the decline o… ▽ More

    Submitted 4 April, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 31 pages, 1 figure

  4. arXiv:2310.13572  [pdf, other

    cs.LG

    Unraveling the Enigma of Double Descent: An In-depth Analysis through the Lens of Learned Feature Space

    Authors: Yufei Gu, Xiaoqing Zheng, Tomaso Aste

    Abstract: Double descent presents a counter-intuitive aspect within the machine learning domain, and researchers have observed its manifestation in various models and tasks. While some theoretical explanations have been proposed for this phenomenon in specific contexts, an accepted theory to account for its occurrence in deep learning remains yet to be established. In this study, we revisit the phenomenon o… ▽ More

    Submitted 25 April, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  5. arXiv:2308.13816  [pdf, other

    cs.LG cs.AI cs.CC

    Homological Convolutional Neural Networks

    Authors: Antonio Briola, Yuanrong Wang, Silvia Bartolucci, Tomaso Aste

    Abstract: Deep learning methods have demonstrated outstanding performances on classification and regression tasks on homogeneous data types (e.g., image, audio, and text data). However, tabular data still pose a challenge, with classic machine learning approaches being often computationally cheaper and equally effective than increasingly complex deep learning architectures. The challenge arises from the fac… ▽ More

    Submitted 14 November, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: 26 pages, 5 figures, 11 tables, 1 equation, 1 algorithm

  6. arXiv:2306.15337  [pdf, other

    cs.LG cs.AI

    Homological Neural Networks: A Sparse Architecture for Multivariate Complexity

    Authors: Yuanrong Wang, Antonio Briola, Tomaso Aste

    Abstract: The rapid progress of Artificial Intelligence research came with the development of increasingly complex deep learning models, leading to growing challenges in terms of computational complexity, energy efficiency and interpretability. In this study, we apply advanced network-based information filtering techniques to design a novel deep neural network unit characterized by a sparse higher-order gra… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  7. arXiv:2302.09543  [pdf, ps, other

    cs.LG cs.AI

    Topological Feature Selection

    Authors: Antonio Briola, Tomaso Aste

    Abstract: In this paper, we introduce a novel unsupervised, graph-based filter feature selection technique which exploits the power of topologically constrained network representations. We model dependency structures among features using a family of chordal graphs (the Triangulated Maximally Filtered Graph), and we maximise the likelihood of features' relevance by studying their relative position inside the… ▽ More

    Submitted 1 July, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: Accepted at the 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML) at the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA. 2023. 23 pages, 2 figures, 13 tables

  8. arXiv:2208.12614  [pdf, other

    q-fin.CP cs.LG

    Regime-based Implied Stochastic Volatility Model for Crypto Option Pricing

    Authors: Danial Saef, Yuanrong Wang, Tomaso Aste

    Abstract: The increasing adoption of Digital Assets (DAs), such as Bitcoin (BTC), rises the need for accurate option pricing models. Yet, existing methodologies fail to cope with the volatile nature of the emerging DAs. Many models have been proposed to address the unorthodox market dynamics and frequent disruptions in the microstructure caused by the non-stationarity, and peculiar statistics, in DA markets… ▽ More

    Submitted 27 September, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

    ACM Class: G.3

  9. arXiv:2207.13914  [pdf, other

    q-fin.GN cs.SI q-fin.ST

    Anatomy of a Stablecoin's failure: the Terra-Luna case

    Authors: Antonio Briola, David Vidal-Tomás, Yuanrong Wang, Tomaso Aste

    Abstract: We quantitatively describe the main events that led to the Terra project's failure in May 2022. We first review, in a systematic way, news from heterogeneous social media sources; we discuss the fragility of the Terra project and its vicious dependence on the Anchor protocol. We hence identify the crash's trigger events, analysing hourly and transaction data for Bitcoin, Luna, and TerraUSD. Finall… ▽ More

    Submitted 25 September, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: 17 pages, 7 figures, 6 tables, 1 appendix

  10. arXiv:2203.03991  [pdf, other

    cs.LG q-fin.CP

    Sparsification and Filtering for Spatial-temporal GNN in Multivariate Time-series

    Authors: Yuanrong Wang, Tomaso Aste

    Abstract: We propose an end-to-end architecture for multivariate time-series prediction that integrates a spatial-temporal graph neural network with a matrix filtering module. This module generates filtered (inverse) correlation graphs from multivariate time series before inputting them into a GNN. In contrast with existing sparsification methods adopted in graph neural network, our model explicitly leverag… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: 7 pages, 1 figure, 3tables

  11. arXiv:2101.07107  [pdf, other

    cs.LG cs.AI cs.MA q-fin.TR

    Deep Reinforcement Learning for Active High Frequency Trading

    Authors: Antonio Briola, Jeremy Turiel, Riccardo Marcaccioli, Alvaro Cauderan, Tomaso Aste

    Abstract: We introduce the first end-to-end Deep Reinforcement Learning (DRL) based framework for active high frequency trading in the stock market. We train DRL agents to trade one unit of Intel Corporation stock by employing the Proximal Policy Optimization algorithm. The training is performed on three contiguous months of high frequency Limit Order Book data, of which the last month constitutes the valid… ▽ More

    Submitted 19 August, 2023; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: 9 pages, 4 figures

  12. arXiv:2007.07319  [pdf, other

    q-fin.TR cs.LG q-fin.CP stat.ML

    Deep Learning modeling of Limit Order Book: a comparative perspective

    Authors: Antonio Briola, Jeremy Turiel, Tomaso Aste

    Abstract: The present work addresses theoretical and practical questions in the domain of Deep Learning for High Frequency Trading. State-of-the-art models such as Random models, Logistic Regressions, LSTMs, LSTMs equipped with an Attention mask, CNN-LSTMs and MLPs are reviewed and compared on the same tasks, feature space and dataset, and then clustered according to pairwise similarity and performance metr… ▽ More

    Submitted 18 October, 2020; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: 16 pages, 4 figures, 9 tables

  13. arXiv:2005.04692  [pdf, ps, other

    cs.LG stat.ML

    Topological regularization with information filtering networks

    Authors: Tomaso Aste

    Abstract: A methodology to perform topological regularization via information filtering network is introduced. This methodology can be directly applied to covariance selection problem providing an instrument for sparse probabilistic modeling with both linear and non-linear multivariate probability distributions such as the elliptical and generalized hyperbolic families. It can also be directly implemented f… ▽ More

    Submitted 30 October, 2021; v1 submitted 10 May, 2020; originally announced May 2020.

    Comments: 17 pages , 4 figures, 1 table

  14. arXiv:2004.04605  [pdf, ps, other

    cs.CR q-fin.GN

    The cost of Bitcoin mining has never really increased

    Authors: Yo-Der Song, Tomaso Aste

    Abstract: The Bitcoin network is burning a large amount of energy for mining. In this paper we estimate the lower bound for the global energy cost for a period of ten years from 2010, taking into account changing oil costs, improvements in hashing technologies and hashing activity. Despite a ten-billion-fold increase in hashing activity and a ten-million-fold increase in total energy consumption, we find th… ▽ More

    Submitted 18 May, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: 16 pages, 6 figures

  15. arXiv:2004.04125  [pdf

    physics.soc-ph cs.SI

    Wisdom of Crowds Detects COVID-19 Severity Ahead of Officially Available Data

    Authors: Jeremy Turiel, Delmiro Fernandez-Reyes, Tomaso Aste

    Abstract: During the unfolding of a crisis, it is crucial to determine its severity, yet access to reliable data is challenging. We investigate the relation between geolocated Tweet Intensity of initial COVID-19 related tweet at the beginning of the pandemic across Italian, Spanish and USA regions and mortality in the region a month later. We find significant proportionality between early social media react… ▽ More

    Submitted 22 June, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: 14 pages, 3 figures, 3 tables

  16. arXiv:1906.04619  [pdf, other

    physics.soc-ph cs.DL cs.SI

    Achieving competitive advantage in academia through early career coauthorship with top scientists

    Authors: Weihua Li, Tomaso Aste, Fabio Caccioli, Giacomo Livan

    Abstract: We quantify the long term impact that the coauthorship with established top-cited scientists has on the career of junior researchers in four different scientific disciplines. Through matched pair analysis, we find that junior researchers who coauthor work with top scientists enjoy a persistent competitive advantage throughout the rest of their careers with respect to peers with similar early caree… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: 17 pages, 7 figures, 2 tables

    Journal ref: Nature Communications 10, 5170 (2019)

  17. arXiv:1905.02266  [pdf, other

    stat.ML cs.LG

    Learning Clique Forests

    Authors: Guido Previde Massara, Tomaso Aste

    Abstract: We propose a topological learning algorithm for the estimation of the conditional dependency structure of large sets of random variables from sparse and noisy data. The algorithm, named Maximally Filtered Clique Forest (MFCF), produces a clique forest and an associated Markov Random Field (MRF) by generalising Prim's minimum spanning tree algorithm. To the best of our knowledge, the MFCF presents… ▽ More

    Submitted 16 May, 2021; v1 submitted 6 May, 2019; originally announced May 2019.

    Comments: 47 pages, 26 figures

  18. A Decentralised Digital Identity Architecture

    Authors: Geoff Goodell, Tomaso Aste

    Abstract: Current architectures to validate, certify, and manage identity are based on centralised, top-down approaches that rely on trusted authorities and third-party operators. We approach the problem of digital identity starting from a human rights perspective, with a primary focus on identity systems in the developed world. We assert that individual persons must be allowed to manage their personal info… ▽ More

    Submitted 26 October, 2019; v1 submitted 23 February, 2019; originally announced February 2019.

    Comments: 30 pages, 10 figures, 3 tables

  19. Can Cryptocurrencies Preserve Privacy and Comply with Regulations?

    Authors: Geoff Goodell, Tomaso Aste

    Abstract: Cryptocurrencies offer an alternative to traditional methods of electronic value exchange, promising anonymous, cash-like electronic transfers, but in practice they fall short for several key reasons. We consider the false choice between total surveillance, as represented by banking as currently implemented by institutions, and impenetrable lawlessness, as represented by privacy-enhancing cryptocu… ▽ More

    Submitted 7 May, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: 20 pages, 10 figures, 3 tables

  20. arXiv:1808.03781  [pdf, other

    physics.soc-ph cs.SI

    Reciprocity and success in academic careers

    Authors: Weihua Li, Tomaso Aste, Fabio Caccioli, Giacomo Livan

    Abstract: The growing importance of citation-based bibliometric indicators in sha** the prospects of academic careers incentivizes scientists to boost the numbers of citations they receive. Whereas the exploitation of self-citations has been extensively documented, the impact of reciprocated citations has not yet been studied. We study reciprocity in a citation network of authors, and compare it with the… ▽ More

    Submitted 11 August, 2018; originally announced August 2018.

    Comments: 19 pages, 14 figures

  21. arXiv:1807.05836  [pdf, other

    q-fin.ST cs.LG stat.ML

    Forecasting market states

    Authors: Pier Francesco Procacci, Tomaso Aste

    Abstract: We propose a novel methodology to define, analyze and forecast market states. In our approach market states are identified by a reference sparse precision matrix and a vector of expectation values. In our procedure, each multivariate observation is associated with a given market state accordingly to a minimization of a penalized Mahalanobis distance. The procedure is made computationally very effi… ▽ More

    Submitted 27 May, 2019; v1 submitted 13 July, 2018; originally announced July 2018.

    Comments: 13 pages, 5 figures

    Journal ref: Quantitative Finance 19 (2019) 1491-1498

  22. Dynamic correlations at different time-scales with Empirical Mode Decomposition

    Authors: Noemi Nava, T. Di Matteo, Tomaso Aste

    Abstract: The Empirical Mode Decomposition (EMD) provides a tool to characterize time series in terms of its implicit components oscillating at different time-scales. We apply this decomposition to intraday time series of the following three financial indices: the S\&P 500 (USA), the IPC (Mexico) and the VIX (volatility index USA), obtaining time-varying multidimensional cross-correlations at different time… ▽ More

    Submitted 22 August, 2017; originally announced August 2017.

    Comments: 19 pages, 11 figures

  23. arXiv:1704.01414  [pdf, other

    cs.CY cs.CR

    Blockchain Inefficiency in the Bitcoin Peers Network

    Authors: Giuseppe Pappalardo, T. Di Matteo, Guido Caldarelli, Tomaso Aste

    Abstract: We investigate Bitcoin network monitoring the dynamics of blocks and transactions. We unveil that 43\% of the transactions are still not included in the Blockchain after 1h from the first time they were seen in the network and 20\% of the transactions are still not included in the Blockchain after 30 days, revealing therefore great inefficiency in the Bitcoin system. However, we observe that most… ▽ More

    Submitted 5 April, 2017; originally announced April 2017.

    Comments: 15 pages, 8 figures, 3 tables

  24. arXiv:1606.04872  [pdf, other

    physics.soc-ph cs.CE q-fin.ST

    The multiplex dependency structure of financial markets

    Authors: Nicoló Musmeci, Vincenzo Nicosia, Tomaso Aste, Tiziana Di Matteo, Vito Latora

    Abstract: We propose here a multiplex network approach to investigate simultaneously different types of dependency in complex data sets. In particular, we consider multiplex networks made of four layers corresponding respectively to linear, non-linear, tail, and partial correlations among a set of financial time series. We construct the sparse graph on each layer using a standard network filtering procedure… ▽ More

    Submitted 15 June, 2016; originally announced June 2016.

    Comments: 12 pages, 5 figures

  25. arXiv:1606.02597  [pdf, other

    physics.soc-ph cs.SI

    Excess reciprocity distorts reputation in online social networks

    Authors: Giacomo Livan, Fabio Caccioli, Tomaso Aste

    Abstract: The peer-to-peer (P2P) economy relies on establishing trust in distributed networked systems, where the reliability of a user is assessed through digital peer-review processes that aggregate ratings into reputation scores. Here we present evidence of a network effect which biases digital reputation, revealing that P2P networks display exceedingly high levels of reciprocity. In fact, these are much… ▽ More

    Submitted 2 May, 2017; v1 submitted 8 June, 2016; originally announced June 2016.

    Comments: 22 pages, 13 figures

    Journal ref: Nature Scientific Reports 7, Article number: 3551 (2017)

  26. Parsimonious modeling with Information Filtering Networks

    Authors: Wolfram Barfuss, Guido Previde Massara, T. Di Matteo, Tomaso Aste

    Abstract: We introduce a methodology to construct parsimonious probabilistic models. This method makes use of Information Filtering Networks to produce a robust estimate of the global sparse inverse covariance from a simple sum of local inverse covariances computed on small sub-parts of the network. Being based on local and low-dimensional inversions, this method is computationally very efficient and statis… ▽ More

    Submitted 23 November, 2016; v1 submitted 23 February, 2016; originally announced February 2016.

    Comments: 17 pages, 10 figures, 3 tables

    Journal ref: Phys. Rev. E 94, 062306 (2016)

  27. arXiv:1601.04535  [pdf, other

    q-fin.ST cs.CY physics.data-an q-fin.CP

    A nonlinear impact: evidences of causal effects of social media on market prices

    Authors: Thársis T. P. Souza, Tomaso Aste

    Abstract: Online social networks offer a new way to investigate financial markets' dynamics by enabling the large-scale analysis of investors' collective behavior. We provide empirical evidence that suggests social media and stock markets have a nonlinear causal relationship. We take advantage of an extensive data set composed of social media messages related to DJIA index components. By using information-t… ▽ More

    Submitted 1 March, 2016; v1 submitted 18 January, 2016; originally announced January 2016.

    Comments: 17 pages, 4 figures

  28. arXiv:1508.03981  [pdf, other

    cs.SI cs.CY cs.IR stat.AP

    In Quest of Significance: Identifying Types of Twitter Sentiment Events that Predict Spikes in Sales

    Authors: Olga Kolchyna, Th'arsis T. P. Souza, Tomaso Aste, Philip C. Treleaven

    Abstract: We study the power of Twitter events to predict consumer sales events by analysing sales for 75 companies from the retail sector and over 150 million tweets mentioning those companies along with their sentiment. We suggest an approach for events identification on Twitter extending existing methodologies of event study. We also propose a robust method for clustering Twitter events into different ty… ▽ More

    Submitted 17 August, 2015; originally announced August 2015.

  29. arXiv:1507.00955  [pdf, other

    cs.CL cs.IR cs.LG stat.ME stat.ML

    Twitter Sentiment Analysis: Lexicon Method, Machine Learning Method and Their Combination

    Authors: Olga Kolchyna, Tharsis T. P. Souza, Philip Treleaven, Tomaso Aste

    Abstract: This paper covers the two approaches for sentiment analysis: i) lexicon based method; ii) machine learning method. We describe several techniques to implement these approaches and discuss how they can be adopted for sentiment classification of Twitter messages. We present a comparative study of different lexicon combinations and show that enhancing sentiment lexicons with emoticons, abbreviations… ▽ More

    Submitted 18 September, 2015; v1 submitted 3 July, 2015; originally announced July 2015.

    Comments: 32 pages, 5 figures

    Journal ref: Handbook of Sentiment Analysis in Finance. Mitra, G. and Yu, X. (Eds.). (2016). ISBN 1910571571

  30. arXiv:1507.00784  [pdf, other

    cs.CY cs.SI q-fin.CP

    Twitter Sentiment Analysis Applied to Finance: A Case Study in the Retail Industry

    Authors: Thársis Tuani Pinto Souza, Olga Kolchyna, Philip C. Treleaven, Tomaso Aste

    Abstract: This paper presents a financial analysis over Twitter sentiment analytics extracted from listed retail brands. We investigate whether there is statistically-significant information between the Twitter sentiment and volume, and stock returns and volatility. Traditional newswires are also considered as a proxy for the market sentiment for comparative purpose. The results suggest that social media is… ▽ More

    Submitted 11 July, 2015; v1 submitted 2 July, 2015; originally announced July 2015.

    Comments: 23 pages, 5 figures, 9 tables

    Journal ref: In: Handbook of Sentiment Analysis in Finance. Mitra, G. and Yu, X. (Eds.). (2016). ISBN 1910571571., 2016

  31. arXiv:1505.02445  [pdf, other

    cs.DS cond-mat.stat-mech cs.IR

    Network Filtering for Big Data: Triangulated Maximally Filtered Graph

    Authors: Guido Previde Massara, T. Di Matteo, Tomaso Aste

    Abstract: We propose a network-filtering method, the Triangulated Maximally Filtered Graph (TMFG), that provides an approximate solution to the Weighted Maximal Planar Graph problem. The underlying idea of TMFG consists in building a triangulation that maximizes a score function associated with the amount of information retained by the network. TMFG uses as weights any arbitrary similarity measure to arrang… ▽ More

    Submitted 25 August, 2015; v1 submitted 10 May, 2015; originally announced May 2015.

    Comments: 16 pages, 7 Figures, 2 Tables

  32. Relation between Financial Market Structure and the Real Economy: Comparison between Clustering Methods

    Authors: Nicolo Musmeci, Tomaso Aste, Tiziana Di Matteo

    Abstract: We quantify the amount of information filtered by different hierarchical clustering methods on correlations between stock returns comparing it with the underlying industrial activity structure. Specifically, we apply, for the first time to financial data, a novel hierarchical clustering approach, the Directed Bubble Hierarchical Tree and we compare it with other methods including the Linkage and k… ▽ More

    Submitted 21 January, 2015; v1 submitted 2 June, 2014; originally announced June 2014.

    Comments: 31 pages, 17 figures

    Journal ref: Journal of Network Theory in Finance, VOLUME 4, NUMBER 2 (2018)

  33. Graph theory enables drug repurposing. How a mathematical model can drive the discovery of hidden Mechanisms of Action

    Authors: Ruggero Gramatica, T. Di Matteo, Stefano Giorgetti, Massimo Barbiani, Dorian Bevec, Tomaso Aste

    Abstract: We introduced a methodology to efficiently exploit natural-language expressed biomedical knowledge for repurposing existing drugs towards diseases for which they were not initially intended. Leveraging on developments in Computational Linguistics and Graph Theory, a methodology is defined to build a graph representation of knowledge, which is automatically analysed to discover hidden relations bet… ▽ More

    Submitted 4 June, 2013; originally announced June 2013.

    Comments: 8 pages, 7 figures

    Journal ref: PLoS ONE 9 (2013) e84912

  34. arXiv:1110.4477  [pdf, other

    physics.data-an cs.DS physics.bio-ph q-bio.QM q-fin.CP

    Hierarchical information clustering by means of topologically embedded graphs

    Authors: Won-Min Song, T. Di Matteo, Tomaso Aste

    Abstract: We introduce a graph-theoretic approach to extract clusters and hierarchies in complex data-sets in an unsupervised and deterministic manner, without the use of any prior information. This is achieved by building topologically embedded networks containing the subset of most significant links and analyzing the network structure. For a planar embedding, this method provides both the intra-cluster hi… ▽ More

    Submitted 20 October, 2011; originally announced October 2011.

    Comments: 33 Pages, 18 Figures, 5 Tables

    Journal ref: PLoS ONE 7 (2012) e31929