-
Modeling non-linear Effects with Neural Networks in Relational Event Models
Authors:
Edoardo Filippi-Mazzola,
Ernst C. Wit
Abstract:
Dynamic networks offer an insight of how relational systems evolve. However, modeling these networks efficiently remains a challenge, primarily due to computational constraints, especially as the number of observed events grows. This paper addresses this issue by introducing the Deep Relational Event Additive Model (DREAM) as a solution to the computational challenges presented by modeling non-lin…
▽ More
Dynamic networks offer an insight of how relational systems evolve. However, modeling these networks efficiently remains a challenge, primarily due to computational constraints, especially as the number of observed events grows. This paper addresses this issue by introducing the Deep Relational Event Additive Model (DREAM) as a solution to the computational challenges presented by modeling non-linear effects in Relational Event Models (REMs). DREAM relies on Neural Additive Models to model non-linear effects, allowing each effect to be captured by an independent neural network. By strategically trading computational complexity for improved memory management and leveraging the computational capabilities of Graphic Processor Units (GPUs), DREAM efficiently captures complex non-linear relationships within data. This approach demonstrates the capability of DREAM in modeling dynamic networks and scaling to larger networks. Comparisons with traditional REM approaches showcase DREAM superior computational efficiency. The model potential is further demonstrated by an examination of the patent citation network, which contains nearly 8 million nodes and 100 million events.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Relational Event Modeling
Authors:
Federica Bianchi,
Edoardo Filippi-Mazzola,
Alessandro Lomi,
Ernst C. Wit
Abstract:
Advances in information technology have increased the availability of time-stamped relational data such as those produced by email exchanges or interaction through social media. Whereas the associated information flows could be aggregated into cross-sectional panels, the temporal ordering of the events frequently contains information that requires new models for the analysis of continuous-time int…
▽ More
Advances in information technology have increased the availability of time-stamped relational data such as those produced by email exchanges or interaction through social media. Whereas the associated information flows could be aggregated into cross-sectional panels, the temporal ordering of the events frequently contains information that requires new models for the analysis of continuous-time interactions, subject to both endogenous and exogenous influences. The introduction of the Relational Event Model (REM) has been a major development that has led to further methodological improvements stimulated by new questions that REMs made possible. In this review, we track the intellectual history of the REM, define its core properties, and discuss why and how it has been considered useful in empirical research. We describe how the demands of novel applications have stimulated methodological, computational, and inferential advancements.
△ Less
Submitted 30 June, 2023;
originally announced June 2023.
-
A Stochastic Gradient Relational Event Additive Model for modelling US patent citations from 1976 until 2022
Authors:
Edoardo Filippi-Mazzola,
Ernst C. Wit
Abstract:
Until 2022, the US patent citation network contained almost 10 million patents and over 100 million citations. To overcome limitations in analyzing such complex networks, we propose a stochastic gradient relational event additive model (STREAM) that models the relationships between citing patents as events that occur over time, where predictors are modeled through B-splines. Our model identifies k…
▽ More
Until 2022, the US patent citation network contained almost 10 million patents and over 100 million citations. To overcome limitations in analyzing such complex networks, we propose a stochastic gradient relational event additive model (STREAM) that models the relationships between citing patents as events that occur over time, where predictors are modeled through B-splines. Our model identifies key factors driving patent citation and reveals insights, such as time windows where citations are more likely and the relevance of the increasing citation numbers per patent. Overall, the STREAM offers the potential for capturing dynamics in large sparse networks while maintaining interpretability.
△ Less
Submitted 24 April, 2023; v1 submitted 14 March, 2023;
originally announced March 2023.
-
Drivers of the decrease of patent similarities from 1976 to 2021
Authors:
Edoardo Filippi-Mazzola,
Federica Bianchi,
Ernst C. Wit
Abstract:
The citation network of patents citing prior art arises from the legal obligation of patent applicants to properly disclose their invention. One way to study the relationship between current patents and their antecedents is by analyzing the similarity between the textual elements of patents. Many patent similarity indicators have shown a constant decrease since the mid-70s. Although several explan…
▽ More
The citation network of patents citing prior art arises from the legal obligation of patent applicants to properly disclose their invention. One way to study the relationship between current patents and their antecedents is by analyzing the similarity between the textual elements of patents. Many patent similarity indicators have shown a constant decrease since the mid-70s. Although several explanations have been proposed, more comprehensive analyses of this phenomenon have been rare. In this paper, we use a computationally efficient measure of patent similarity scores that leverages state-of-the-art Natural Language Processing tools, to investigate potential drivers of this apparent similarity decrease. This is achieved by modeling patent similarity scores by means of generalized additive models. We found that non-linear modeling specifications are able to distinguish between distinct, temporally varying drivers of the patent similarity levels that explain more variation in the data ($R^2\sim 18\%$) compared to previous methods. Moreover, the model reveals an underlying trend in similarity scores that is fundamentally different from the one presented previously.
△ Less
Submitted 14 March, 2023; v1 submitted 12 December, 2022;
originally announced December 2022.
-
Model-based clustering of categorical data based on the Hamming distance
Authors:
Raffaele Argiento,
Edoardo Filippi-Mazzola,
Lucia Paci
Abstract:
A model-based approach is developed for clustering categorical data with no natural ordering. The proposed method exploits the Hamming distance to define a family of probability mass functions to model the data. The elements of this family are then considered as kernels of a finite mixture model with an unknown number of components.
Conjugate Bayesian inference has been derived for the parameter…
▽ More
A model-based approach is developed for clustering categorical data with no natural ordering. The proposed method exploits the Hamming distance to define a family of probability mass functions to model the data. The elements of this family are then considered as kernels of a finite mixture model with an unknown number of components.
Conjugate Bayesian inference has been derived for the parameters of the Hamming distribution model. The mixture is framed in a Bayesian nonparametric setting, and a transdimensional blocked Gibbs sampler is developed to provide full Bayesian inference on the number of clusters, their structure, and the group-specific parameters, facilitating the computation with respect to customary reversible jump algorithms. The proposed model encompasses a parsimonious latent class model as a special case when the number of components is fixed. Model performances are assessed via a simulation study and reference datasets, showing improvements in clustering recovery over existing approaches.
△ Less
Submitted 29 June, 2024; v1 submitted 9 December, 2022;
originally announced December 2022.