-
Speed Limit: Obey, or Not Obey?
Authors:
Zhengbing He,
Mirco Nanni,
Luca Pappalardo,
Paolo Santi,
Carlo Ratti
Abstract:
It is commonly expected that drivers maintain a driving speed that is lower than or around the posted speed limit, as failure to obey may result in safety risks and fines. By taking randomly selected road segments as examples, this study compares the percentages of speeding vehicles in five countries worldwide, namely, two European countries (Germany and Italy), two Asian countries (Japan and China), and one North American country (the United States). Contrary to expectations, our results show that more than 80% of drivers violate the posted speed limits in the studied road segments in Italy, Japan, and the United States. In particular, a significant portion (45.3%) of drivers in Italy exceed the posted speed limit by a substantial margin (30 km/h), while few speeding vehicles are observed in the road segment examined in China. Meanwhile, it is found that drivers on low-speed-limit roads are more likely to exceed the posted speed limit, particularly when there are fewer on-road vehicles. The comparison of different countries' speeding fines indicates that for the purpose of preventing speeding, increasing fines (as Italy has done) is less effective than enhancing supervision (as China has done). The findings remind law enforcement agencies and traffic authorities of the importance of supervising drivers' behavior and the necessity of revisiting the rationale for the current speed limit settings.
Submitted 27 November, 2023; v1 submitted 22 July, 2023;
originally announced July 2023.
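The headline figures in this abstract are shares of vehicles above the posted limit, and above the limit by more than a margin. A minimal sketch of that arithmetic, assuming a plain list of observed speeds in km/h; the function name `speeding_shares`, the strict `>` comparison, and the sample speeds are illustrative assumptions, not taken from the paper:

```python
def speeding_shares(speeds_kmh, limit_kmh, margin_kmh=30.0):
    """Return two percentages: vehicles over the limit, and vehicles
    over the limit by more than the margin (hypothetical helper)."""
    if not speeds_kmh:
        raise ValueError("need at least one observed speed")
    n = len(speeds_kmh)
    # Assumption: "speeding" means strictly above the posted limit.
    over_limit = sum(1 for v in speeds_kmh if v > limit_kmh)
    over_margin = sum(1 for v in speeds_kmh if v > limit_kmh + margin_kmh)
    return 100.0 * over_limit / n, 100.0 * over_margin / n


# Made-up speeds on a hypothetical 60 km/h segment (illustration only).
print(speeding_shares([40, 55, 61, 95], limit_kmh=60))
```

Reporting both shares separately mirrors the abstract's distinction between any violation and violation by a substantial margin.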
-
Explaining the difference between men's and women's football
Authors:
Luca Pappalardo,
Alessio Rossi,
Giuseppe Pontillo,
Michela Natilli,
Paolo Cintia
Abstract:
Women's football is gaining supporters and practitioners worldwide, raising questions about what the differences are with men's football. While the two sports are often compared based on the players' physical attributes, we analyze the spatio-temporal events during matches in the last World Cups to compare male and female teams based on their technical performance. We train an artificial intelligence model to recognize if a team is male or female based on variables that describe a match's playing intensity, accuracy, and performance quality. Our model accurately distinguishes between men's and women's football, revealing crucial technical differences, which we investigate through the extraction of explanations from the classifier's decisions. The differences between men's and women's football are rooted in play accuracy, the recovery time of ball possession, and the players' performance quality. Our methodology may help journalists and fans understand what makes women's football a distinct sport and coaches design tactics tailored to female teams.
Submitted 5 January, 2021;
originally announced January 2021.
-
Modelling Human Mobility considering Spatial, Temporal and Social Dimensions
Authors:
Giuliano Cornacchia,
Giulio Rossetti,
Luca Pappalardo
Abstract:
Modelling human mobility is crucial in several areas, from urban planning to epidemic modeling, traffic forecasting, and what-if analysis. On the one hand, existing models focus mainly on reproducing the spatial and temporal dimensions of human mobility, while the social aspect, though it influences human movements significantly, is often neglected. On the other hand, those models that capture some social aspects of human mobility have trivial and unrealistic spatial and temporal mechanisms. In this paper, we propose STS-EPR, a modeling framework that embeds mechanisms to capture the spatial, temporal, and social aspects together. Our experiments show that STS-EPR outperforms existing spatial-temporal or social models on a set of standard mobility metrics and that it can be used with a limited amount of information without any significant loss of realism. STS-EPR, which is open-source and tested on open data, is a step towards the design of mechanistic models that can capture all the aspects of human mobility in a comprehensive way.
Submitted 5 July, 2020;
originally announced July 2020.
-
The relationship between human mobility and viral transmissibility during the COVID-19 epidemics in Italy
Authors:
Paolo Cintia,
Luca Pappalardo,
Salvatore Rinzivillo,
Daniele Fadda,
Tobia Boschi,
Fosca Giannotti,
Francesca Chiaromonte,
Pietro Bonato,
Francesco Fabbri,
Francesco Penone,
Marcello Savarese,
Francesco Calabrese,
Giorgio Guzzetta,
Flavia Riccardo,
Valentina Marziano,
Piero Poletti,
Filippo Trentini,
Antonino Bella,
Xanthi Andrianou,
Martina Del Manso,
Massimo Fabiani,
Stefania Bellino,
Stefano Boros,
Alberto Mateo Urdiales,
Maria Fenicia Vescio
, et al. (7 additional authors not shown)
Abstract:
In 2020, countries affected by the COVID-19 pandemic implemented various non-pharmaceutical interventions to counter the spread of the virus and its impact on their healthcare systems and economies. Using Italian data at different geographic scales, we investigate the relationship between human mobility, which subsumes many facets of the population's response to the changing situation, and the spread of COVID-19. Leveraging mobile phone data from February through September 2020, we find a striking relationship between the decrease in mobility flows and the net reproduction number. We find that the time needed to switch off mobility and bring the net reproduction number below the critical threshold of 1 is about one week. Moreover, we observe a strong relationship between the number of days spent above this threshold before the lockdown-induced drop in mobility flows and the total number of infections per 100k inhabitants. Estimating the statistical effect of mobility flows on the net reproduction number over time, we document a positive association at a 2-week lag, strong in March and April, and weaker but still significant in June. Our study demonstrates the value of big mobility data to monitor the epidemic and inform control interventions during its unfolding.
Submitted 1 April, 2021; v1 submitted 4 June, 2020;
originally announced June 2020.
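The lagged association mentioned in this abstract can be illustrated with a lagged Pearson correlation between a daily mobility series and the net reproduction number. This is a simplified stand-in, not the estimation procedure used in the paper (which the abstract does not specify); the function names and the toy series are hypothetical:

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences.
    Raises ZeroDivisionError if either sequence is constant."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)


def lagged_correlation(mobility, rt, lag_days):
    """Correlate mobility on day t with the reproduction number on day t + lag_days."""
    if len(mobility) != len(rt):
        raise ValueError("series must have the same length")
    if lag_days < 0 or len(mobility) - lag_days < 2:
        raise ValueError("lag leaves fewer than two paired observations")
    return pearson(mobility[: len(mobility) - lag_days], rt[lag_days:])


# Toy series in which rt follows mobility with a 2-step delay (illustration only).
mobility = [1, 2, 3, 4, 5, 6]
rt = [0, 0, 1, 2, 3, 4]
print(lagged_correlation(mobility, rt, lag_days=2))
```

With daily data, `lag_days=14` would correspond to the 2-week lag described in the abstract.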
-
Mobile phone data analytics against the COVID-19 epidemics in Italy: flow diversity and local job markets during the national lockdown
Authors:
Pietro Bonato,
Paolo Cintia,
Francesco Fabbri,
Daniele Fadda,
Fosca Giannotti,
Pier Luigi Lopalco,
Sara Mazzilli,
Mirco Nanni,
Luca Pappalardo,
Dino Pedreschi,
Francesco Penone,
Salvatore Rinzivillo,
Giulio Rossetti,
Marcello Savarese,
Lara Tavoschi
Abstract:
Understanding collective mobility patterns is crucial to plan the restart of production and economic activities, which are currently put on stand-by to fight the diffusion of the epidemic. In this report, we use mobile phone data to infer the movements of people between Italian provinces and municipalities, and we analyze the incoming, outgoing and internal mobility flows before and during the national lockdown (March 9th, 2020) and after the closure of non-necessary productive and economic activities (March 23rd, 2020). The population flows across provinces and municipalities enable the modelling of a risk index tailored to the mobility of each municipality or province. Such an index would be a useful indicator to drive counter-measures in reaction to a sudden reactivation of the epidemic. Mobile phone data, even when aggregated to preserve the privacy of individuals, are a useful data source to track the evolution in time of human mobility, hence allowing for monitoring the effectiveness of control measures such as physical distancing. We address the following analytical questions: How does the mobility structure of a territory change? Do incoming and outgoing flows become more predictable during the lockdown, and what are the differences between weekdays and weekends? Can we detect proper local job markets based on human mobility flows, to eventually shape the borders of a local outbreak?
Submitted 23 April, 2020;
originally announced April 2020.
-
Disease State Prediction From Single-Cell Data Using Graph Attention Networks
Authors:
Neal G. Ravindra,
Arijit Sehanobish,
Jenna L. Pappalardo,
David A. Hafler,
David van Dijk
Abstract:
Single-cell RNA sequencing (scRNA-seq) has revolutionized biological discovery, providing an unbiased picture of cellular heterogeneity in tissues. While scRNA-seq has been used extensively to provide insight into both healthy systems and diseases, it has not been used for disease prediction or diagnostics. Graph Attention Networks (GAT) have proven to be versatile for a wide range of tasks by learning from both original features and graph structures. Here we present a graph attention model for predicting disease state from single-cell data on a large dataset of Multiple Sclerosis (MS) patients. MS is a disease of the central nervous system that can be difficult to diagnose. We train our model on single-cell data obtained from blood and cerebrospinal fluid (CSF) for a cohort of seven MS patients and six healthy adults (HA), resulting in 66,667 individual cells. We achieve 92% accuracy in predicting MS, outperforming other state-of-the-art methods such as a graph convolutional network and a random forest classifier. Further, we use the learned graph attention model to get insight into the features (cell types and genes) that are important for this prediction. The graph attention model also allows us to infer a new feature space for the cells that emphasizes the differences between the two conditions. Finally, we use the attention weights to learn a new low-dimensional embedding that can be visualized. To the best of our knowledge, this is the first effort to use graph attention, and deep learning in general, to predict disease state from single-cell data. We envision applying this method to single-cell data for other diseases.
Submitted 12 March, 2020; v1 submitted 14 February, 2020;
originally announced February 2020.
-
PlayeRank: data-driven performance evaluation and player ranking in soccer via a machine learning approach
Authors:
Luca Pappalardo,
Paolo Cintia,
Paolo Ferragina,
Emanuele Massucco,
Dino Pedreschi,
Fosca Giannotti
Abstract:
The problem of evaluating the performance of soccer players is attracting the interest of many companies and the scientific community, thanks to the availability of massive data capturing all the events generated during a match (e.g., tackles, passes, shots, etc.). Unfortunately, there is no consolidated and widely accepted metric for measuring performance quality in all of its facets. In this paper, we design and implement PlayeRank, a data-driven framework that offers a principled multi-dimensional and role-aware evaluation of the performance of soccer players. We build our framework by deploying a massive dataset of soccer-logs consisting of millions of match events pertaining to four seasons of 18 prominent soccer competitions. By comparing PlayeRank to known algorithms for performance evaluation in soccer, and by exploiting a dataset of players' evaluations made by professional soccer scouts, we show that PlayeRank significantly outperforms the competitors. We also explore the ratings produced by PlayeRank and discover interesting patterns about the nature of excellent performances and what distinguishes the top players from the others. Finally, we explore some applications of PlayeRank (i.e., searching players and player versatility), showing its flexibility and efficiency, which make it worth using in the design of a scalable platform for soccer analytics.
Submitted 25 January, 2019; v1 submitted 14 February, 2018;
originally announced February 2018.
-
Prediction of next career moves from scientific profiles
Authors:
Charlotte James,
Luca Pappalardo,
Alina Sirbu,
Filippo Simini
Abstract:
Changing institution is a scientist's key career decision, which plays an important role in education, scientific productivity, and the generation of scientific knowledge. Yet, our understanding of the factors influencing a relocation decision is very limited. In this paper we investigate how the scientific profile of a scientist determines their decision to move (i.e., change institution). To this aim, we describe a scientist's profile by three main aspects: the scientist's recent scientific career, the quality of their scientific environment and the structure of their scientific collaboration network. We then design and implement a two-stage predictive model: first, we use data mining to predict which researcher will move in the next year on the basis of their scientific profile; second, we predict which institution they will choose by using a novel social-gravity model, an adaptation of the traditional gravity model of human mobility. Experiments on a massive dataset of scientific publications show that our approach performs well in both stages, resulting in an 85% reduction of the prediction error with respect to the state-of-the-art approaches.
Submitted 13 February, 2018;
originally announced February 2018.
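The second stage described in this abstract builds on the traditional gravity model of human mobility. The abstract does not specify the social adaptation, so the sketch below covers only the traditional form, in which the probability of choosing a destination is proportional to its "mass" divided by a power of its distance. The function name, the exponent `beta`, and the interpretation of mass (for example, institution size) are assumptions for illustration:

```python
def gravity_probabilities(masses, distances, beta=2.0):
    """Destination-choice probabilities under a plain gravity model:
    p_j proportional to masses[j] / distances[j] ** beta (hypothetical helper).
    Distances are measured from the origin and must be positive."""
    if len(masses) != len(distances):
        raise ValueError("masses and distances must have the same length")
    if any(d <= 0 for d in distances):
        raise ValueError("distances must be positive")
    weights = [m / (d ** beta) for m, d in zip(masses, distances)]
    total = sum(weights)
    if total <= 0:
        raise ValueError("at least one destination needs positive mass")
    return [w / total for w in weights]


# Two equally large candidate destinations, one twice as far away (made-up values).
print(gravity_probabilities([2.0, 2.0], [1.0, 2.0], beta=1.0))
```

A social variant would presumably modify the weights using collaboration-network information, but that step is left out here because the abstract gives no detail on it.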
-
Human Perception of Performance
Authors:
Luca Pappalardo,
Paolo Cintia,
Dino Pedreschi,
Fosca Giannotti,
Albert-Laszlo Barabasi
Abstract:
Humans are routinely asked to evaluate the performance of other individuals, separating success from failure and affecting outcomes from science to education and sports. Yet, in many contexts, the metrics driving the human evaluation process remain unclear. Here we analyse a massive dataset capturing players' evaluations by human judges to explore human perception of performance in soccer, the world's most popular sport. We use machine learning to design an artificial judge which accurately reproduces human evaluation, allowing us to demonstrate how human observers are biased towards diverse contextual features. By investigating the structure of the artificial judge, we uncover the aspects of the players' behavior which attract the attention of human judges, demonstrating that human evaluation is based on a noticeability heuristic where only feature values far from the norm are considered to rate an individual's performance.
Submitted 5 December, 2017;
originally announced December 2017.
-
Effective injury forecasting in soccer with GPS training data and machine learning
Authors:
Alessio Rossi,
Luca Pappalardo,
Paolo Cintia,
Marcello Iaia,
Javier Fernandez,
Daniel Medina
Abstract:
Injuries have a great impact on professional soccer, due to their large influence on team performance and the considerable costs of rehabilitation for players. Existing studies in the literature provide just a preliminary understanding of which factors mostly affect injury risk, while an evaluation of the potential of statistical models in forecasting injuries is still missing. In this paper, we propose a multi-dimensional approach to injury forecasting in professional soccer that is based on GPS measurements and machine learning. By using GPS tracking technology, we collect data describing the training workload of players in a professional soccer club during a season. We then construct an injury forecaster and show that it is both accurate and interpretable by providing a set of case studies of interest to soccer practitioners. Our approach opens a novel perspective on injury prevention, providing a set of simple and practical rules for evaluating and interpreting the complex relations between injury risk and training performance in professional soccer.
Submitted 5 November, 2018; v1 submitted 23 May, 2017;
originally announced May 2017.
-
Quantifying the relation between performance and success in soccer
Authors:
Luca Pappalardo,
Paolo Cintia
Abstract:
The availability of massive data about sports activities now offers the opportunity to quantify the relation between performance and success. In this study, we analyze more than 6,000 games and 10 million events in six European leagues and investigate this relation in soccer competitions. We discover that a team's position in a competition's final ranking is significantly related to its typical performance, as described by a set of technical features extracted from the soccer data. Moreover, we find that, while victories and defeats can be explained by the team's performance during a game, it is difficult to detect draws by using a machine learning approach. We then simulate the outcomes of an entire season of each league relying only on technical data, i.e., excluding the goals scored, exploiting a machine learning model trained on data from past seasons. The simulation produces a team ranking (the PC ranking) which is close to the actual ranking, suggesting that a complex systems' view on soccer has the potential of revealing hidden patterns regarding the relation between performance and success.
Submitted 26 September, 2017; v1 submitted 2 May, 2017;
originally announced May 2017.
-
Data-driven generation of spatio-temporal routines in human mobility
Authors:
Luca Pappalardo,
Filippo Simini
Abstract:
The generation of realistic spatio-temporal trajectories of human mobility is of fundamental importance in a wide range of applications, such as the development of protocols for mobile ad-hoc networks or what-if analysis in urban ecosystems. Current generative algorithms fail to accurately reproduce individuals' recurrent schedules while also accounting for the possibility that individuals may break the routine during periods of variable duration. In this article we present DITRAS (DIary-based TRAjectory Simulator), a framework to simulate the spatio-temporal patterns of human mobility. DITRAS operates in two steps: the generation of a mobility diary and the translation of the mobility diary into a mobility trajectory. We propose a data-driven algorithm which constructs a diary generator from real data, capturing the tendency of individuals to follow or break their routine. We also propose a trajectory generator based on the concept of preferential exploration and preferential return. We instantiate DITRAS with the proposed diary and trajectory generators and compare the resulting algorithm with real data and synthetic data produced by other generative algorithms, built by instantiating DITRAS with several combinations of diary and trajectory generators. We show that the proposed algorithm reproduces the statistical properties of real trajectories in the most accurate way, taking a step forward in the understanding of the origin of the spatio-temporal patterns of human mobility.
Submitted 9 December, 2017; v1 submitted 16 July, 2016;
originally announced July 2016.
-
An analytical framework to nowcast well-being using mobile phone data
Authors:
Luca Pappalardo,
Maarten Vanhoof,
Lorenzo Gabrielli,
Zbigniew Smoreda,
Dino Pedreschi,
Fosca Giannotti
Abstract:
An intriguing open question is whether measurements made on Big Data recording human activities can yield high-fidelity proxies of socio-economic development and well-being. Can we monitor and predict the socio-economic development of a territory just by observing the behavior of its inhabitants through the lens of Big Data? In this paper, we design a data-driven analytical framework that uses mobility measures and social measures extracted from mobile phone data to estimate indicators for socio-economic development and well-being. We discover that the diversity of mobility, defined in terms of entropy of the individual users' trajectories, exhibits (i) significant correlation with two different socio-economic indicators and (ii) the highest importance in predictive models built to predict the socio-economic indicators. Our analytical framework opens an interesting perspective to study human behavior through the lens of Big Data by means of new statistical indicators that quantify and possibly "nowcast" the well-being and the socio-economic development of a territory.
Submitted 16 March, 2016;
originally announced June 2016.
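The abstract defines mobility diversity in terms of the entropy of a user's trajectory. One common way to compute such a quantity is the Shannon entropy of the user's visit frequencies over locations; the paper's exact definition may differ (for example, it may be normalized or based on trips rather than visits), so the sketch below is an assumption, and the function name and sample visits are hypothetical:

```python
import math
from collections import Counter


def mobility_entropy(visits):
    """Shannon entropy (in bits) of the visit-frequency distribution
    over the distinct locations in one user's trajectory."""
    counts = Counter(visits)
    total = sum(counts.values())
    # 0 when every visit is to the same place; grows as visits spread out.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# A user who splits visits evenly across four places (made-up labels).
print(mobility_entropy(["home", "work", "gym", "shop"]))
```

Higher values indicate visits spread more evenly across more places, which is the sense in which the measure captures diversity.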