-
Digital Epidemiology after COVID-19: impact and prospects
Authors:
Sara Mesquita,
Lília Perfeito,
Daniela Paolotti,
Joana Gonçalves-Sá
Abstract:
Epidemiology and Public Health have increasingly relied on structured and unstructured data, collected inside and outside of typical health systems, to study, identify, and mitigate diseases at the population level. Focusing on infectious disease, we review how Digital Epidemiology (DE) was at the beginning of 2020 and how it was changed by the COVID-19 pandemic, in both nature and breadth. We argue that DE will become a progressively useful tool as long as its potential is recognized and its risks are minimized. Therefore, we expand on the current views and present a new definition of DE that, by highlighting the statistical nature of the datasets, helps in identifying possible biases. We offer some recommendations to reduce inequity and threats to privacy and argue in favour of complex multidisciplinary approaches to tackling infectious diseases.
Submitted 8 December, 2023;
originally announced December 2023.
-
Learning from pandemics: using extraordinary events can improve disease now-casting models
Authors:
Sara Mesquita,
Cláudio Haupt Vieira,
Lília Perfeito,
Joana Gonçalves-Sá
Abstract:
Online searches have been used to study different health-related behaviours, including monitoring disease outbreaks. An obvious caveat is that several reasons can motivate individuals to seek online information, and models that are blind to people's motivations are of limited use and can even mislead. This is particularly true during extraordinary public health crises, such as the ongoing pandemic, when fear, curiosity and many other reasons can lead individuals to search for health-related information, masking the disease-driven searches. However, health crises can also offer an opportunity to disentangle different drivers and learn about human behavior. Here, we focus on the two pandemics of the 21st century (2009-H1N1 flu and Covid-19) and propose a methodology to discriminate between search patterns linked to general information seeking (media driven) and search patterns possibly more associated with actual infection (disease driven). We show that by learning from such pandemic periods, with high anxiety and media hype, it is possible to select online searches and improve model performance both in pandemic and seasonal settings. Moreover, and despite the common claim that more data is always better, our results indicate that a lower volume of the right data can be better than including large volumes of apparently similar data, especially in the long run. Our work provides a general framework that can be applied beyond specific events and diseases, and argues that algorithms can be improved simply by using less (better) data. This has important consequences, for example, to solve the accuracy-explainability trade-off in machine-learning.
Submitted 17 January, 2021;
originally announced January 2021.
-
PTPARL-D: Annotated Corpus of 44 years of Portuguese Parliament debates
Authors:
Paulo Almeida,
Manuel Marques-Pita,
Joana Gonçalves-Sá
Abstract:
In a representative democracy, some decide in the name of the rest, and these elected officials are commonly gathered in public assemblies, such as parliaments, where they discuss policies, legislate, and vote on fundamental initiatives. A core aspect of such democratic processes is the plenary debates, where important public discussions take place. Many parliaments around the world are increasingly keeping the transcripts of such debates, and other parliamentary data, in digital formats accessible to the public, increasing transparency and accountability. Furthermore, some parliaments are bringing old paper transcripts to semi-structured digital formats. However, these records are often only provided as raw text or even as images, with little to no annotation, and inconsistent formats, making them difficult to analyze and study, reducing both transparency and public reach. Here, we present PTPARL-D, an annotated corpus of debates in the Portuguese Parliament, from 1976 to 2019, covering the entire period of Portuguese democracy.
Submitted 26 April, 2020;
originally announced April 2020.
-
Some scientific knowledge is a dangerous thing: overconfidence grows non-linearly with knowledge
Authors:
Simone Lackner,
Frederico Francisco,
Cristina Mendonça,
André Mata,
Joana Gonçalves-Sá
Abstract:
Overconfidence is a prevalent problem and particularly consequential in its relation with scientific knowledge: being unaware of one's own ignorance can affect behaviours and threaten public policies and health. We introduce both analytical and methodological changes to the study of confidence in science knowledge and in attitudes towards science and apply them to four large surveys, spanning 30 years in Europe and the USA. We propose a new indirect confidence metric that does not rely on self-reporting or peer comparison, and study how knowledge and confidence vary across their full scale. We find that confidence grows much faster than knowledge, giving rise to a non-linear relationship, with the largest confidence gaps appearing at intermediate knowledge levels. These high-confidence / intermediate-knowledge groups also display the least positive attitudes towards science, with important consequences for science communication. These results are contrary to current models, including the predictions of the Dunning-Kruger effect, and we discuss how our model, if correct, can guide future research and communication policies.
Submitted 15 September, 2023; v1 submitted 26 March, 2019;
originally announced March 2019.
-
Human Sexual Cycles are Driven by Culture and Match Collective Moods
Authors:
Ian B. Wood,
Pedro Leal Varela,
Johan Bollen,
Luis M. Rocha,
Joana Gonçalves-Sá
Abstract:
It is a long-standing question whether human sexual and reproductive cycles are affected predominantly by biology or culture. The literature is mixed with respect to whether biological or cultural factors best explain the reproduction cycle phenomenon, with biological explanations dominating the argument. The biological hypothesis proposes that human reproductive cycles are an adaptation to the seasonal cycles caused by hemisphere positioning, while the cultural hypothesis proposes that conception dates vary mostly due to cultural factors, such as vacation schedule or religious holidays. However, for many countries, common records used to investigate these hypotheses are incomplete or unavailable, biasing existing analysis towards primarily Christian countries in the Northern Hemisphere. Here we show that interest in sex peaks sharply online during major cultural and religious celebrations, regardless of hemisphere location. This online interest, when shifted by nine months, corresponds to documented human birth cycles, even after adjusting for numerous factors such as language, season, and amount of free time due to holidays. We further show that mood, measured independently on Twitter, contains distinct collective emotions associated with those cultural celebrations, and these collective moods correlate with sex search volume outside of these holidays as well. Our results provide converging evidence that the cyclic sexual and reproductive behavior of human populations is mostly driven by culture and that this interest in sex is associated with specific emotions, characteristic of, but not limited to, major cultural and religious celebrations.
Submitted 27 October, 2017; v1 submitted 12 July, 2017;
originally announced July 2017.