-
Decoding the Sociotechnical Dimensions of Digital Misinformation: A Comprehensive Literature Review
Authors:
Alisson Andrey Puska,
Luiz Adolpho Baroni,
Roberto Pereira
Abstract:
This paper presents a systematic literature review in Computer Science that provide an overview of the initiatives related to digital misinformation. This is an exploratory study that covers research from 1993 to 2020, focusing on the investigation of the phenomenon of misinformation. The review consists of 788 studies from SCOPUS, IEEE, and ACM digital libraries, synthesizing the primary research…
▽ More
This paper presents a systematic literature review in Computer Science that provide an overview of the initiatives related to digital misinformation. This is an exploratory study that covers research from 1993 to 2020, focusing on the investigation of the phenomenon of misinformation. The review consists of 788 studies from SCOPUS, IEEE, and ACM digital libraries, synthesizing the primary research directions and sociotechnical challenges. These challenges are classified into Physical, Empirical, Syntactic, Semantic, Pragmatic, and Social dimensions, drawing from Organizational Semiotics. The map** identifies issues related to the concept of misinformation, highlights deficiencies in mitigation strategies, discusses challenges in approaching stakeholders, and unveils various sociotechnical aspects relevant to understanding and mitigating the harmful effects of digital misinformation. As contributions, this study present a novel categorization of mitigation strategies, a sociotechnical taxonomy for classifying types of false information and elaborate on the inter-relation of sociotechnical aspects and their impacts.
△ Less
Submitted 2 April, 2024;
originally announced June 2024.
-
Estimation of COVID-19 under-reporting in Brazilian States through SARI
Authors:
Balthazar Paixão,
Lais Baroni,
Rebecca Salles,
Luciana Escobar,
Carlos de Sousa,
Marcel Pedroso,
Raphael Saldanha,
Rafaelli Coutinho,
Fabio Porto,
Eduardo Ogasawara
Abstract:
Due to its impact, COVID-19 has been stressing the academy to search for curing, mitigating, or controlling it. However, when it comes to controlling, there are still few studies focused on under-reporting estimates. It is believed that under-reporting is a relevant factor in determining the actual mortality rate and, if not considered, can cause significant misinformation. Therefore, the objectiv…
▽ More
Due to its impact, COVID-19 has been stressing the academy to search for curing, mitigating, or controlling it. However, when it comes to controlling, there are still few studies focused on under-reporting estimates. It is believed that under-reporting is a relevant factor in determining the actual mortality rate and, if not considered, can cause significant misinformation. Therefore, the objective of this work is to estimate the under-reporting of cases and deaths of COVID-19 in Brazilian states using data from the Infogripe on notification of Severe Acute Respiratory Infection (SARI). The methodology is based on the concepts of inertia and the use of event detection techniques to study the time series of hospitalized SARI cases. The estimate of real cases of the disease, called novelty, is calculated by comparing the difference in SARI cases in 2020 (after COVID-19) with the total expected cases in recent years (2016 to 2019) derived from a seasonal exponential moving average. The results show that under-reporting rates vary significantly between states and that there are no general patterns for states in the same region in Brazil.
The published version of this paper is made available at https://doi.org/10.1007/s00354-021-00125-3.
Please cite as: B. Paixão, L. Baroni, M. Pedroso, R. Salles, L. Escobar, C. de Sousa, R. de Freitas Saldanha, J. Soares, R. Coutinho, et al., 2021, Estimation of COVID-19 Under-Reporting in the Brazilian States Through SARI, New Generation Computing
△ Less
Submitted 5 April, 2021; v1 submitted 23 June, 2020;
originally announced June 2020.
-
Netherlands Dataset: A New Public Dataset for Machine Learning in Seismic Interpretation
Authors:
Reinaldo Mozart Silva,
Lais Baroni,
Rodrigo S. Ferreira,
Daniel Civitarese,
Daniela Szwarcman,
Emilio Vital Brazil
Abstract:
Machine learning and, more specifically, deep learning algorithms have seen remarkable growth in their popularity and usefulness in the last years. This is arguably due to three main factors: powerful computers, new techniques to train deeper networks and larger datasets. Although the first two are readily available in modern computers and ML libraries, the last one remains a challenge for many do…
▽ More
Machine learning and, more specifically, deep learning algorithms have seen remarkable growth in their popularity and usefulness in the last years. This is arguably due to three main factors: powerful computers, new techniques to train deeper networks and larger datasets. Although the first two are readily available in modern computers and ML libraries, the last one remains a challenge for many domains. It is a fact that big data is a reality in almost all fields nowadays, and geosciences are not an exception. However, to achieve the success of general-purpose applications such as ImageNet - for which there are +14 million labeled images for 1000 target classes - we not only need more data, we need more high-quality labeled data. When it comes to the Oil&Gas industry, confidentiality issues hamper even more the sharing of datasets. In this work, we present the Netherlands interpretation dataset, a contribution to the development of machine learning in seismic interpretation. The Netherlands F3 dataset acquisition was carried out in the North Sea, Netherlands offshore. The data is publicly available and contains pos-stack data, 8 horizons and well logs of 4 wells. For the purposes of our machine learning tasks, the original dataset was reinterpreted, generating 9 horizons separating different seismic facies intervals. The interpreted horizons were used to generate approximatelly 190,000 labeled images for inlines and crosslines. Finally, we present two deep learning applications in which the proposed dataset was employed and produced compelling results.
△ Less
Submitted 26 March, 2019;
originally announced April 2019.
-
Penobscot Dataset: Fostering Machine Learning Development for Seismic Interpretation
Authors:
Lais Baroni,
Reinaldo Mozart Silva,
Rodrigo S. Ferreira,
Daniel Civitarese,
Daniela Szwarcman,
Emilio Vital Brazil
Abstract:
We have seen in the past years the flourishing of machine and deep learning algorithms in several applications such as image classification and segmentation, object detection and recognition, among many others. This was only possible, in part, because datasets like ImageNet -- with +14 million labeled images -- were created and made publicly available, providing researches with a common ground to…
▽ More
We have seen in the past years the flourishing of machine and deep learning algorithms in several applications such as image classification and segmentation, object detection and recognition, among many others. This was only possible, in part, because datasets like ImageNet -- with +14 million labeled images -- were created and made publicly available, providing researches with a common ground to compare their advances and extend the state-of-the-art. Although we have seen an increasing interest in machine learning in geosciences as well, we will only be able to achieve a significant impact in our community if we collaborate to build such a common basis. This is even more difficult when it comes to the Oil&Gas industry, in which confidentiality and commercial interests often hinder the sharing of datasets with others. In this letter, we present the Penobscot interpretation dataset, our contribution to the development of machine learning in geosciences, more specifically in seismic interpretation. The Penobscot 3D seismic dataset was acquired in the Scotian shelf, offshore Nova Scotia, Canada. The data is publicly available and comprises pre- and pos-stack data, 5 horizons and well logs of 2 wells. However, for the dataset to be of practical use for our tasks, we had to reinterpret the seismic, generating 7 horizons separating different seismic facies intervals. The interpreted horizons were used to generated +100,000 labeled images for inlines and crosslines. To demonstrate the utility of our dataset, results of two experiments are presented.
△ Less
Submitted 21 March, 2019;
originally announced March 2019.