
Bridging the Digital Divide: Mapping Internet Connectivity Evolution, Inequalities, and Resilience in six Brazilian Cities

Nicolò Gozzi1,2, Niccolò Comini2, Nicola Perra3,2,4
( 1 ISI Foundation, Turin, Italy
2 The World Bank Group
3 School of Mathematical Sciences, Queen Mary University of London, UK
4 The Alan Turing Institute, London, UK )
Abstract

We investigate the evolution of Internet speed and its implications for access to key digital services, as well as the resilience of the network during crises, focusing on six major Brazilian cities: Belo Horizonte, Brasília, Fortaleza, Manaus, Rio de Janeiro, and São Paulo. Leveraging a unique dataset of Internet Speedtest® results provided by Ookla®, we analyze Internet speed trends from 2017 to 2023. Our findings reveal significant improvements in Internet speed across all cities. However, we find that prosperous areas generally exhibit better Internet access, and that the dependence of Internet quality on wealth has increased over time. Additionally, we investigate the impact of Internet quality on access to critical online services, focusing on e-learning. Our analysis shows that nearly 13% of catchment areas around educational facilities have Internet speeds below the threshold required for e-learning, with less wealthy areas experiencing more significant challenges. Moreover, we investigate the network’s resilience during the COVID-19 pandemic, finding a sharp decline in network quality following the declaration of national emergency. We also find that less wealthy areas experience larger drops in network quality during crises. Overall, this study underscores the importance of addressing disparities in Internet access to ensure equitable access to digital services and enhance network resilience during crises.

1 Introduction

The widespread availability of Internet connectivity has transformed several aspects of our lives. From communication and commerce to education and entertainment, access to reliable, fast, and affordable Internet connectivity has become a critical factor for promoting economic and social development [1, 2, 3]. Despite the large overall improvements in technology and adoption witnessed over the last decades, we still observe huge gaps in access to digital services and varying levels of digital literacy. The COVID-19 pandemic has shown the impact of this digital divide and highlighted the importance of addressing it. Indeed, during the acute phases of the crisis, as numerous activities rapidly migrated online, unequal access to a reliable Internet connection limited the possibility of carrying out activities from home, increasing the potential exposure to the virus for the unconnected [4, 5, 6, 7]. Particularly clear is the negative impact of Internet connectivity disparities on educational achievement, access to tele-medicine, and adoption of remote working [8, 9, 10, 11, 12, 13, 14, 15, 16].

In this context, we aim to investigate how Internet connectivity has evolved over the past years across regions and socio-economic strata, its impact on the access to key services, and its resilience to extraordinary events such as the COVID-19 pandemic. As a case study, we consider six major Brazilian cities: Belo Horizonte, Brasília, Fortaleza, Manaus, Rio de Janeiro, and São Paulo. Brazil reports one of the highest Gini indices in the world [17], and inequality has been one of the main issues affecting its socio-economic development for decades. On the other hand, Brazil can compete with the most advanced areas in the world when it comes to digital capabilities. Indeed, it hosts cloud services of some of the most important providers, and it is home to several high-tech startups. However, the inequality observed in the socio-economic dimension is also reflected in the digital sector. For instance, while the overall average number of fixed broadband accesses per 100 inhabitants is 24 [18], there is significant heterogeneity among states. Some Brazilian states such as Santa Catarina (36.15) outperform OECD countries (e.g., Italy, 32.1), while others such as Acre (13.8), Amazonas (13.8), and Maranhão (9.9) report remarkably lower figures. Additionally, access to and usage of digital tools are far from inclusive, and several areas, even within wealthier states, face dramatic digital inequality. For these reasons, Brazil is a representative example of the complex socio-economic dynamics and challenges that the public and private sectors face in addressing the digital gap.

To quantify Internet quality and its evolution in these cities, we leverage a unique dataset provided by Ookla consisting of nearly 100M geolocalized Speedtest results, collected in the time window spanning from 2017 to 2023. We split the analysis into two parts. In the first, we focus on characterising the spatio-temporal evolution of Internet connectivity by exploring differences across socio-economic indicators. In the second, we study Internet connectivity indicators in the catchment areas of educational facilities and quantify the resilience of the digital infrastructure during the COVID-19 pandemic.

We find significant improvements in Internet quality across all cities considered between 2017 and 2023. Interestingly, we observe a trend towards a more homogeneous distribution of Internet speed, indicating reduced dispersion over the years. However, despite this increased homogeneity, we find an increasing correlation between Internet speed and wealth, with wealthier areas experiencing better Internet access and with this gap widening over time. Furthermore, we also find a noticeable increase in spatial autocorrelation of Internet quality over the years, with the emergence of clusters characterized by high and low speeds.

Furthermore, our analysis reveals that approximately 13% of catchment areas around education facilities, where 8% of the school-age population resides, experience Internet speeds insufficient for accessing key digital services such as e-learning. Additionally, these areas tend to exhibit lower wealth, suggesting a compounding effect of inequality.

Finally, we assess the impact of the stress placed on the network following the declaration of the COVID-19 national emergency in Brazil. We find that, on average, this caused a -20% change in download speed across all cities, with values ranging from -7% in Brasília to almost -30% in Manaus. Our findings indicate that this impact was more pronounced in less wealthy areas than in wealthier ones.

Overall, this study shows that while Internet quality has progressed overall, disparities persist, with socio-economic factors playing a significant role. Addressing these disparities is crucial to ensure equitable access to digital services and to enhance network resilience in times of crisis. This study demonstrates that despite the resources allocated by the public and private sectors to the strengthening of the Brazilian digital infrastructure, investments are still needed, particularly in the less affluent areas.

2 Results

2.1 Internet speed evolution analysis

As a first step, our research aims to analyze the evolution of Internet quality, specifically measured by fixed download speed, across six major Brazilian cities: Belo Horizonte, Brasília, Fortaleza, Manaus, Rio de Janeiro, and São Paulo. To accomplish this, we leverage a unique dataset consisting of ~100M Internet Speedtest results provided by Ookla. The data cover the period between 2017 and 2023. Furthermore, they are geolocalized and provide the download/upload speed (in Megabits per second) and latency (in milliseconds) for fixed networks. In the Supplementary Information we show results for mobile networks, which we also discuss below.

It is important to highlight from the start that the data serve only as a proxy of Internet quality. Indeed, due to the details of the software/tool used to make a measurement, possible bottlenecks in home networks (e.g., routers), the number of devices connected to a specific network, and selection biases (e.g., tests might be run when users are experiencing connectivity issues, when users need to connect in a new location, and/or by more digitally aware users), the outcome of tests might differ from the real Internet speed [19, 7]. Nevertheless, Ookla is the canonical network performance testing service. It is widely used by academic and official institutions to infer the features of Internet connectivity across and within regions [20, 21, 19]. Furthermore, as described below, our analysis aggregates Speedtest results within specific geographical cells, thus averaging over many measurements. This reduces the possible impact of the more technical issues mentioned above.

To ensure uniform spatial coverage, we partition the geographical area of each city into hexagonal cells, creating a regular grid (see Fig. 6A). Then, we calculate the Internet speed within each of these units as a function of time. This approach allows us to explore different resolutions and finer scales than administrative partitions. We also compute a proxy measure for wealth in each of these units using the Relative Wealth Index (RWI) provided by Meta [22]. For a more detailed description of our methodology, please refer to Section 4.

Figure 1 shows the evolution of Internet speed from 2017 to 2023 in the six cities. Across the board, our analysis reveals a significant improvement in Internet speed throughout all cities over the past six years. Specifically, Belo Horizonte exhibits the highest median download speed (176 Mbps) in 2023, followed by São Paulo (146 Mbps), Manaus (116 Mbps), Rio de Janeiro (114 Mbps), Fortaleza (111 Mbps), and Brasília (105 Mbps). On the other hand, Manaus experienced the highest growth during the period, marking a +1200% increase, followed by Belo Horizonte (+1012%), Rio de Janeiro (+719%), Fortaleza (+685%), Brasília (+677%), and São Paulo (+463%). Furthermore, in the same plot we show the coefficient of variation (CV) of the distribution of Internet speed within each city across the years. The coefficient of variation is a measure of dispersion defined as the ratio between the standard deviation and the average of a statistical distribution. Our findings indicate a decreasing trend in the coefficient of variation across the six cities, suggesting a persistent trend towards a more homogeneous distribution of Internet speed as it improved. However, we acknowledge differences among the cities examined. Brasília exhibits the highest dispersion in Internet speed distribution in 2023 ($CV = 0.80$), while Fortaleza shows the least dispersed distribution ($CV = 0.28$). More quantitatively, in 2023, the ratio between the 3rd and 1st quartiles of Internet speed is 5.8 in Brasília, whereas it is only 1.4 in Fortaleza.
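As an illustration of the two dispersion measures just described, the following minimal sketch computes the coefficient of variation and the ratio between the 3rd and 1st quartiles for a list of per-cell speeds. The toy values are hypothetical, and the choice of the population standard deviation and of the "inclusive" quantile method are assumptions, since the text does not specify them.

```python
import statistics


def coefficient_of_variation(speeds):
    """Ratio between the standard deviation and the mean of the speeds."""
    mean = statistics.fmean(speeds)
    # Population standard deviation: an assumption, the text does not say
    # whether the sample or the population version is used.
    return statistics.pstdev(speeds) / mean


def quartile_ratio(speeds):
    """Ratio between the 3rd and the 1st quartile of the speeds."""
    q1, _, q3 = statistics.quantiles(speeds, n=4, method="inclusive")
    return q3 / q1


# Hypothetical per-cell download speeds (Mbps) for two toy cities.
homogeneous = [100, 110, 120, 130, 140]
dispersed = [20, 40, 120, 200, 400]

for name, speeds in [("homogeneous", homogeneous), ("dispersed", dispersed)]:
    cv = coefficient_of_variation(speeds)
    ratio = quartile_ratio(speeds)
    print(f"{name}: CV={cv:.2f}, Q3/Q1={ratio:.2f}")
```

Both measures grow with the spread of the distribution, but the quartile ratio ignores the tails, which makes it less sensitive to a few extreme cells.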

It is important to highlight that, despite a general trend towards more homogeneous Internet speeds, the data still reveal persistent and even increasing disparities across socio-economic strata. Figure 2 shows the logarithm of the ratio between the average Internet speed measured in cells with wealth higher than the 75th quantile and those with wealth lower than the 25th quantile. This metric is meant to compare and highlight the differences between the wealthiest and the poorest units. A value close to zero indicates similar Internet quality for both wealthy and less wealthy areas, while positive (negative) values denote better Internet quality for the more wealthy (less wealthy). As detailed in Section 4, the wealth of each unit is calculated using the Relative Wealth Index (RWI) provided by Meta [22].
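The metric described above can be sketched as follows. This is only an illustrative implementation: the input format (a list of hypothetical `(rwi, speed)` pairs, one per cell), the natural logarithm, and the "inclusive" quantile method are assumptions not stated in the text.

```python
import math
import statistics


def wealth_gap_log_ratio(cells):
    """Log of the mean speed in the wealthiest cells over that in the poorest.

    `cells` is a list of (rwi, speed_mbps) pairs, one per spatial unit.
    Wealthiest cells have RWI above the 75th quantile, poorest cells have
    RWI below the 25th quantile.
    """
    rwi_values = [rwi for rwi, _ in cells]
    q25, _, q75 = statistics.quantiles(rwi_values, n=4, method="inclusive")
    rich = [speed for rwi, speed in cells if rwi > q75]
    poor = [speed for rwi, speed in cells if rwi < q25]
    # Natural logarithm (assumption): zero means equal average speeds.
    return math.log(statistics.fmean(rich) / statistics.fmean(poor))


# Hypothetical cells: speed grows with wealth, so the metric is positive.
toy_cells = [(-1.0, 50), (-0.5, 60), (0.0, 80), (0.5, 100), (1.0, 120)]
print(round(wealth_gap_log_ratio(toy_cells), 3))
```

Taking the logarithm makes the metric symmetric: a twofold advantage for the wealthiest cells and a twofold advantage for the poorest ones give values of equal magnitude and opposite sign.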

Across years and cities, our analysis reveals a consistent trend: wealthy areas generally experience better Internet quality than less wealthy areas. Moreover, we note an increasing disparity in Internet quality between more and less wealthy areas over the years. In Manaus and São Paulo, cells characterized by higher RWI feature better Internet quality across the whole time horizon under study. This trend is also observed in Brasília, with the exception of 2017. In Rio de Janeiro, instead, Internet quality in wealthy cells was better than in less wealthy cells only in the last two years, though the negative values observed in earlier years are close to zero. Finally, in Belo Horizonte and Fortaleza, the values are overall smaller than in the other cities, though positive in the last years. The association between Internet quality and RWI is supported by the Pearson correlation coefficient between Internet speed and RWI, shown in Figure 2. The coefficient has increased across all cities in recent years, with all cities showing a positive correlation as of 2023, which is significant at the 5% level with the exception of Belo Horizonte and Fortaleza.

In the Supplementary Information we repeat the analyses presented in Figure 1 and Figure 2 for mobile networks. Also in that case, we find a significant overall improvement of mobile network speed over the period considered. Interestingly, we find that, while mobile speed and wealth are also positively correlated, the correlation decreases over time, in contrast with the finding for fixed networks.

In the case of Rio de Janeiro, we extend our analysis to include tests conducted both inside and outside favelas. The results of this analysis are presented in the Supplementary Information. Favelas are informal, densely populated urban settlements in Brazil, typically characterized by substandard housing and a lack of basic services, arising from socio-economic disparities and rapid urbanization. Not surprisingly, we find that tests performed within a favela generally exhibit lower Internet speeds. Additionally, this disparity has increased over the years. In 2017, the median speed of tests conducted inside and outside favelas was 13.7 Mbps and 14.3 Mbps, respectively, a 4% difference relative to the speed outside favelas. By 2023, these speeds had changed to 40.1 Mbps and 94.2 Mbps, respectively, a 57.4% difference.

Figure 1: Evolution of Internet speed, expressed in download speed (Mbps), across the six cities considered. Boxplots show the distribution of Internet speed within each hexagonal unit in each city. The orange line indicates the evolution of the coefficient of variation of the Internet speed distribution over the years.
Figure 2: Logarithm of the ratio between Internet speeds measured in spatial units with wealth higher than the 75th quantile and those with wealth lower than the 25th quantile. The orange line represents the Pearson correlation coefficient between Internet speed and RWI of different spatial units. Circles indicate where the coefficient is significant at the 5% level.

To investigate whether Internet speed has become more spatially autocorrelated over time, we calculate Moran’s $I$ statistic for download speed in each hexagonal unit across the cities for each year within the study period [23]. Moran’s $I$ quantifies the degree of spatial autocorrelation of a quantity, indicating the extent to which similar values cluster or disperse across geographical units. In more detail, a positive (negative) Moran’s $I$ indicates spatial autocorrelation (dispersion) in the dataset, meaning that similar (dissimilar) values tend to cluster together in space. Our analysis reveals the emergence of spatial clusters characterized by high or low Internet speed. This finding is exemplified in Figure 3A, where we present the results for Rio de Janeiro in 2017, 2020, and 2023 (in the Supplementary Information we show results for the other cities as well). The global Moran’s $I$ exhibits a notable increase from approximately 0 in 2017 to 0.17 in 2020 and further to 0.40 in 2023. Visual inspection of the maps also reveals the emergence of spatial clusters of high and low Internet speed over the years. Specifically, the maps indicate units where the local Moran’s $I$ statistic, which measures the spatial clustering pattern of individual observations, is significant at the 5% level, with units colored to denote low-speed (red) or high-speed (blue) clusters. Furthermore, we analyze the evolution of the global Moran’s $I$ across the cities over the six-year period (Figure 3B). The findings observed in the case of Rio de Janeiro are consistent across cities, with the statistic generally increasing over the years. In more detail, we observe that in all cities, with the exception of Belo Horizonte, the last two years show the highest values of Moran’s $I$. We also note that in Brasília, Fortaleza, and São Paulo, the global Moran’s $I$ measured on data collected in 2023 is smaller than in the previous year. A decrease in the last year is also observed, though to a lesser extent, in Manaus and in Belo Horizonte, though in the latter the value obtained is not significant at the 5% level. In the Supplementary Information we repeat this analysis for mobile networks. Also in that case we find positive and significant spatial autocorrelation of mobile Internet speed, even though the temporal trend is less clear.
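For readers unfamiliar with the statistic, the global Moran's I for values $x_i$ with spatial weights $w_{ij}$ is $I = \frac{N}{\sum_{ij} w_{ij}} \frac{\sum_{ij} w_{ij}(x_i - \bar{x})(x_j - \bar{x})}{\sum_i (x_i - \bar{x})^2}$. The sketch below is a minimal implementation on a toy chain of six cells. The binary adjacency weights and the toy values are assumptions made for illustration (the weighting scheme used in the analysis is not stated here), and the significance test is omitted.

```python
def global_morans_i(values, neighbors):
    """Global Moran's I with binary spatial weights.

    values: list of observations x_i, one per spatial unit.
    neighbors: dict mapping unit index i to the list of indices j
        for which w_ij = 1 (all other weights are 0).
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    # Sum over neighbor pairs of w_ij * (x_i - mean) * (x_j - mean).
    cross = sum(dev[i] * dev[j] for i, js in neighbors.items() for j in js)
    total_weight = sum(len(js) for js in neighbors.values())
    squared_dev = sum(d * d for d in dev)
    return (n / total_weight) * cross / squared_dev


# Toy geography: six cells in a row, each adjacent to its immediate neighbors.
chain = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4]}

clustered = [10, 10, 10, 100, 100, 100]    # low speeds next to low speeds
alternating = [10, 100, 10, 100, 10, 100]  # low speeds next to high speeds

print(global_morans_i(clustered, chain))    # positive: spatial clustering
print(global_morans_i(alternating, chain))  # negative: spatial dispersion
```

On a real hexagonal grid, `neighbors` would list the cells sharing an edge with each cell; significance is usually assessed by recomputing the statistic under random permutations of the values.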

Figure 3: Spatial Clustering of Internet Speed. A) Distribution of spatial units with significant local Moran’s $I$ in Rio de Janeiro in 2017, 2020, and 2023. Clusters of low (high) Internet speed are shown in red (blue). B) Evolution of global Moran’s $I$ in each city between 2017 and 2023. Circles indicate where the statistic is significant at the 5% level.

2.2 Access to e-learning

In the second part of our analysis we investigate whether the disparities in Internet quality highlighted in the previous section may impact access to key services. While acknowledging the diversity and variety of these, in the following we use e-learning as a concrete and arguably important example. Indeed, as mentioned in the Introduction, extant research has highlighted the positive relationship between Internet quality and educational attainment [9]. Our approach is as follows. First, we gather the locations of educational facilities across the six cities under consideration using data from OpenStreetMap [24]. Next, we compute a Voronoi tessellation for each city, using the positions of educational facilities as generating points. This process allows us to obtain the catchment area of each educational facility. By construction, a catchment area contains the locations for which that facility is the closest educational entity. Subsequently, we compute the Internet speed within each catchment area by aggregating the download speeds of all tests performed within it. For this analysis, we consider only the most recent data, from 2023, in order to accurately characterize the current disparities in access to essential services. Additionally, we calculate the RWI for each area. Further details on our methodology are available in Section 4. In Figure 4, we present the distribution of fixed download speeds across all catchment areas in the six cities. We also highlight a threshold of 80 Mbps (i.e., 10 megabytes per second) as the minimum speed required to access e-learning services [25]. Remarkably, across all cities we find that nearly 13% of catchment areas have speeds below this threshold, affecting approximately 8% of the school-age population. This implies that slightly fewer than one in every ten school-age children may encounter challenges in accessing e-learning services. (We note that e-learning is a general term referring to both synchronous and asynchronous learning activities. These span from access to dedicated platforms to the ability to explore broader online resources for homework.) Nonetheless, we observe significant variability across cities. In Belo Horizonte, none of the catchment areas exhibit an Internet speed below the 80 Mbps threshold. Following closely is Brasília, with only 4.8% falling below, then São Paulo (6.8%), Fortaleza (7.4%), and Manaus (8.3%). In stark contrast, nearly 24% of catchment areas in Rio de Janeiro fall below this threshold, with approximately 26% of the school-age population residing in these areas.
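The catchment-area construction can be illustrated with a short sketch. Assigning each test to its nearest facility is equivalent to locating it in that facility's Voronoi cell, so no explicit tessellation is needed here. Everything below is hypothetical and simplified: coordinates are treated as planar (real data would need a suitable map projection), speeds are aggregated with a plain median, and the facility and test values are invented for illustration.

```python
import math
import statistics
from collections import defaultdict

THRESHOLD_MBPS = 80  # e-learning threshold quoted in the text


def nearest_facility(point, facilities):
    """Index of the facility closest to `point` (planar distance).

    The set of points sharing the same nearest facility is exactly the
    Voronoi cell, i.e. the catchment area, of that facility.
    """
    return min(range(len(facilities)),
               key=lambda k: math.dist(point, facilities[k]))


def catchment_speeds(tests, facilities):
    """Median download speed of the tests falling in each catchment area."""
    by_area = defaultdict(list)
    for location, speed in tests:
        by_area[nearest_facility(location, facilities)].append(speed)
    return {area: statistics.median(speeds) for area, speeds in by_area.items()}


def share_below_threshold(area_speeds, threshold=THRESHOLD_MBPS):
    """Fraction of catchment areas whose speed is below the threshold."""
    below = [area for area, speed in area_speeds.items() if speed < threshold]
    return len(below) / len(area_speeds)


# Hypothetical facilities and (location, download speed in Mbps) tests.
facilities = [(0, 0), (10, 0)]
tests = [((1, 1), 50), ((2, 0), 70), ((-1, 0), 90),
         ((9, 1), 120), ((11, 0), 150)]

speeds = catchment_speeds(tests, facilities)
print(speeds, share_below_threshold(speeds))
```

Weighting each area by its school-age population, rather than counting areas, would give the share of children affected instead of the share of catchment areas.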

Additionally, in the inset of each plot in Figure 4, we display the RWI distribution of catchment areas below and above the 80 Mbps threshold. Across all cities, our analysis indicates that, on average, catchment areas below the threshold are 15% less wealthy than areas above the threshold. We assess the differences in RWI distribution between the two cases using a t-test, finding a significant difference in the case of Rio de Janeiro, São Paulo, and Manaus (significance level 5%). This observation points to a compounding effect of inequality. Indeed, students facing higher challenges in accessing key digital services such as e-learning may already be foreclosed from other opportunities due to their socio-economic disadvantage.

Figure 4: Internet Speed in Catchment Areas of Education Facilities. Distribution of Internet speed, measured as download speed (Mbps), in catchment areas of education facilities across all cities (2023). The portion of the distribution where speed is lower than 80 Mbps is colored in red. In the inset of each plot, the RWI distribution of catchment areas of education facilities featuring a download speed above and below 80 Mbps is shown.

2.3 Network resilience during crises

Finally, we aim to investigate the resilience of the network to external shocks and the potential heterogeneous impacts of such events. As a case study, we consider the COVID-19 pandemic. With infections and deaths surging worldwide and restrictions being imposed, the world moved online to maintain essential activities. Arguably, such an unprecedented surge in demand may have affected network quality. In Figure 5A, we present the median daily download speeds in the six cities between March and June 2020. Additionally, we mark the date when Brazil declared a national emergency with a vertical dashed line, and we show the increase in the percentage of individuals staying at home, measured using data from the COVID-19 Community Mobility Reports published by Google [26]. Across all cities, we observe a sharp decline in network quality, as measured by download speed, following the declaration of the national emergency. Concurrently, the fraction of the population staying at home increased. After the initial drop, we observe a gradual recovery, with download speeds approaching pre-emergency levels by June 2020. We quantify the drop by comparing the median download speed computed over the periods March 1st to March 20th and March 20th to April 1st. Among the cities considered, Manaus experienced the largest drop (-29%), while Brasília showed the smallest (-7%). The other cities experienced intermediate declines: Rio de Janeiro (-25%), Fortaleza (-21%), São Paulo (-19%), and Belo Horizonte (-16%).
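The drop between the two periods can be computed as in the following sketch. The period boundaries are those quoted in the text, while the input format (a hypothetical dictionary of daily median speeds), the use of the median of daily values within each period, and the toy numbers are assumptions made for illustration.

```python
import statistics
from datetime import date


def median_speed(daily, start, end):
    """Median of the daily values for days with start <= day < end."""
    return statistics.median(v for day, v in daily.items() if start <= day < end)


def relative_drop(daily, before_start, emergency, after_end):
    """Percentage change of the median speed after the emergency date."""
    before = median_speed(daily, before_start, emergency)
    after = median_speed(daily, emergency, after_end)
    return 100 * (after - before) / before


# Hypothetical daily median download speeds (Mbps) for March 2020:
# 50 Mbps before March 20th, 40 Mbps from March 20th onwards.
daily = {date(2020, 3, d): (50.0 if d < 20 else 40.0) for d in range(1, 32)}

drop = relative_drop(daily, date(2020, 3, 1), date(2020, 3, 20), date(2020, 4, 1))
print(f"{drop:.0f}%")
```

Restricting `daily` to tests from cells in the top or bottom RWI quartile would give the wealth-stratified version of the same quantity.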

Furthermore, in Figure 5B, we illustrate these drops for the top and bottom quartiles of the RWI. We observe that, with the exception of Brasília, more wealthy areas experienced smaller drops compared to less wealthy areas. When combined with the previous findings, this suggests that besides experiencing slower Internet speeds, less wealthy areas may also face more significant drawbacks during extraordinary stress on the network.

In the Supplementary Information we repeat this analysis for mobile networks. Also in that case we find that mobile Internet speed was significantly affected by the stress put on the network following the national emergency declaration, even though we do not observe a clear divide in the drops experienced by more and less wealthy areas.

Figure 5: Network Resilience During the COVID-19 Pandemic. A) Daily median download speed in the six cities between March and June 2020. The vertical dashed line indicates when Brazil declared the national emergency. The percentage change in individuals staying at home as measured via Google Community Mobility Reports is also shown. B) Drop in Internet speed following the national emergency declaration in the top and bottom quartiles of the RWI in each city.

3 Discussion

In this study, we analysed the spatio-temporal evolution of Internet speed in six Brazilian cities between 2017 and 2023. Our analysis revealed a significant increase in Internet speed across all cities, along with a trend towards a more uniform distribution. However, we also identified the emergence of spatial clusters characterized by high/low Internet speed. Furthermore, we found an increasing correlation between Internet speed and measures of wealth, indicating that wealthier areas tended to experience higher Internet speeds over time. This pattern of inequality also emerged from the analysis of the favelas in Rio de Janeiro, which revealed an increasing Internet speed gap with the rest of the city.

To further characterize the impact of such disparities, we considered two case studies. In the first one, we focused on the Internet speed in catchment areas around educational facilities in each city. Notably, we found that, as of 2023, approximately 13% of these areas may have encountered challenges in accessing key digital services such as e-learning. We observed significant variations among cities, with Rio de Janeiro reaching a peak of 24% of these areas falling below the threshold for e-learning. Additionally, we showed that these areas tend to be less wealthy, suggesting a potential compounding effect of inequality, where regions already facing limited access to opportunities may also encounter challenges in digital access.

In our second case study, we examined the unprecedented stress placed on the network due to the shift online driven by the COVID-19 pandemic. Our analysis showed a significant decrease in Internet speed across all cities following the declaration of national emergency. Moreover, we found that less wealthy areas generally experienced more pronounced declines in Internet connectivity during the early weeks of the COVID-19 crisis. This result is even more concerning when combined with findings from a recent study showing that access to fast Internet is an effective means of limiting exposure to infections during exogenous shocks such as the pandemic [7].

Overall, these findings are also confirmed in the case of mobile networks, whose related analyses are presented in the Supplementary Information. Interestingly, however, for mobile networks we observe that the correlation between wealth and speed gradually declined over time. This phenomenon could be attributed to the higher demand in economically disadvantaged regions for more affordable connectivity options, such as mobile connections. Consequently, the evolution of mobile Internet may have diverged from that of fixed Internet due to distinct demands and consumer segmentation.

The present study comes with limitations. First, we used data from Internet Speedtest results provided by Ookla, which is only a proxy for Internet speed. As discussed, due to several factors, the outcome of tests might differ from the real Internet speed. Nevertheless, Ookla is widely used by academic and official institutions to measure Internet connectivity. Additionally, our methodology aims at attenuating some of the possible issues deriving from the heterogeneous use of this service, as detailed in Sec. 4 and in Ref. [7]. Second, we use only proxy data to measure wealth. Indeed, we consider the Relative Wealth Index published by Meta [22] to characterize wealth at the desired spatial granularity; nonetheless, such data come with inherent limitations, as is the case with all proxy measures. Lastly, Internet speed and wealth are linked by a feedback loop that we do not fully characterize due to data availability. As a result, our study mostly focuses on associations over time and space rather than on causation or on comprehensive explanations of the current landscape.

Since 2020, about 28 billion USD have been invested in the telecom sector in Brazil [27]. Despite the significant amount of resources, the underlying efforts were not enough to provide a level playing field for all Internet users. Indeed, this study has shown that the poorest segments of the population still experience slower Internet connectivity than the wealthiest, and that this gap may widen in case of exogenous shocks. Such disparity can have a significant impact on the socio-economic development of the country, and solving it requires joint work by policy makers and the private sector. Specific policies at the local level should be promoted to improve connectivity in the poorest areas of towns, favoring the penetration of Fiber to the Home (FTTH) technology, the affordability of high-speed Internet packages and devices, and the development of specific digital skills through dedicated training and awareness programs. All these measures will support more equal access to the Internet, ensuring that all individuals have access to a fast, affordable, and reliable Internet connection.

4 Materials and methods

4.1 Measuring Internet speed

We characterize Internet quality using as a proxy Speedtest Intelligence® data by Ookla [28]. Speedtest apps offer free analyses of Internet performance metrics. The tests are geolocalized and provide download/upload speed (expressed in Megabits per second). Here, following a common practice, we consider download speed as a metric to assess the quality of Internet. The dataset includes nearly 100M tests performed between 2017 and 2023, divided as follows: 47.5M in São Paulo, 24.1M in Rio de Janeiro, 8.1M in Belo Horizonte, 6.5M in Brasília, 6.1M in Fortaleza, and 4.9M in Manaus.

We preprocess the data by excluding all tests displaying a download speed of 0 Mbps, as these typically represent failed tests and do not provide informative insights into the actual network quality. Additionally, to limit the impact of outliers, we filter out tests with a download speed > 2 Gigabits per second, as this threshold is regarded as the maximum value for broadband technology. After preprocessing, we compute Internet speed following a procedure similar to the one presented in Ref. [7].

To compute Internet speed in a geographical area $g$ over a timeframe $(t_1, t_2)$, we gather all tests conducted within that area during that period. Then, we calculate the median of the results obtained from tests conducted by individual users. In other words, for each user $u$, we compute the associated download speed as follows:

\[ Mbps^{u,g}_{(t_1,t_2)} = med_i\left(Mbps^{u,g}_{i,(t_1,t_2)}\right) \]

This step is taken to prevent bias caused by users who utilize the service more frequently than others. Finally, the download speed associated with area $g$ in timeframe $(t_1, t_2)$ is calculated as the median download speed across all users:

\[ Mbps^{g}_{(t_1,t_2)} = med_u\left(Mbps^{u,g}_{(t_1,t_2)}\right) \]
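The preprocessing and the two-step median can be sketched as follows. This is a minimal illustration, not the code used in the study: the function name `area_speed`, the `(user_id, download_mbps)` input layout, and the sample values are assumptions made for the example.

```python
from collections import defaultdict
from statistics import median

MAX_MBPS = 2000.0  # 2 Gbps cutoff described in the text


def area_speed(tests):
    """Median-of-medians download speed for one area and one timeframe.

    tests: iterable of (user_id, download_mbps) pairs (hypothetical layout).
    """
    per_user = defaultdict(list)
    for user_id, mbps in tests:
        # Drop failed tests (0 Mbps) and outliers above 2 Gbps.
        if 0 < mbps <= MAX_MBPS:
            per_user[user_id].append(mbps)
    if not per_user:
        return None  # no valid test in this area and timeframe
    # Step 1: one median per user, so frequent testers count once.
    user_medians = [median(speeds) for speeds in per_user.values()]
    # Step 2: median across users.
    return median(user_medians)


# Illustrative values only.
tests = [
    ("a", 10.0), ("a", 12.0), ("a", 11.0), ("a", 0.0),  # frequent user, one failed test
    ("b", 50.0),
    ("c", 100.0), ("c", 2500.0),                        # 2500 Mbps is filtered out
]
print(area_speed(tests))  # 50.0
```

Without the per-user step, the three valid tests of user "a" would pull the pooled median down to 12 Mbps, which is the bias the procedure is meant to avoid.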

4.2 Measuring wealth

We assess the socio-economic status of different geographical regions using the Relative Wealth Index (RWI) from Meta’s Data for Good Program [22]. This index, made publicly available in 2021, offers micro-estimates of the relative standard of living within countries. It is built considering non-traditional data sources such as satellite imagery and privacy-preserving Facebook connectivity data, and it is validated by Meta through ground truth measurements obtained from the Demographic and Health Surveys. The RWI covers approximately 93 low and middle-income countries globally, providing data at a high spatial resolution ($2.4\,km^2$ micro-regions). In this study, we aggregate the RWI at the desired geographical resolution by computing the average RWI of all micro-regions contained in the considered geography.

4.3 Hexagonal grid

We partition the area of each city considered into a hexagonal grid. This allows us to obtain a regular, uniform spatial grid. We consider a resolution such that hexagonal units have an area of approximately $5.2\,km^2$. Figure 6A illustrates the resulting hexagonal grid for Manaus.
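To give a sense of scale, the size of a unit with this area can be worked out from the geometry of a regular hexagon, whose area is $(3\sqrt{3}/2)\,s^2$ for side length $s$. The sketch below is only an illustration under the assumption of planar coordinates in kilometres; the grid system actually used is not specified here, and the helper names are hypothetical.

```python
import math


def hexagon_side_km(area_km2):
    """Side length of a regular hexagon with the given area."""
    # area = (3 * sqrt(3) / 2) * side**2, solved for side
    return math.sqrt(2.0 * area_km2 / (3.0 * math.sqrt(3.0)))


def hexagon_vertices(cx, cy, side):
    """Six vertices of a regular hexagon centred at (cx, cy)."""
    return [
        (cx + side * math.cos(math.radians(60 * k)),
         cy + side * math.sin(math.radians(60 * k)))
        for k in range(6)
    ]


def polygon_area(vertices):
    """Shoelace formula, used here only as a sanity check."""
    total = 0.0
    for (x1, y1), (x2, y2) in zip(vertices, vertices[1:] + vertices[:1]):
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


side = hexagon_side_km(5.2)
print(round(side, 2))  # 1.41
print(round(polygon_area(hexagon_vertices(0.0, 0.0, side)), 2))  # 5.2
```

A unit of about 5.2 km² therefore has sides of roughly 1.4 km and spans about 2.8 km between opposite vertices.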

4.4 Voronoi tessellation

We collect the location data of educational facilities within the six cities under investigation from OpenStreetMap [24]. We include all entities categorized with the tag “amenity=school”. Each facility is then condensed to its centroid, so that all facilities are represented by a unique set of coordinates. To prevent excessive fragmentation, we merge facilities located within a 1 km radius of each other. Subsequently, we employ Voronoi tessellation on the resulting centroids. This process generates a Voronoi cell for each centroid, including all points on the plane that are closer to that seed point than to any other. This approach allows us to define the catchment areas of each educational facility. Figure 6B illustrates the location of education facilities and the resulting Voronoi tessellation for Rio de Janeiro.
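The two steps above, merging nearby facilities and assigning locations to catchment areas, can be sketched as follows. This is a minimal illustration under stated assumptions: coordinates are planar and expressed in kilometres, and the greedy merging rule is one possible choice, since the exact merging procedure is not detailed here. The function names are hypothetical. The second function relies only on the definition of a Voronoi cell: a point belongs to the cell of its nearest seed.

```python
import math


def merge_close(points, radius_km=1.0):
    """Greedy merge: a point joins the first cluster whose current
    centroid lies within radius_km; otherwise it starts a new cluster.
    Returns the centroid of each cluster."""
    clusters = []  # each cluster is a list of (x, y) points
    for p in points:
        for members in clusters:
            cx = sum(m[0] for m in members) / len(members)
            cy = sum(m[1] for m in members) / len(members)
            if math.dist(p, (cx, cy)) <= radius_km:
                members.append(p)
                break
        else:
            clusters.append([p])
    return [
        (sum(m[0] for m in c) / len(c), sum(m[1] for m in c) / len(c))
        for c in clusters
    ]


def voronoi_cell(point, seeds):
    """Index of the nearest seed, i.e. the Voronoi cell containing point."""
    return min(range(len(seeds)), key=lambda i: math.dist(point, seeds[i]))


# Illustrative coordinates only (km).
schools = [(0.0, 0.0), (0.4, 0.0), (5.0, 5.0), (10.0, 0.0)]
seeds = merge_close(schools)
print(len(seeds))                        # 3: the first two schools are merged
print(voronoi_cell((4.0, 4.0), seeds))   # 1: closest to the seed at (5, 5)
```

For real longitude/latitude data, the coordinates would first need to be projected, or a geodesic distance used in place of `math.dist`.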

Figure 6: A) Boundaries of Manaus and obtained hexagonal grid. B) Location of education facilities and obtained Voronoi tessellation for Rio de Janeiro.

Acknowledgements

This report was supported by the Digital Development Partnership, which aims to advance digital transformation in low and middle-income countries by building strong digital foundations and accelerators, facilitating digital use cases for the digital economy to thrive. All authors thank Ookla, The World Bank and the Development Data Partnership. All authors thank James Carroll, Katherine Macdonald, and Luciano Charlita De Freitas for their support and review.

5 Supporting Information

5.1 Internet speed in the favelas of Rio de Janeiro

In the case of Rio de Janeiro, we extend our analysis to include tests conducted both inside and outside favelas. Favelas are informal, densely populated urban settlements in Brazil, typically characterized by substandard housing and a lack of basic services, arising from socio-economic disparities and rapid urbanization. For this analysis, we considered all tests performed inside and outside favelas without prior aggregation into hexagonal units. The shapefile containing the boundaries of favelas in Rio de Janeiro is sourced from Ref. [29].

Figure 7 shows the evolution of Internet speeds for tests performed inside and outside favelas. Unsurprisingly, tests conducted within favelas generally exhibit lower Internet speeds. Additionally, this disparity has increased over the years. In 2017, the median speeds for tests conducted inside and outside favelas were 13.7 Mbps and 14.3 Mbps, respectively, reflecting a 4% difference. By 2023, these speeds had changed to 40.1 Mbps and 94.2 Mbps, respectively, resulting in a 57.4% difference.
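The reported percentages are consistent with taking, in each year, the gap between the two medians relative to the higher one. This normalization is our reading of the numbers rather than a stated definition:

```latex
\[
\frac{14.3 - 13.7}{14.3} \approx 4.2\%,
\qquad
\frac{94.2 - 40.1}{94.2} \approx 57.4\%.
\]
```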

Figure 7: Evolution of fixed Internet speed, expressed in download speed (Mbps), inside and outside the favelas of Rio de Janeiro. Boxplots show the Internet speed of each speed test performed within and outside favelas.

5.2 Mobile Internet

We repeat here the analysis presented in the main text considering mobile Internet speed instead of fixed.

Figure 8 shows the evolution of mobile Internet speed from 2017 to 2023 in the six cities. Across the board, our analysis reveals a significant improvement in mobile Internet speed as well, throughout all cities over the past six years. Specifically, Rio de Janeiro exhibits the highest median mobile download speed (45 Mbps) in 2023, followed by Brasília (40 Mbps), São Paulo and Belo Horizonte (35 Mbps), Fortaleza (31 Mbps), and finally Manaus (30 Mbps). On the other hand, Manaus experienced the highest growth during the period, marking a +348% increase, followed closely by São Paulo (+343%), Rio de Janeiro (+337%), and Brasília (+333%), and at a greater distance by Belo Horizonte (+236%) and Fortaleza (+211%). In the same plot we show the coefficient of variation (CV) of the distribution of mobile Internet speed within each city across the years. Our findings indicate an increasing trend in the coefficient of variation across the six cities, suggesting that the distribution of mobile Internet speed became more dispersed as speed improved. This contrasts with the finding obtained for fixed Internet speed, where we observed a trend towards a more homogeneous distribution. We also note differences among the cities examined: Brasília exhibits the highest dispersion in mobile Internet speed distribution in 2023 (CV = 1.20), while Manaus features the most homogeneous distribution (CV = 0.57).
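For reference, the two summary quantities used in this paragraph can be computed as in the sketch below. It is only an illustration: the input values are made up, the function names are hypothetical, and the use of the population standard deviation (rather than the sample one) in the coefficient of variation is an assumption.

```python
from statistics import mean, pstdev


def coefficient_of_variation(values):
    """Standard deviation divided by the mean (population std assumed)."""
    return pstdev(values) / mean(values)


def growth_percent(start, end):
    """Relative change from start to end, in percent."""
    return 100.0 * (end - start) / start


# Illustrative per-unit speeds (Mbps) for two hypothetical cities
# with the same mean but different dispersion.
homogeneous = [28.0, 30.0, 32.0]
dispersed = [5.0, 30.0, 55.0]

print(round(coefficient_of_variation(homogeneous), 3))
print(round(coefficient_of_variation(dispersed), 3))
print(round(growth_percent(10.0, 44.8), 1))  # 348.0
```

A larger CV means that speeds within a city are spread more widely around their mean, independently of how high that mean is.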

Figure 8: Evolution of mobile Internet speed, expressed in download speed (Mbps), across the six cities considered. Boxplots show the distribution of Internet speed within each hexagonal unit in each city. The orange line indicates the evolution of the coefficient of variation of the Internet speed distribution over the years.

Figure 9 shows the logarithm of the ratio between the average mobile Internet speed measured in cells with wealth higher than the 75th quantile and those with wealth lower than the 25th quantile. This metric is meant to compare and highlight the differences between the wealthiest and the poorest units. A value close to zero indicates similar Internet quality for both wealthy and less wealthy areas, while positive (negative) values denote better Internet quality for the more wealthy (less wealthy). Similarly to what we found for fixed Internet speed, across various years and cities wealthy areas generally experience better mobile Internet quality than less wealthy areas. In contrast to fixed Internet, however, we note a decreasing disparity in Internet quality between more and less wealthy areas over the years. Indeed, the Pearson correlation coefficient between Internet speed and RWI also tends to decrease over time.
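The log-ratio metric can be sketched as follows. This is a minimal illustration with made-up values: the `(rwi, speed_mbps)` input layout and the function name are hypothetical, the natural logarithm is assumed, and quartiles are computed with the default method of Python's `statistics.quantiles`, which may differ slightly from the one used in the analysis.

```python
import math
from statistics import mean, quantiles


def wealth_log_ratio(cells):
    """Log of the ratio between mean speed in the wealthiest units
    (RWI above the 75th quantile) and in the poorest units
    (RWI below the 25th quantile).

    cells: list of (rwi, speed_mbps) pairs, one per spatial unit.
    """
    rwi_values = [rwi for rwi, _ in cells]
    q25, _, q75 = quantiles(rwi_values, n=4)
    rich = [speed for rwi, speed in cells if rwi > q75]
    poor = [speed for rwi, speed in cells if rwi < q25]
    # mean() raises StatisticsError if a group is empty (e.g. many ties).
    return math.log(mean(rich) / mean(poor))


# Illustrative values only: wealthier units are twice as fast.
cells = [
    (-0.8, 20.0), (-0.6, 20.0), (-0.4, 25.0), (-0.2, 28.0),
    (0.2, 30.0), (0.4, 35.0), (0.6, 40.0), (0.8, 40.0),
]
print(round(wealth_log_ratio(cells), 3))  # 0.693, i.e. log(2)
```

Swapping the two groups flips the sign of the result, which is why values around zero indicate parity between wealthy and less wealthy areas.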

Figure 9: Logarithm of the ratio between mobile Internet speeds measured in spatial units with wealth higher than the 75th quantile and those with wealth lower than the 25th quantile. The orange line represents the Pearson correlation coefficient between mobile Internet speed and RWI of different spatial units. Circles indicate where the coefficient is significant at the 5% level.

In Figure 10 we show the evolution of the Moran’s $I$ index of mobile Internet speed across different cities and years. We observe positive and significant indices in all cities, indicating that mobile Internet speed is also spatially autocorrelated (i.e., areas with high mobile Internet speed tend to be close to other areas with high mobile Internet speed). Nonetheless, in the case of mobile Internet speed, the trend over the years is less clear. Overall, we notice a decrease in spatial autocorrelation, but this trend is neither consistent over time nor shared by all cities.
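As a reminder of what the statistic measures, global Moran's $I$ for values $x_i$ and spatial weights $w_{ij}$ is $I = \frac{N}{W}\,\frac{\sum_{i,j} w_{ij}(x_i-\bar{x})(x_j-\bar{x})}{\sum_i (x_i-\bar{x})^2}$, with $W=\sum_{i,j} w_{ij}$. The sketch below implements this textbook formula on a toy example. The binary adjacency weights and the function name are assumptions, since the weighting scheme is not specified here, and the significance test is not included.

```python
def morans_i(values, weights):
    """Global Moran's I.

    values: list of observations x_i, one per spatial unit.
    weights: dict mapping (i, j) index pairs (i != j) to w_ij.
    """
    n = len(values)
    x_bar = sum(values) / n
    dev = [x - x_bar for x in values]
    w_total = sum(weights.values())
    # Cross-products of deviations between neighbouring units.
    num = sum(w * dev[i] * dev[j] for (i, j), w in weights.items())
    den = sum(d * d for d in dev)
    return (n / w_total) * num / den


# Toy example: four units on a line, each adjacent to its neighbours.
adjacency = {
    (0, 1): 1.0, (1, 0): 1.0,
    (1, 2): 1.0, (2, 1): 1.0,
    (2, 3): 1.0, (3, 2): 1.0,
}
print(morans_i([1.0, 2.0, 3.0, 4.0], adjacency))  # smooth gradient: positive
print(morans_i([1.0, 4.0, 1.0, 4.0], adjacency))  # alternating values: negative
```

Positive values arise when neighbouring units have similar speeds, negative values when high and low speeds alternate in space.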

Figure 10: Spatial Clustering of Mobile Internet Speed. Evolution of global Moran’s $I$ in each city between 2017 and 2023. Circles indicate where the statistic is significant at the 5% level.

In Figure 11A, we present the median daily mobile download speeds in the six cities between March and June 2020. Additionally, we mark the date when Brazil declared a national emergency with a vertical dashed line, and we show the increase in the percentage of individuals staying at home, measured using data from the COVID-19 Community Mobility Reports published by Google [26]. Across all cities, we observe a sharp decline in mobile network quality, as measured by download speed, following the declaration of the national emergency. Concurrently, the fraction of the population staying at home increased. After the initial drop, we observe a gradual recovery, with mobile download speeds approaching pre-emergency levels by June 2020. To quantify the drop, we compare the median mobile download speed computed in two periods: March 1st to March 20th, and March 20th to April 1st. Among the cities considered, Fortaleza experienced the most significant drop, with a decline of 20%, while Brasília is the only city showing a slight increase in speed (+3%). The other cities experienced declines of 17% (Rio de Janeiro), 9% (São Paulo), 8% (Manaus), and 6% (Belo Horizonte). Furthermore, in Figure 11B, we illustrate these drops for the top and bottom quartiles of the RWI. We obtain a different picture from fixed Internet speed. Fortaleza is the only city where less wealthy areas experienced a larger drop in mobile speed than more wealthy areas. In all other cities we observe the opposite trend.
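The comparison between the two periods can be sketched as below. This is only an illustration with made-up values: the `(date, mbps)` input layout and the function name are hypothetical, the windows are treated as half-open so that March 20th falls in the second period (the boundary handling is not specified here), and the median is taken directly over tests rather than through the per-user procedure of Sec. 4.1.

```python
from datetime import date
from statistics import median


def percent_change(tests, before, after):
    """Percentage change in median speed between two time windows.

    tests: list of (date, download_mbps) pairs.
    before, after: (start, end) date pairs; end is excluded.
    """
    def window_median(start, end):
        return median([mbps for day, mbps in tests if start <= day < end])

    base = window_median(*before)
    return 100.0 * (window_median(*after) - base) / base


# Illustrative values only.
tests = [
    (date(2020, 3, 5), 24.0), (date(2020, 3, 10), 25.0), (date(2020, 3, 15), 26.0),
    (date(2020, 3, 22), 19.0), (date(2020, 3, 25), 20.0), (date(2020, 3, 30), 21.0),
]
before = (date(2020, 3, 1), date(2020, 3, 20))
after = (date(2020, 3, 20), date(2020, 4, 1))
print(percent_change(tests, before, after))  # -20.0
```

Running the same function separately on tests from the top and bottom RWI quartiles would give the kind of comparison shown in Figure 11B.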

Figure 11: Mobile Network Resilience During the COVID-19 Pandemic. A) Daily median mobile download speed in the six cities between March and June 2020. The vertical dashed line indicates when Brazil declared the national emergency. The percentage change in individuals staying at home as measured via Google Community Mobility Reports is also shown. B) Drop in mobile Internet speed following the national emergency declaration in the top and bottom quartiles of the RWI in each city.

Figure 12: Spatial Clustering of Fixed Internet Speed. Distribution of spatial units with significant local Moran’s $I$ in all cities in 2017, 2020, and 2023. Clusters of low (high) Internet speed are shown in red (blue).

References

  • [1] Jorge Mora-Rivera and Fernando García-Mora. Internet access and poverty reduction: Evidence from rural and urban Mexico. Telecommunications Policy, 45(2):102076, 2021.
  • [2] Victor Medeiros, Rafael Saulo Marques Ribeiro, and Pedro Vasconcelos Maia do Amaral. Infrastructure and household poverty in Brazil: A regional approach using multilevel models. World Development, 137:105118, 2021.
  • [3] Hernan Galperin and M Fernanda Viecens. Connected for development? Theory and evidence about the impact of internet technologies on poverty alleviation. Development Policy Review, 35(3):315–336, 2017.
  • [4] Elisa V. Mariscal, Alexander Elbittar, Martin Cave, Ruben Guerrero, Antonio Garcia-Zaballos, Enrique Iglesias, and William Webb. The Impact of Digital Infrastructure on the Consequences of COVID-19 and on the Mitigation of Future Effects. SSRN, 2020.
  • [5] Lesley Chiou and Catherine Tucker. Social distancing, internet access and inequality. Working Paper 26982, National Bureau of Economic Research, April 2020.
  • [6] Elisa V Mariscal, Alexander Elbittar, Martin Cave, Ruben Guerrero, Antonio Garcia-Zaballos, Enrique Iglesias, and William Webb. The Impact of Digital Infrastructure on the Consequences of COVID-19 and on the Mitigation of Future Effects. IDB Institutions for Development Sector Connectivity, Markets, and Finance Division Discussion Paper No. IDB-DP-827, 2020.
  • [7] Nicolò Gozzi, Niccolò Comini, and Nicola Perra. The adoption of non-pharmaceutical interventions and the role of digital infrastructure during the COVID-19 pandemic in Colombia, Ecuador, and El Salvador. EPJ Data Science, 12(1):18, 2023.
  • [8] Anna Torres, Ewa Domańska-Glonek, Wojciech Dzikowski, Jan Korulczyk, and Kamil Torres. Transition to on-line is possible: solution for simulation-based teaching during pandemic. Medical education, 2020.
  • [9] Johannes M. Bauer, Keith N. Hampton, Laleah Fernandez, and Craig T. Robertson. Overcoming Michigan’s homework gap: The role of broadband internet connectivity for student success and career outlooks. IRPN: Innovation & Information Management (Topic), 2020.
  • [10] Matilde Taddei and Sara Bulgheroni. Facing the real time challenges of the COVID-19 emergency for child neuropsychology service in Milan. Research in Developmental Disabilities, 107:103786, 2020.
  • [11] Joseph Taylor and Rickey Taylor. Decreasing work-related movement during a pandemic. Location analytics and the implications of the digital divide. International Journal of Development Issues, 20:293–308, 2021.
  • [12] Kamal Ahmed Soomro, Ugur Kale, Reagan Curtis, Mete Akcaoglu, and Malayna Bernstein. Digital divide among higher education faculty. International Journal of Educational Technology in Higher Education, 17(1):21, 2020.
  • [13] Obiageri Bridget Azubuike, Oyindamola Adegboye, and Habeeb Quadri. Who gets to learn in a pandemic? Exploring the digital divide in remote learning during the COVID-19 pandemic in Nigeria. International Journal of Educational Research Open, 2-2:100022, 2021.
  • [14] Chukwuma N Eruchalu, Margaret S Pichardo, Maheetha Bharadwaj, Carmen B Rodriguez, Jorge A Rodriguez, Regan W Bergmark, David W Bates, and Gezzer Ortega. The Expanding Digital Divide: Digital Health Access Inequities during the COVID-19 Pandemic in New York City. Journal of Urban Health, 98(2):183–186, 2021.
  • [15] Geoff Watts. COVID-19 and the digital divide in the UK. The Lancet Digital Health, 2(8):e395–e396, aug 2020.
  • [16] Joseph Taylor and Rickey Taylor. Decreasing work-related movement during a pandemic. Location analytics and the implications of the digital divide. International journal of development issues, 20(3):293–308, 2021.
  • [17] GINI Index, The World Bank. https://data.worldbank.org/indicator/SI.POV.GINI, 2024. Accessed: 2024-04-23.
  • [18] Anatel, 2024. https://informacoes.anatel.gov.br/paineis/acessos, 2024. Accessed: 2024-04-23.
  • [19] Nick Feamster and Jason Livingood. Measuring internet speed: current challenges and future recommendations. Communications of the ACM, 63(12):72–80, 2020.
  • [20] Siope Vakataki‘Ofa and Cristina Bernal Aparicio. Visualizing broadband speeds in Asia and the Pacific. 2021.
  • [21] George S Ford. Form 477, Speed-Tests, and the American Broadband User’s Experience. Speed-Tests, and the American Broadband User’s Experience (March 31, 2021), 2021.
  • [22] Guanghua Chi, Han Fang, Sourav Chatterjee, and Joshua E Blumenstock. Microestimates of wealth for all low-and middle-income countries. Proceedings of the National Academy of Sciences, 119(3):e2113658119, 2022.
  • [23] Xiaobo Zhou and Henry Lin. Moran’s I, pages 725–725. Springer US, Boston, MA, 2008.
  • [24] Humanitarian OpenStreetMap - Education Facilities. https://data.humdata.org/dataset/?dataseries_name=HOTOSM+-+Education+Facilities, 2024. Accessed: 2024-03-28.
  • [25] FCC increases broadband speed benchmark. https://docs.fcc.gov/public/attachments/DOC-401205A1.pdf, 2024. Accessed: 2024-04-23.
  • [26] Google LLC. Google COVID-19 Community Mobility Reports. https://www.google.com/covid19/mobility/, 2020. Accessed: 2021-08-01.
  • [27] Conexis, Statistics. https://conexis.org.br/numeros/estatisticas/, 2024. Accessed: 2024-04-23.
  • [28] Ookla for Good. https://www.ookla.com/ookla-for-good, 2024. Accessed: 2024-05-23.
  • [29] Limite Favelas 2019, Rio de Janeiro. https://www.data.rio/datasets/limite-favelas-2019/explore, 2024. Accessed: 2024-04-23.