-
Attitudes Towards Migration in a COVID-19 Context: Testing a Behavioral Immune System Hypothesis with Twitter Data
Authors:
Yerka Freire-Vidal,
Gabriela Fajardo,
Carlos Rodríguez-Sickert,
Eduardo Graells-Garrido,
José Antonio Muñoz-Reyes,
Oriana Figueroa
Abstract:
The COVID-19 outbreak implied many changes in the daily life of most of the world's population for a long time, prompting severe restrictions on sociality. The Behavioral Immune System (BIS) suggests that when facing pathogens, a psychological mechanism would be activated that, among other things, would generate an increase in prejudice and discrimination towards marginalized groups, including immigrants. This study aimed to test if people tend to enhance their rejection of minorities and foreign groups under the threat of contagious diseases, using the users' attitudes towards migrants in Twitter data from Chile, for pre-pandemic and pandemic contexts. Our results only partially support the BIS hypothesis, since threatened users increased their tweet production in the pandemic period, compared to empathetic users, but the latter grew in number and also increased the reach of their tweets between the two periods. We also found differences in the use of language between these types of users. Alternative explanations for these results may be context-dependent.
Submitted 22 May, 2024;
originally announced May 2024.
-
Feel Old Yet? Updating Mode of Transportation Distributions from Travel Surveys using Data Fusion with Mobile Phone Data
Authors:
Eduardo Graells-Garrido,
Daniela Opitz,
Francisco Rowe,
Jacqueline Arriagada
Abstract:
Up-to-date information on different modes of travel to monitor transport traffic and evaluate rapid urban transport planning interventions is often lacking. Transport systems typically rely on traditional data sources providing outdated mode-of-travel data due to their data latency, infrequent data collection and high cost. To address this issue, we propose a method that leverages mobile phone data as a cost-effective and rich source of geospatial information to capture current human mobility patterns at unprecedented spatiotemporal resolution. Our approach employs mobile phone application usage traces to infer modes of transportation that are challenging to identify (bikes and ride-hailing/taxi services) based on mobile phone location data. Using data fusion and matrix factorization techniques, we integrate official data sources (household surveys and census data) with mobile phone application usage data. This integration enables us to reconstruct the official data and create an updated dataset that incorporates insights from digital footprint data from application usage. We illustrate our method using a case study focused on Santiago, Chile, successfully inferring four modes of transportation: mass-transit, motorised, active, and taxi. Our analysis revealed significant changes in transportation patterns between 2012 and 2020. We quantify a reduction in mass-transit usage across municipalities in Santiago, except where metro/rail lines have been more recently introduced, highlighting added resilience to the public transport network of these infrastructure enhancements. Additionally, we evidence an overall increase in motorised transport throughout Santiago, revealing persistent challenges in promoting urban sustainable transportation. We validate our findings by comparing our updated estimates with official smart card transaction data.
Submitted 30 May, 2023; v1 submitted 20 April, 2022;
originally announced April 2022.
-
Bots don't Vote, but They Surely Bother! A Study of Anomalous Accounts in a National Referendum
Authors:
Eduardo Graells-Garrido,
Ricardo Baeza-Yates
Abstract:
The Web contains several social media platforms for discussion, exchange of ideas, and content publishing. These platforms are used by people, but also by distributed agents known as bots. Although bots have existed for decades, with many of them being benevolent, their influence in propagating and generating deceptive information has increased in recent years. Here we present a characterization of the discussion on Twitter about the 2020 Chilean constitutional referendum. The characterization uses a profile-oriented analysis that enables the isolation of anomalous content using machine learning. As a result, we obtain a characterization that matches national vote turnout, and we measure how anomalous accounts (some of which are automated bots) produce content and interact promoting (false) information.
Submitted 8 March, 2022;
originally announced March 2022.
-
A physiology-inspired framework for holistic city simulations
Authors:
Irene Meta,
Fernando M. Cucchietti,
Diego Navarro,
Eduardo Graells-Garrido,
Vicente Guallart
Abstract:
Life, services and activities within cities have commonly been studied by separate disciplines, each one independent from the others. One such approach is the computer simulation, which enables in-depth modelling and cost-effective evaluation of city phenomena. However, the adoption of integrated city simulations faces several barriers, such as managerial, social, and technical, despite its potential to support city planning and policymaking. This paper introduces the City Physiology: a new conceptual framework to facilitate the integration of city layers when designing holistic simulators. The physiology is introduced and applied through a process of three steps. Firstly, a literature review is offered in order to study the terminology and the progress already made towards integrated modelling of different urban systems. Secondly, interactions between urban systems are extracted from the approaches studied before. Finally, the pipeline to carry out the integration strategy is described. In addition to providing a conceptual tool for holistic simulations, the framework enables the discovery of new research lines generated by previously unseen connections between city layers. Being an open framework, available to all researchers to use and broaden, the authors of this paper envisage that it will be a valuable resource in establishing an exact science of cities.
Submitted 15 October, 2021; v1 submitted 21 July, 2021;
originally announced August 2021.
-
A city of cities: Measuring how 15-minutes urban accessibility shapes human mobility in Barcelona
Authors:
Eduardo Graells-Garrido,
Feliu Serra-Burriel,
Francisco Rowe,
Fernando M. Cucchietti,
Patricio Reyes
Abstract:
As cities expand, human mobility has become a central focus of urban planning and policy making to make cities more inclusive and sustainable. Initiatives such as the "15-minutes city" have been put in place to shift the attention from monocentric city configurations to polycentric structures, increasing the availability and diversity of local urban amenities. Ultimately they expect to increase local walkability and increase mobility within residential areas. While we know how urban amenities influence human mobility at the city level, little is known about spatial variations in this relationship. Here, we use mobile phone, census, and volunteered geographical data to measure geographic variations in the relationship between origin-destination flows and local urban accessibility in Barcelona. Using a Negative Binomial Geographically Weighted Regression model, we show that, globally, people tend to visit neighborhoods with better access to education and retail. Locally, these and other features change in sign and magnitude through the different neighborhoods of the city in ways that are not explained by administrative boundaries, and that provide deeper insights regarding urban characteristics such as rental prices. In conclusion, our work suggests that the qualities of a 15-minutes city can be measured at scale, delivering actionable insights on the polycentric structure of cities, and how people use and access this structure.
Submitted 22 March, 2021;
originally announced March 2021.
-
Every Colour You Are: Stance Prediction and Turnaround in Controversial Issues
Authors:
Eduardo Graells-Garrido,
Ricardo Baeza-Yates,
Mounia Lalmas
Abstract:
Web platforms have allowed political manifestation and debate for decades. Technology changes have brought new opportunities for expression, and the availability of longitudinal data of these debates entices new questions regarding who participates, and who updates their opinion. The aim of this work is to provide a methodology to measure these phenomena, and to test this methodology on a specific topic, abortion, as observed on one of the most popular micro-blogging platforms. To do so, we followed the discussion on Twitter about abortion in two Spanish-speaking countries from 2015 to 2018. Our main insights are twofold. On the one hand, people adopted new technologies to express their stances, particularly colored variations of heart emojis ([green heart] & [purple heart]) in a way that mirrored physical manifestations on abortion. On the other hand, even on issues with strong opinions, opinions can change, and these changes show differences in demographic groups. These findings imply that debate on the Web embraces new ways of stance adherence, and that changes of opinion can be measured and characterized.
Submitted 19 May, 2020;
originally announced May 2020.
-
Measuring Spatial Subdivisions in Urban Mobility with Mobile Phone Data
Authors:
Eduardo Graells-Garrido,
Irene Meta,
Feliu Serra-Burriel,
Patricio Reyes,
Fernando M. Cucchietti
Abstract:
Urban population grows constantly. By 2050 two thirds of the world population will reside in urban areas. This growth is faster and more complex than the ability of cities to measure and plan for their sustainability. To understand what makes a city inclusive for all, we define a methodology to identify and characterize spatial subdivisions: areas with over- and under-representation of specific population groups, named hot and cold spots respectively. Using aggregated mobile phone data, we apply this methodology to the city of Barcelona to assess the mobility of three groups of people: women, elders, and tourists. We find that, within the three groups, cold spots have a lower diversity of amenities and services than hot spots. Also, cold spots of women and tourists tend to have lower population income. These insights apply to the floating population of Barcelona, thus augmenting the scope of how inclusiveness can be analyzed in the city.
Submitted 20 February, 2020;
originally announced February 2020.
-
Toward An Interdisciplinary Methodology to Solve New (Old) Transportation Problems
Authors:
Eduardo Graells-Garrido,
Vanessa Peña-Araya
Abstract:
The rising availability of digital traces provides a fertile ground for new solutions to both new and old problems in cities. Even though a massive data set analyzed with Data Science methods may provide a powerful solution to a problem, its adoption by relevant stakeholders is not guaranteed, due to adoption blockers such as lack of interpretability and transparency. In this context, this paper proposes a preliminary methodology toward bridging two disciplines, Data Science and Transportation, to solve urban problems with methods that are suitable for adoption. The methodology is defined by four steps where people from both disciplines go from algorithm and model definition to the building of a potentially adoptable solution. As a case study, we describe how this methodology was applied to define a model to infer commuting trips with mode of transportation from mobile phone data.
Submitted 20 February, 2020;
originally announced February 2020.
-
Characterization of Local Attitudes Toward Immigration Using Social Media
Authors:
Yerka Freire,
Eduardo Graells-Garrido
Abstract:
Migration is a worldwide phenomenon that may generate different reactions in the population. Attitudes vary from those that support multiculturalism and communion between locals and foreigners, to contempt and hatred toward immigrants. Since anti-immigration attitudes are often materialized in acts of violence and discrimination, it is important to identify factors that characterize these attitudes. However, doing so is expensive and impractical, as traditional methods require enormous efforts to collect data. In this paper, we propose to leverage Twitter to characterize local attitudes toward immigration, with a case study on Chile, where immigrant population has drastically increased in recent years. Using semi-supervised topic modeling, we situated 49K users into a spectrum ranging from in-favor to against immigration. We characterized both sides of the spectrum in two aspects: the emotions and lexical categories relevant for each attitude, and the discussion network structure. We found that the discussion is mostly driven by Haitian immigration; that there are temporal trends in tendency and polarity of discussion; and that assortative behavior on the network differs with respect to attitude. These insights may inform policy makers on how people feel with respect to migration, with potential implications on communication of policy and the design of interventions to improve inter-group relations.
Submitted 12 March, 2019;
originally announced March 2019.
-
Shopping Mall Attraction and Social Mixing at a City Scale
Authors:
Mariano G. Beiró,
Loreto Bravo,
Diego Caro,
Ciro Cattuto,
Leo Ferres,
Eduardo Graells-Garrido
Abstract:
The social inclusion aspects of shopping malls and their effects on our understanding of urban spaces have been a controversial argument largely discussed in the literature. Shopping malls offer an open, safe and democratic version of the public space. Many of their detractors suggest that malls target their customers in subtle ways, promoting social exclusion. In this work, we analyze whether malls offer opportunities for social mixing by analyzing the patterns of shopping mall visits in a large Latin-American city: Santiago de Chile.
We use a large XDR (Data Detail Records) dataset from a telecommunication company to analyze the mobility of 387,152 cell phones around 16 large malls in Santiago de Chile during one month. We model the influx of people to malls in terms of a gravity model of mobility, and we are able to predict the customer profile distribution of each mall, explaining it in terms of mall location, the population distribution, and mall size.
Then, we analyze the concept of social attraction, expressed as people from low and middle classes being attracted by malls that target high-income customers. We include a social attraction factor in our model and find that it is negligible in the process of choosing a mall. We observe that social mixing arises only in peripheral malls located farthest from the city center, which both low and middle class people visit. Using a co-visitation model we show that people tend to choose a restricted profile of malls according to their socio-economic status and their distance from the mall. We conclude that the potential for social mixing in malls could be capitalized by designing public policies regarding transportation and mobility.
Submitted 9 February, 2018; v1 submitted 31 January, 2018;
originally announced February 2018.
-
Toward Finding Latent Cities with Non-Negative Matrix Factorization
Authors:
Eduardo Graells-Garrido,
Diego Caro,
Denis Parra
Abstract:
In the last decade, digital footprints have been used to cluster population activity into functional areas of cities.
However, a key aspect has been overlooked: we experience our cities not only by performing activities at specific destinations, but also by moving from one place to another.
In this paper, we propose to analyze and cluster the city based on how people move through it. Particularly, we introduce Mobilicities, automatically generated travel patterns inferred from mobile phone network data using NMF, a matrix factorization model.
We evaluate our method in a large city and we find that mobilicities reveal latent but at the same time interpretable mobility structures of the city. Our results provide evidence on how clustering and visualization of aggregated phone logs could be used in planning systems to interactively analyze city structure and population activity.
Submitted 27 January, 2018;
originally announced January 2018.
-
Organic Visualization of Document Evolution
Authors:
Ignacio Perez-Messina,
Claudio Gutierrez,
Eduardo Graells-Garrido
Abstract:
Recent availability of data of writing processes at keystroke-granularity has enabled research on the evolution of document writing. A natural step is to develop systems that can actually show this data and make it understandable. Here we propose a data structure that captures a document's fine-grained history and an organic visualization that serves as an interface to it. We evaluate a proof-of-concept implementation of the system through a pilot study with documents written by students at a public university. Our results are promising and reveal facets such as general strategies adopted, local edition density and hierarchical structure of the final text.
Submitted 17 December, 2017;
originally announced December 2017.
-
The Effect of Pokémon Go on The Pulse of the City: A Natural Experiment
Authors:
Eduardo Graells-Garrido,
Leo Ferres,
Diego Caro,
Loreto Bravo
Abstract:
Pokémon Go, a location-based game that uses augmented reality techniques, received unprecedented media coverage due to claims that it allowed for greater access to public spaces, increasing the number of people out on the streets, and generally improving health, social, and security indices. However, the true impact of Pokémon Go on people's mobility patterns in a city is still largely unknown. In this paper, we perform a natural experiment using data from mobile phone networks to evaluate the effect of Pokémon Go on the pulse of a big city: Santiago, capital of Chile. We found significant effects of the game on the floating population of Santiago compared to movement prior to the game's release in August 2016: in the following week, up to 13.8% more people spent time outside at certain times of the day, even if they do not seem to go out of their usual way. These effects were found by performing regressions using count models over the states of the cellphone network during each day under study. The models used controlled for land use, daily patterns, and points of interest in the city.
Our results indicate that, on business days, there are more people on the street at commuting times, meaning that people did not change their daily routines but slightly adapted them to play the game. Conversely, on Saturday and Sunday night, people indeed went out to play, but favored places close to where they live.
Even if the statistical effects of the game do not reflect the massive change in mobility behavior portrayed by the media, at least in terms of expanse, they do show how "the street" may become a new place of leisure. This change should have an impact on long-term infrastructure investment by city officials, and on the drafting of public policies aimed at stimulating pedestrian traffic.
Submitted 18 September, 2017; v1 submitted 25 October, 2016;
originally announced October 2016.
-
A Day of Your Days: Estimating Individual Daily Journeys Using Mobile Data to Understand Urban Flow
Authors:
Eduardo Graells-Garrido,
Diego Saez-Trumper
Abstract:
Nowadays, travel surveys provide rich information about urban mobility and commuting patterns. But, at the same time, they have drawbacks: they are static pictures of a dynamic phenomenon, are expensive to make, and take prolonged periods of time to finish. However, the availability of mobile usage data (Call Detail Records) makes the study of urban mobility possible at levels not known before. This has been done in the past with good results--mobile data makes it possible to find and understand aggregated mobility patterns. In this paper, we propose to analyze mobile data at the individual level by estimating daily journeys, and use those journeys to build Origin-Destination matrices to understand urban flow. We evaluate this approach with large anonymized CDRs from Santiago, Chile, and find that our method has a high correlation ($ρ = 0.89$) with the current travel survey, and that it captures external anomalies in daily travel patterns, making our method suitable for inclusion into urban computing applications.
Submitted 29 February, 2016;
originally announced February 2016.
-
Women Through the Glass Ceiling: Gender Asymmetries in Wikipedia
Authors:
Claudia Wagner,
Eduardo Graells-Garrido,
David Garcia,
Filippo Menczer
Abstract:
Contributing to the writing of history has never been as easy as it is today thanks to Wikipedia, a community-created encyclopedia that aims to document the world's knowledge from a neutral point of view. Though everyone can participate it is well known that the editor community has a narrow diversity, with a majority of white male editors. While this participatory gender gap has been studied extensively in the literature, this work sets out to assess potential gender inequalities in Wikipedia articles along different dimensions: notability, topical focus, linguistic bias, structural properties, and meta-data presentation.
We find that (i) women in Wikipedia are more notable than men, which we interpret as the outcome of a subtle glass ceiling effect; (ii) family-, gender-, and relationship-related topics are more present in biographies about women; (iii) linguistic bias manifests in Wikipedia since abstract terms tend to be used to describe positive aspects in the biographies of men and negative aspects in the biographies of women; and (iv) there are structural differences in terms of meta-data and hyperlinks, which have consequences for information-seeking activities. While some differences are expected, due to historical and social contexts, other differences are attributable to Wikipedia editors. The implications of such differences are discussed having Wikipedia contribution policies in mind. We hope that the present work will contribute to increased awareness about, first, gender issues in the content of Wikipedia, and second, the different levels on which gender biases can manifest on the Web.
Submitted 2 March, 2016; v1 submitted 19 January, 2016;
originally announced January 2016.
-
Sentiment Visualisation Widgets for Exploratory Search
Authors:
Eduardo Graells-Garrido,
Mounia Lalmas,
Ricardo Baeza-Yates
Abstract:
This paper proposes the usage of visualisation widgets for exploratory search with sentiment as a facet. Starting from specific design goals for depiction of ambivalence in sentiment, two visualization widgets were implemented: scatter plot and parallel coordinates. Those widgets were evaluated against a text baseline in a small-scale usability study with exploratory tasks using Wikipedia as dataset. The study results indicate that users spend more time browsing with scatter plots in a positive way. A post-hoc analysis of individual differences in behavior revealed that when considering two types of users, explorers and achievers, engagement with scatter plots is positive and significantly greater when users are explorers. We discuss the implications of these findings for sentiment-based exploratory search and personalised user interfaces.
Submitted 8 January, 2016;
originally announced January 2016.
-
Data Portraits and Intermediary Topics: Encouraging Exploration of Politically Diverse Profiles
Authors:
Eduardo Graells-Garrido,
Mounia Lalmas,
Ricardo Baeza-Yates
Abstract:
In micro-blogging platforms, people connect and interact with others. However, due to cognitive biases, they tend to interact with like-minded people and read agreeable information only. Many efforts to make people connect with those who think differently have not worked well. In this paper, we hypothesize, first, that previous approaches have not worked because they have been direct -- they have tried to explicitly connect people with those having opposing views on sensitive issues. Second, that neither recommendation nor presentation of information is by itself enough to encourage behavioral change. We propose a platform that mixes a recommender algorithm and a visualization-based user interface to explore recommendations. It recommends politically diverse profiles in terms of distance of latent topics, and displays those recommendations in a visual representation of each user's personal content. We performed an "in the wild" evaluation of this platform, and found that people explored more recommendations when using a biased algorithm instead of ours. In line with our hypothesis, we also found that the mixture of our recommender algorithm and our user interface allowed politically interested users to exhibit an unbiased exploration of the recommended profiles. Finally, our results contribute insights in two aspects: first, which individual differences are important when designing platforms aimed at behavioral change; and second, which algorithms and user interfaces should be mixed to help users avoid cognitive mechanisms that lead to biased behavior.
Submitted 4 January, 2016;
originally announced January 2016.
-
Encouraging Diversity- and Representation-Awareness in Geographically Centralized Content
Authors:
Eduardo Graells-Garrido,
Mounia Lalmas,
Ricardo Baeza-Yates
Abstract:
In centralized countries, not only population, media and economic power are concentrated, but people give more attention to central locations. While this is not inherently bad, this behavior extends to micro-blogging platforms: central locations get more attention in terms of information flow. In this paper we study the effects of an information filtering algorithm that decentralizes content in such platforms. Particularly, we find that users from non-central locations were not able to identify the geographical diversity on timelines generated by the algorithm, which were diverse by construction. To make users see the inherent diversity, we define a design rationale to approach this problem, focused on an already known visualization technique: treemaps. Using interaction data from an "in the wild" deployment of our proposed system, we find that, even though there are effects of centralization in exploratory user behavior, the treemap was able to make users see the inherent geographical diversity of timelines, and engage with user generated content. With these results in mind, we propose practical actions for micro-blogging platforms to account for the differences and biased behavior induced by centralization.
Submitted 7 October, 2015;
originally announced October 2015.
-
Finding Intermediary Topics Between People of Opposing Views: A Case Study
Authors:
Eduardo Graells-Garrido,
Mounia Lalmas,
Ricardo Baeza-Yates
Abstract:
In micro-blogging platforms, people can connect with others and have conversations on a wide variety of topics. However, because of homophily and selective exposure, users tend to connect with like-minded people and only read agreeable information. Motivated by this scenario, in this paper we study the diversity of intermediary topics, which are latent topics estimated from user generated content. These topics can be used as features in recommender systems aimed at introducing people of diverse political viewpoints. We conducted a case study on Twitter, considering the debate about a sensitive issue in Chile, where we quantified homophilic behavior in terms of political discussion and then we evaluated the diversity of intermediary topics in terms of political stances of users.
Submitted 30 July, 2015; v1 submitted 2 June, 2015;
originally announced June 2015.
-
Language, Twitter and Academic Conferences
Authors:
Ruth García,
Diego Gómez,
Denis Parra,
Christoph Trattner,
Andreas Kaltenbrunner,
Eduardo Graells-Garrido
Abstract:
Using Twitter during academic conferences is a way of engaging and connecting an audience inherently multicultural by the nature of scientific collaboration. English is expected to be the lingua franca bridging the communication and integration between native speakers of different mother tongues. However, little research has been done to support this assumption. In this paper we analyzed how integrated language communities are by analyzing the scholars' tweets used in 26 Computer Science conferences over a time span of five years. We found that although English is the most popular language used to tweet during conferences, a significant proportion of people also tweet in other languages. In addition, people who tweet solely in English interact mostly within the same group (English monolinguals), while people who speak other languages tend to show a more diverse interaction with other lingua groups. Finally, we also found that the people who interact with other Twitter users show a more diverse language distribution, while people who do not interact mostly post tweets in a single language. These results suggest that the number of languages a user speaks can affect the interaction dynamics of online communities.
Submitted 13 April, 2015;
originally announced April 2015.
-
First Women, Second Sex: Gender Bias in Wikipedia
Authors:
Eduardo Graells-Garrido,
Mounia Lalmas,
Filippo Menczer
Abstract:
Contributing to history has never been as easy as it is today. Anyone with access to the Web is able to play a part on Wikipedia, an open and free encyclopedia. Wikipedia, available in many languages, is one of the most visited websites in the world and arguably one of the primary sources of knowledge on the Web. However, not everyone is contributing to Wikipedia from a diversity point of view; several groups are severely underrepresented. One of those groups is women, who make up approximately 16% of the current contributor community, meaning that most of the content is written by men. In addition, although there are specific guidelines of verifiability, notability, and neutral point of view that must be adhered to by Wikipedia content, these guidelines are supervised and enforced by men.
In this paper, we propose that gender bias is not about participation and representation only, but also about characterization of women. We approach the analysis of gender bias by defining a methodology for comparing the characterizations of men and women in biographies in three aspects: meta-data, language, and network structure. Our results show that, indeed, there are differences in characterization and structure. Some of these differences are reflected from the off-line world documented by Wikipedia, but other differences can be attributed to gender bias in Wikipedia content. We contextualize these differences in feminist theory and discuss their implications for Wikipedia policy.
Submitted 2 June, 2015; v1 submitted 8 February, 2015;
originally announced February 2015.
-
Data Portraits: Connecting People of Opposing Views
Authors:
Eduardo Graells-Garrido,
Mounia Lalmas,
Daniele Quercia
Abstract:
Social networks allow people to connect with each other and have conversations on a wide variety of topics. However, users tend to connect with like-minded people and read agreeable information, a behavior that leads to group polarization. Motivated by this scenario, we study how to take advantage of partial homophily to suggest agreeable content to users authored by people with opposite views on sensitive issues. We introduce a paradigm to present a data portrait of users, in which their characterizing topics are visualized and their corresponding tweets are displayed using an organic design. Among their tweets we inject recommended tweets from other people considering their views on sensitive issues in addition to topical relevance, indirectly motivating connections between dissimilar people. To evaluate our approach, we present a case study on Twitter about a sensitive topic in Chile, where we estimate user stances for regular people and find intermediary topics. We then evaluated our design in a user study. We found that recommending topically relevant content from authors with opposite views in a baseline interface had a negative emotional effect. We saw that our organic visualization design reverts that effect. We also observed significant individual differences linked to evaluation of recommendations. Our results suggest that organic visualization may revert the negative effects of providing potentially sensitive content.
Submitted 19 November, 2013;
originally announced November 2013.
-
Caracterizando la Web Chilena
Authors:
Eduardo Graells-Garrido,
Ricardo Baeza-Yates
Abstract:
This article presents a characterization of the web space from Chile in 2007. The characterization shows distributions of sites and domains, analysis of document content and server configuration. In addition, the network structure of the Chilean Web is analyzed, determining components based on hyperlink structure at the document and site levels.
Original Abstract (translated from Spanish): This article presents a characterization of the Chilean web space for the year 2007. It shows distributions of sites and domains, a characterization of content based on document types, and the configuration of servers. It studies the structure of the network created by hyperlinks in documents and how the different components of this structure vary when hyperlinks are aggregated at the site level.
Submitted 10 September, 2013;
originally announced September 2013.
-
Zahir: a Object-Oriented Framework for Computer Graphics
Authors:
Eduardo Graells-Garrido,
María Cecilia Rivara
Abstract:
In this article we present Zahir, a framework for experimentation in Computer Graphics that provides a group of object-oriented base components that take care of common tasks in rendering techniques and algorithms, especially those of Non-Photorealistic Rendering (NPR). These components allow developers to implement rendering techniques and algorithms over static and animated meshes. Currently, Zahir is being used in a Master's thesis and as support material in the undergraduate Computer Graphics course at the University of Chile.
Submitted 7 September, 2013;
originally announced September 2013.
-
Evolution of the Chilean Web: A Larger Study
Authors:
Eduardo Graells-Garrido,
Ricardo Baeza-Yates
Abstract:
In this paper we extend our previous and only study on the dynamics of the Chilean Web. This new study doubles the time period and to the best of our knowledge is the only study of its type known about any country in the Web. The new results corroborate the trends found before, in particular the exponential growth of the Web, and reinforce the conclusion that the Web is more chaotic than we would like. Hence, modeling most Web characteristics is not trivial.
Submitted 7 September, 2013;
originally announced September 2013.
-
#Santiago is not #Chile, or is it? A Model to Normalize Social Media Impact
Authors:
Eduardo Graells-Garrido,
Barbara Poblete
Abstract:
Online social networks are known to be demographically biased. Currently there are questions about what degree of representativity of the physical population they have, and how population biases impact user-generated content. In this paper we focus on centralism, a problem affecting Chile. Assuming that local differences exist in a country, in terms of vocabulary, we built a methodology based on the vector space model to find distinctive content from different locations, and use it to create classifiers to predict whether the content of a micro-post is related to a particular location, having in mind a geographically diverse selection of micro-posts. We evaluate them in a case study where we analyze the virtual population of Chile that participated in the Twitter social network during an event of national relevance: the municipal (local governments) elections held in 2012. We observe that the participating virtual population is spatially representative of the physical population, implying that there is centralism in Twitter. Our classifiers out-perform a non geographically-diverse baseline at the regional level, and have the same accuracy at a provincial level. However, our approach makes assumptions that need to be tested in multi-thematic and more general datasets. We leave this for future work.
Submitted 6 September, 2013;
originally announced September 2013.
-
Ornitología Virtual: Caracterizando a #Chile en Twitter
Authors:
Eduardo Graells-Garrido
Abstract:
This article presents an analysis of the tweets collected on October 28, 2012, in the context of the 2012 municipal elections in Chile. The analysis follows a methodology based on previous literature, in particular on techniques from information retrieval and the analysis of information spaces. As a result, we determine: 1) basic demographic characteristics of the Chilean virtual population, including its geographical distribution; 2) the content that characterizes each region, and how information flows between regions; and 3) the degree to which the virtual population participating in the event is representative of the physical population. We find that the sample obtained is representative of the population in terms of geographical distribution, that the centralism affecting the country is reflected on Twitter, and that, despite population biases, it is possible to identify the content that characterizes each region. The article closes with a discussion of the implications and practical conclusions of this work, as well as future applications.
Submitted 16 November, 2013; v1 submitted 30 June, 2013;
originally announced July 2013.