Skip to main content

Showing 1–50 of 61 results for author: Morales, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.05358  [pdf, other

    cs.CY cs.LG cs.SI stat.ML

    Variational Inference of Parameters in Opinion Dynamics Models

    Authors: Jacopo Lenti, Fabrizio Silvestri, Gianmarco De Francisci Morales

    Abstract: Despite the frequent use of agent-based models (ABMs) for studying social phenomena, parameter estimation remains a challenge, often relying on costly simulation-based heuristics. This work uses variational inference to estimate the parameters of an opinion dynamics ABM, by transforming the estimation problem into an optimization task that can be solved directly. Our proposal relies on probabili… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2402.18470  [pdf, other

    cs.SI physics.data-an

    A Higher-Order Lens for Social Systems

    Authors: Giulia Preti, Adriano Fazzone, Giovanni Petri, Gianmarco De Francisci Morales

    Abstract: Despite the widespread adoption of higher-order mathematical structures such as hypergraphs, methodological tools for their analysis lag behind those for traditional graphs. This work addresses a critical gap in this context by proposing two micro-canonical random null models for directed hypergraphs: the Directed Hypergraph Configuration Model (DHCM) and the Directed Hypergraph JOINT Model (DHJM)… ▽ More

    Submitted 7 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  3. arXiv:2402.13855  [pdf, other

    cs.CY cs.SI

    What we can learn from TikTok through its Research API

    Authors: Francesco Corso, Francesco Pierri, Gianmarco De Francisci Morales

    Abstract: TikTok is a social media platform that has gained immense popularity over the last few years, particularly among younger demographics, due to the viral trends and challenges shared worldwide. The recent release of a free Research API opens the door to collecting data on posted videos, associated comments, and user activities. Our study focuses on evaluating the reliability of the results returned… ▽ More

    Submitted 4 April, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 11 pages, 8 Figures, submitted to DHOW at WebSci'24

  4. arXiv:2401.13656  [pdf, other

    cs.SI cs.CY physics.soc-ph stat.AP

    Navigating Multidimensional Ideologies with Reddit's Political Compass: Economic Conflict and Social Affinity

    Authors: Ernesto Colacrai, Federico Cinus, Gianmarco De Francisci Morales, Michele Starnini

    Abstract: The prevalent perspective in quantitative research on opinion dynamics flattens the landscape of the online political discourse into a traditional left--right dichotomy. While this approach helps simplify the analysis and modeling effort, it also neglects the intrinsic multidimensional richness of ideologies. In this study, we analyze social interactions on Reddit, under the lens of a multi-dimens… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  5. arXiv:2311.00118  [pdf, other

    cs.LG q-bio.NC stat.AP stat.ME stat.ML

    Extracting the Multiscale Causal Backbone of Brain Dynamics

    Authors: Gabriele D'Acunto, Francesco Bonchi, Gianmarco De Francisci Morales, Giovanni Petri

    Abstract: The bulk of the research effort on brain connectivity revolves around statistical associations among brain regions, which do not directly relate to the causal mechanisms governing brain dynamics. Here we propose the multiscale causal backbone (MCB) of brain dynamics, shared by a set of individuals across multiple temporal scales, and devise a principled methodology to extract it. Our approach le… ▽ More

    Submitted 19 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: Accepted at the 3rd conference on Causal Learning and Reasoning (CLeaR 2024)

  6. arXiv:2310.19951  [pdf, other

    cs.CY cs.SI physics.soc-ph

    Measuring Behavior Change with Observational Studies: a Review

    Authors: Arianna Pera, Gianmarco de Francisci Morales, Luca Maria Aiello

    Abstract: Exploring behavioral change in the digital age is imperative for societal progress in the context of 21st-century challenges. We analyzed 148 articles (2000-2023) and built a map that categorizes behaviors and change detection methodologies, platforms of reference, and theoretical frameworks that characterize online behavior change. Our findings uncover a focus on sentiment shifts, an emphasis on… ▽ More

    Submitted 2 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  7. Generating collective counterfactual explanations in score-based classification via mathematical optimization

    Authors: Emilio Carrizosa, Jasone Ramírez-Ayerbe, Dolores Romero Morales

    Abstract: Due to the increasing use of Machine Learning models in high stakes decision making settings, it has become increasingly important to have tools to understand how models arrive at decisions. Assuming a trained Supervised Classification model, explanations can be obtained via counterfactual analysis: a counterfactual explanation of an instance indicates how this instance should be minimally modifie… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: This research has been funded in part by research projects EC H2020 MSCA RISE NeEDS (Grant agreement ID: 822214), FQM-329, P18-FR-2369 and US-1381178 (Junta de Andalucía, Spain), and PID2019-110886RB-I00 and PID2022-137818OB-I00 (Ministerio de Ciencia, Innovación y Universidades, Spain). This support is gratefully acknowledged

    Journal ref: Expert Systems with Applications, 2024

  8. Systematic discrepancies in the delivery of political ads on Facebook and Instagram

    Authors: Dominik Bär, Francesco Pierri, Gianmarco De Francisci Morales, Stefan Feuerriegel

    Abstract: Political advertising on social media has become a central element in election campaigns. However, granular information about political advertising on social media was previously unavailable, thus raising concerns regarding fairness, accountability, and transparency in the electoral process. In this paper, we analyze targeted political advertising on social media via a unique, large-scale dataset… ▽ More

    Submitted 24 June, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: Accepted for publication at PNAS NEXUS. The first two authors contributed equally to this research

    Journal ref: Dominik Bär, Francesco Pierri, Gianmarco De Francisci Morales, Stefan Feuerriegel, Systematic discrepancies in the delivery of political ads on Facebook and Instagram, PNAS Nexus, 2024;, pgae247

  9. arXiv:2310.02766  [pdf, other

    cs.SI cs.CY

    Likelihood-Based Methods Improve Parameter Estimation in Opinion Dynamics Models

    Authors: Jacopo Lenti, Corrado Monti, Gianmarco De Francisci Morales

    Abstract: We show that a maximum likelihood approach for parameter estimation in agent-based models (ABMs) of opinion dynamics outperforms the typical simulation-based approach. Simulation-based approaches simulate the model repeatedly in search of a set of parameters that generates data similar enough to the observed one. In contrast, likelihood-based approaches derive a likelihood function that connects t… ▽ More

    Submitted 5 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

  10. arXiv:2309.08363  [pdf, other

    cs.CY cs.HC cs.SI

    Narratives of War: Ukrainian Memetic Warfare on Twitter

    Authors: Yelena Mejova, Arthur Capozzi, Corrado Monti, Gianmarco De Francisci Morales

    Abstract: The 2022 Russian invasion of Ukraine has seen an intensification in the use of social media by governmental actors in cyber warfare. Wartime communication via memes has been a successful strategy used not only by independent accounts such as @uamemesforces, but also-for the first time in a full-scale interstate war-by official Ukrainian government accounts such as @Ukraine and @DefenceU. We study… ▽ More

    Submitted 23 January, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    ACM Class: J.4; K.4

  11. arXiv:2308.10838  [pdf, other

    cs.SI physics.soc-ph

    An impossibility result for Markov Chain Monte Carlo sampling from micro-canonical bipartite graph ensembles

    Authors: Giulia Preti, Gianmarco De Francisci Morales, Matteo Riondato

    Abstract: Markov Chain Monte Carlo (MCMC) algorithms are commonly used to sample from graph ensembles. Two graphs are neighbors in the state space if one can be obtained from the other with only a few modifications, e.g., edge rewirings. For many common ensembles, e.g., those preserving the degree sequences of bipartite graphs, rewiring operations involving two edges are sufficient to create a fully-connect… ▽ More

    Submitted 19 April, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in Physical Review E

  12. arXiv:2306.02696  [pdf, other

    cs.DS

    Hyper-distance Oracles in Hypergraphs

    Authors: Giulia Preti, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: We study point-to-point distance estimation in hypergraphs, where the query is parameterized by a positive integer s, which defines the required level of overlap for two hyperedges to be considered adjacent. To answer s-distance queries, we first explore an oracle based on the line graph of the given hypergraph and discuss its limitations: the main one is that the line graph is typically orders of… ▽ More

    Submitted 19 March, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: To appear in VLDBJ

  13. arXiv:2303.12014  [pdf, other

    cs.SI cs.CY

    Authority without Care: Moral Values behind the Mask Mandate Response

    Authors: Yelena Mejova, Kyrieki Kalimeri, Gianmarco De Francisci Morales

    Abstract: Face masks are one of the cheapest and most effective non-pharmaceutical interventions available against airborne diseases such as COVID-19. Unfortunately, they have been met with resistance by a substantial fraction of the populace, especially in the U.S. In this study, we uncover the latent moral values that underpin the response to the mask mandate, and paint them against the country's politica… ▽ More

    Submitted 30 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

  14. arXiv:2302.07598  [pdf, other

    cs.CY cs.SI physics.soc-ph

    Evidence of Demographic rather than Ideological Segregation in News Discussion on Reddit

    Authors: Corrado Monti, Jacopo D'Ignazi, Michele Starnini, Gianmarco De Francisci Morales

    Abstract: We evaluate homophily and heterophily among ideological and demographic groups in a typical opinion formation context: online discussions of current news. We analyze user interactions across five years in the r/news community on Reddit, one of the most visited websites in the United States. Then, we estimate demographic and ideological attributes of these users. Thanks to a comparison with a caref… ▽ More

    Submitted 5 July, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Published at WWW '23

    ACM Class: J.4; K.4

    Journal ref: Proceedings of the ACM Web Conference 2023 (WWW '23), May 1-5, 2023, Austin, TX, USA. ACM

  15. The Thin Ideology of Populist Advertising on Facebook during the 2019 EU Elections

    Authors: Arthur Capozzi, Gianmarco De Francisci Morales, Yelena Mejova, Corrado Monti, André Panisson

    Abstract: Social media has been an important tool in the expansion of the populist message, and it is thought to have contributed to the electoral success of populist parties in the past decade. This study compares how populist parties advertised on Facebook during the 2019 European Parliamentary election. In particular, we examine commonalities and differences in which audiences they reach and on which iss… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Journal ref: In Proceedings of the ACM Web Conference 2023 (WWW '23), May 1-5, 2023, Austin, TX, USA. ACM, New York, NY, USA, 11 pages

  16. Supervised Feature Compression based on Counterfactual Analysis

    Authors: Veronica Piccialli, Dolores Romero Morales, Cecilia Salvatore

    Abstract: Counterfactual Explanations are becoming a de-facto standard in post-hoc interpretable machine learning. For a given classifier and an instance classified in an undesired class, its counterfactual explanation corresponds to small perturbations of that instance that allows changing the classification outcome. This work aims to leverage Counterfactual Explanations to detect the important decision bo… ▽ More

    Submitted 24 November, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 30 pages, 45figures

    Journal ref: European Journal of Operational Research, 2023

  17. arXiv:2210.17234  [pdf, other

    cs.CY physics.soc-ph

    The language of opinion change on social media under the lens of communicative action

    Authors: Corrado Monti, Luca Maria Aiello, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: Which messages are more effective at inducing a change of opinion in the listener? We approach this question within the frame of Habermas' theory of communicative action, which posits that the illocutionary intent of the message (its pragmatic meaning) is the key. Thanks to recent advances in natural language processing, we are able to operationalize this theory by extracting the latent social dim… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Main paper: 13 pages, 1 figure, 3 tables. Supplementary material: 9 pages, 6 figures, 8 tables

    ACM Class: H.4.0; K.4.0

    Journal ref: Nature Scientific Reports 12, 17920 (2022)

  18. arXiv:2208.14989  [pdf, other

    cs.LG stat.ME stat.ML

    Learning Multiscale Non-stationary Causal Structures

    Authors: Gabriele D'Acunto, Gianmarco De Francisci Morales, Paolo Bajardi, Francesco Bonchi

    Abstract: This paper addresses a gap in the current state of the art by providing a solution for modeling causal relationships that evolve over time and occur at different time scales. Specifically, we introduce the multiscale non-stationary directed acyclic graph (MN-DAG), a framework for modeling multivariate time series data. Our contribution is twofold. Firstly, we expose a probabilistic generative mode… ▽ More

    Submitted 17 November, 2023; v1 submitted 31 August, 2022; originally announced August 2022.

    Journal ref: Transactions on Machine Learning Research, 2023, ISSN 2835-8856

  19. arXiv:2207.12196  [pdf, other

    cs.SI cs.CY

    On the Relation Between Opinion Change and Information Consumption on Reddit

    Authors: Flavio Petruzzellis, Corrado Monti, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: While much attention has been devoted to the causes of opinion change, little is known about its consequences. Our study sheds a light on the relationship between one user's opinion change episode and subsequent behavioral change on an online social media, Reddit. In particular, we look at r/ChangeMyView, an online community dedicated to debating one's own opinions. Interestingly, this forum adopt… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: To appear in Proceedings of the International AAAI Conference on Web and Social Media (ICWSM 2023)

    ACM Class: J.4; K.4

  20. arXiv:2205.05052  [pdf, other

    physics.soc-ph cs.LG econ.EM

    On learning agent-based models from data

    Authors: Corrado Monti, Marco Pangallo, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: Agent-Based Models (ABMs) are used in several fields to study the evolution of complex systems from micro-level assumptions. However, ABMs typically can not estimate agent-specific (or "micro") variables: this is a major limitation which prevents ABMs from harnessing micro-level data availability and which greatly limits their predictive power. In this paper, we propose a protocol to learn the lat… ▽ More

    Submitted 23 November, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

  21. arXiv:2205.00308  [pdf, other

    cs.SI cs.CY

    Modeling Political Activism around Gun Debate via Social Media

    Authors: Yelena Mejova, Jisun An, Gianmarco De Francisci Morales, Haewoon Kwak

    Abstract: The United States have some of the highest rates of gun violence among developed countries. Yet, there is a disagreement about the extent to which firearms should be regulated. In this study, we employ social media signals to examine the predictors of offline political activism, at both population and individual level. We show that it is possible to classify the stance of users on the gun issue, e… ▽ More

    Submitted 30 April, 2022; originally announced May 2022.

    Journal ref: ACM Transactions on Social Computing. 2022

  22. FreSCo: Mining Frequent Patterns in Simplicial Complexes

    Authors: Giulia Preti, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: Simplicial complexes are a generalization of graphs that model higher-order relations. In this paper, we introduce simplicial patterns -- that we call simplets -- and generalize the task of frequent pattern mining from the realm of graphs to that of simplicial complexes. Our task is particularly challenging due to the enormous search space and the need for higher-order isomorphism. We show that fi… ▽ More

    Submitted 26 January, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: To appear at The Web Conference 2022

  23. The Evolving Causal Structure of Equity Risk Factors

    Authors: Gabriele D'Acunto, Paolo Bajardi, Francesco Bonchi, Gianmarco De Francisci Morales

    Abstract: In recent years, multi-factor strategies have gained increasing popularity in the financial industry, as they allow investors to have a better understanding of the risk drivers underlying their portfolios. Moreover, such strategies promise to promote diversification and thus limit losses in times of financial turmoil. However, recent studies have reported a significant level of redundancy between… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Journal ref: ACM International Conference on AI in Finance, 2021

  24. arXiv:2110.11952  [pdf, other

    stat.ML cs.LG math.OC

    Optimal randomized classification trees

    Authors: Rafael Blanquero, Emilio Carrizosa, Cristina Molero-Río, Dolores Romero Morales

    Abstract: Classification and Regression Trees (CARTs) are off-the-shelf techniques in modern Statistics and Machine Learning. CARTs are traditionally built by means of a greedy procedure, sequentially deciding the splitting predictor variable(s) and the associated threshold. This greedy approach trains trees very fast, but, by its nature, their classification accuracy may not be competitive against other st… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: This research has been financed in part by research projects EC H2020 MSCA RISE NeEDS (Grant agreement ID: 822214), FQM-329 and P18-FR-2369 (Junta de Andalucía), and PID2019-110886RB-I00 (Ministerio de Ciencia, Innovación y Universidades, Spain). This support is gratefully acknowledged

    Journal ref: Computers & Operations Research, 2021

  25. On Clustering Categories of Categorical Predictors in Generalized Linear Models

    Authors: Emilio Carrizosa, Marcela Galvis Restrepo, Dolores Romero Morales

    Abstract: We propose a method to reduce the complexity of Generalized Linear Models in the presence of categorical predictors. The traditional one-hot encoding, where each category is represented by a dummy variable, can be wasteful, difficult to interpret, and prone to overfitting, especially when dealing with high-cardinality categorical predictors. This paper addresses these challenges by finding a reduc… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Journal ref: CARRIZOSA, Emilio; GALVIS RESTREPO, Marcela; ROMERO MORALES, Dolores. On clustering categories of categorical predictors in generalized linear models. Expert Systems with Applications, 2021, p. 115245

  26. arXiv:2107.07361  [pdf, other

    physics.soc-ph cs.CY

    From Reddit to Wall Street: The role of committed minorities in financial collective action

    Authors: Lorenzo Lucchini, Luca Maria Aiello, Laura Alessandretti, Gianmarco De Francisci Morales, Michele Starnini, Andrea Baronchelli

    Abstract: In January 2021, retail investors coordinated on Reddit to target short selling activity by hedge funds on GameStop shares, causing a surge in the share price and triggering significant losses for the funds involved. Such an effective collective action was unprecedented in finance, and its dynamics remain unclear. Here, we analyse Reddit and financial data and rationalise the events based on recen… ▽ More

    Submitted 13 September, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: Main: 9 pages, 3 figures, 3 tables. Supplementary: 7 pages, 7 figures

  27. Clandestino or Rifugiato? Anti-immigration Facebook Ad Targeting in Italy

    Authors: Arthur Capozzi, Gianmarco De Francisci Morales, Yelena Mejova, Corrado Monti, André Panisson, Daniela Paolotti

    Abstract: Monitoring advertising around controversial issues is an important step in ensuring accountability and transparency of political processes. To that end, we use the Facebook Ads Library to collect 2312 migration-related advertising campaigns in Italy over one year. Our pro- and anti-immigration classifier (F1=0.85) reveals a partisan divide among the major Italian political parties, with anti-immig… ▽ More

    Submitted 16 March, 2021; originally announced March 2021.

    Comments: Published at CHI21

    ACM Class: J.4; K.4; I.7

    Journal ref: CHI Conference on Human Factors in Computing Systems (CHI '21), May 8-13, 2021, Yokohama, Japan. ACM

  28. STruD: Truss Decomposition of Simplicial Complexes

    Authors: Giulia Preti, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: A simplicial complex is a generalization of a graph: a collection of n-ary relationships (instead of binary as the edges of a graph), named simplices. In this paper, we develop a new tool to study the structure of simplicial complexes: we generalize the graph notion of truss decomposition to complexes, and show that this more powerful representation gives rise to different properties compared to t… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    ACM Class: G.2.2

    Journal ref: Proceedings of The Web Conference 2021

  29. arXiv:2102.05477  [pdf, other

    physics.soc-ph cs.SI

    No Echo in the Chambers of Political Interactions on Reddit

    Authors: Gianmarco De Francisci Morales, Corrado Monti, Michele Starnini

    Abstract: Echo chambers in online social networks, whereby users' beliefs are reinforced by interactions with like-minded peers and insulation from others' points of view, have been decried as a cause of political polarization. Here, we investigate their role in the debate around the 2016 US elections on Reddit, a fundamental platform for the success of Donald Trump. We identify Trump vs Clinton supporters… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

    ACM Class: J.4; K.4

    Journal ref: Scientific Reports volume 11, Article number: 2818 (2021)

  30. Playing to distraction: towards a robust training of CNN classifiers through visual explanation techniques

    Authors: David Morales, Estefania Talavera, Beatriz Remeseiro

    Abstract: The field of deep learning is evolving in different directions, with still the need for more efficient training strategies. In this work, we present a novel and robust training scheme that integrates visual explanation techniques in the learning process. Unlike the attention mechanisms that focus on the relevant parts of images, we aim to improve the robustness of the model by making it pay attent… ▽ More

    Submitted 29 July, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: 20 pages,3 figures, 4 tables

    Journal ref: Neural Comput & Applic (2021)

  31. arXiv:2010.04458  [pdf, other

    cs.CY

    Facebook Ads: Politics of Migration in Italy

    Authors: Arthur Capozzi, Gianmarco De Francisci Morales, Yelena Mejova, Corrado Monti, Andre Panisson, Daniela Paolotti

    Abstract: Targeted online advertising is on the forefront of political communication, allowing hyper-local advertising campaigns around elections and issues. In this study, we employ a new resource for political ad monitoring -- Facebook Ads Library -- to examine advertising concerning the issue of immigration in Italy. A crucial topic in Italian politics, it has recently been a focus of several populist mo… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    Report number: 978-3-030-60975-7

    Journal ref: Social Informatics 2020

  32. arXiv:2009.04088  [pdf, other

    astro-ph.IM cs.LG gr-qc physics.data-an

    Deep learning for gravitational-wave data analysis: A resampling white-box approach

    Authors: Manuel D. Morales, Javier M. Antelis, Claudia Moreno, Alexander I. Nesterov

    Abstract: In this work, we apply Convolutional Neural Networks (CNNs) to detect gravitational wave (GW) signals of compact binary coalescences, using single-interferometer data from LIGO detectors. As novel contribution, we adopted a resampling white-box approach to advance towards a statistical understanding of uncertainties intrinsic to CNNs in GW data analysis. Resampling is performed by repeated $k$-fol… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

    Comments: 29 pages, 28 figures

  33. arXiv:2006.01673  [pdf, other

    cs.SI cs.CY cs.LG

    Learning Opinion Dynamics From Social Traces

    Authors: Corrado Monti, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: Opinion dynamics - the research field dealing with how people's opinions form and evolve in a social context - traditionally uses agent-based models to validate the implications of sociological theories. These models encode the causal mechanism that drives the opinion formation process, and have the advantage of being easy to interpret. However, as they do not exploit the availability of data, the… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: Published at KDD2020

    ACM Class: J.4; G.3; I.6

    Journal ref: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD2020)

  34. Roots of Trumpism: Homophily and Social Feedback in Donald Trump Support on Reddit

    Authors: Joan Massachs, Corrado Monti, Gianmarco De Francisci Morales, Francesco Bonchi

    Abstract: We study the emergence of support for Donald Trump in Reddit's political discussion. With almost 800k subscribers, "r/The_Donald" is one of the largest communities on Reddit, and one of the main hubs for Trump supporters. It was created in 2015, shortly after Donald Trump began his presidential campaign. By using only data from 2012, we predict the likelihood of being a supporter of Donald Trump i… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: 10 pages. Published at WebSci20

    MSC Class: 91D30 ACM Class: J.4; K.4

    Journal ref: Proceedings of the 12th ACM Conference on Web Science (WebSci 2020)

  35. arXiv:2004.09603  [pdf, other

    physics.soc-ph cs.CY cs.SI

    Echo Chambers on Social Media: A comparative analysis

    Authors: Matteo Cinelli, Gianmarco De Francisci Morales, Alessandro Galeazzi, Walter Quattrociocchi, Michele Starnini

    Abstract: Recent studies have shown that online users tend to select information adhering to their system of beliefs, ignore information that does not, and join groups - i.e., echo chambers - around a shared narrative. Although a quantitative methodology for their identification is still missing, the phenomenon of echo chambers is widely debated both at scientific and political level. To shed light on this… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

  36. arXiv:2003.11906  [pdf, other

    cs.CY

    Falling into the Echo Chamber: the Italian Vaccination Debate on Twitter

    Authors: Alessandro Cossard, Gianmarco De Francisci Morales, Kyriaki Kalimeri, Yelena Mejova, Daniela Paolotti, Michele Starnini

    Abstract: The reappearance of measles in the US and Europe, a disease considered eliminated in early 2000s, has been accompanied by a growing debate on the merits of vaccination on social media. In this study we examine the extent to which the vaccination debate on Twitter is conductive to potential outreach to the vaccination hesitant. We focus on Italy, one of the countries most affected by the latest mea… ▽ More

    Submitted 26 March, 2020; originally announced March 2020.

    Journal ref: International AAAI Conference on Web and Social Media (ICWSM) 2020

  37. arXiv:2003.03604  [pdf, other

    cs.DC

    Aion: Better Late than Never in Event-Time Streams

    Authors: Sérgio Esteves, Gianmarco De Francisci Morales, Rodrigo Rodrigues, Marco Serafini, Luís Veiga

    Abstract: Processing data streams in near real-time is an increasingly important task. In the case of event-timestamped data, the stream processing system must promptly handle late events that arrive after the corresponding window has been processed. To enable this late processing, the window state must be maintained for a long period of time. However, current systems maintain this state in memory, which ei… ▽ More

    Submitted 22 April, 2020; v1 submitted 7 March, 2020; originally announced March 2020.

    Comments: 14 pages, 28 figures

  38. Sparsity in Optimal Randomized Classification Trees

    Authors: Rafael Blanquero, Emilio Carrizosa, Cristina Molero-Río, Dolores Romero Morales

    Abstract: Decision trees are popular Classification and Regression tools and, when small-sized, easy to interpret. Traditionally, a greedy approach has been used to build the trees, yielding a very fast training process; however, controlling sparsity (a proxy for interpretability) is challenging. In recent studies, optimal decision trees, where all decisions are optimized simultaneously, have shown a better… ▽ More

    Submitted 21 February, 2020; originally announced February 2020.

    Comments: This research has been financed in part by research projects EC H2020 Marie Skłodowska-Curie Actions, Research and Innovation Staff Exchange Network of European Data Scientists, NeEDS, Grant agreement ID 822214, COSECLA - Fundación BBVA, MTM2015-65915R, Spain, P11-FQM-7603 and FQM-329, Junta de Andalucía. This support is gratefully acknowledged. Available online 16 December 2019

    Journal ref: European Journal of Operational Research, 2019

  39. arXiv:1910.02001  [pdf, ps, other

    cs.CL cs.AI cs.SI

    Predicting the Role of Political Trolls in Social Media

    Authors: Atanas Atanasov, Gianmarco De Francisci Morales, Preslav Nakov

    Abstract: We investigate the political roles of "Internet trolls" in social media. Political trolls, such as the ones linked to the Russian Internet Research Agency (IRA), have recently gained enormous attention for their ability to sway public opinion and even influence elections. Analysis of the online traces of trolls has shown different behavioral patterns, which target different slices of the populatio… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: CoNLL-2019

  40. arXiv:1902.06679  [pdf, other

    cs.SI cs.LG stat.ML

    Link Prediction via Higher-Order Motif Features

    Authors: Ghadeer Abuoda, Gianmarco De Francisci Morales, Ashraf Aboulnaga

    Abstract: Link prediction requires predicting which new links are likely to appear in a graph. Being able to predict unseen links with good accuracy has important applications in several domains such as social media, security, transportation, and recommendation systems. A common approach is to use features based on the common neighbors of an unconnected pair of nodes to predict whether the pair will form a… ▽ More

    Submitted 4 June, 2020; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: Extended version of paper that appears in ECML/PKDD 2019

  41. arXiv:1809.00394  [pdf, other

    cs.DS

    Mining Frequent Patterns in Evolving Graphs

    Authors: Cigdem Aslay, Muhammad Anis Uddin Nasir, Gianmarco De Francisci Morales, Aristides Gionis

    Abstract: Given a labeled graph, the frequent-subgraph mining (FSM) problem asks to find all the $k$-vertex subgraphs that appear with frequency greater than a given threshold. FSM has numerous applications ranging from biology to network science, as it provides a compact summary of the characteristics of the graph. However, the task is challenging, even more so for evolving graphs due to the streaming natu… ▽ More

    Submitted 10 September, 2018; v1 submitted 2 September, 2018; originally announced September 2018.

    Comments: 10 pages, accepted at CIKM 2018

  42. arXiv:1805.11477  [pdf, other

    cs.DC

    Large-Scale Learning from Data Streams with Apache SAMOA

    Authors: Nicolas Kourtellis, Gianmarco De Francisci Morales, Albert Bifet

    Abstract: Apache SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. Big data is defined as datasets whose size is beyond the ability of typical software tools to capture, store, manage, and analyze, due to the time and memory complexity. Apache SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine le… ▽ More

    Submitted 26 May, 2018; originally announced May 2018.

    Comments: 31 pages, 7 Tables, 16 Figures, 26 References. arXiv admin note: substantial text overlap with arXiv:1607.08325

  43. arXiv:1802.02351  [pdf, other

    cs.OH

    Road Network Fusion for Incremental Map Updates

    Authors: Rade Stanojevic, Sofiane Abbar, Saravanan Thirumuruganathan, Gianmarco De Francisci Morales, Sanjay Chawla, Fethi Filali, Ahid Aleimat

    Abstract: In the recent years a number of novel, automatic map-inference techniques have been proposed, which derive road-network from a cohort of GPS traces collected by a fleet of vehicles. In spite of considerable attention, these maps are imperfect in many ways: they create an abundance of spurious connections, have poor coverage, and are visually confusing. Hence, commercial and crowd-sourced map** s… ▽ More

    Submitted 7 February, 2018; originally announced February 2018.

    Journal ref: In the special volume of Springer's Lecture Notes in Cartography and Geoinformation (LBS 2018.)

  44. arXiv:1801.01665  [pdf, other

    cs.SI

    Political Discourse on Social Media: Echo Chambers, Gatekeepers, and the Price of Bipartisanship

    Authors: Kiran Garimella, Gianmarco De Francisci Morales, Aristides Gionis, Michael Mathioudakis

    Abstract: Echo chambers, i.e., situations where one is exposed only to opinions that agree with their own, are an increasing concern for the political discourse in many democratic countries. This paper studies the phenomenon of political echo chambers on social media. We identify the two components in the phenomenon: the opinion that is shared ('echo'), and the place that allows its exposure ('chamber' ---… ▽ More

    Submitted 19 February, 2018; v1 submitted 5 January, 2018; originally announced January 2018.

    Comments: Published at The Web Conference 2018 (WWW2018). Please cite the WWW version

  45. arXiv:1705.06597  [pdf, other

    cs.SI

    Factors in Recommending Contrarian Content on Social Media

    Authors: Kiran Garimella, Gianmarco De Francisci Morales, Aristides Gionis, Michael Mathioudakis

    Abstract: Polarization is a troubling phenomenon that can lead to societal divisions and hurt the democratic process. It is therefore important to develop methods to reduce it. We propose an algorithmic solution to the problem of reducing polarization. The core idea is to expose users to content that challenges their point of view, with the hope broadening their perspective, and thus reduce their polarity… ▽ More

    Submitted 16 May, 2017; originally announced May 2017.

    Comments: accepted as a short paper at ACM WebScience 2017. arXiv admin note: substantial text overlap with arXiv:1703.10934

  46. arXiv:1705.05908  [pdf, other

    cs.SI physics.soc-ph

    The Effect of Collective Attention on Controversial Debates on Social Media

    Authors: Kiran Garimella, Gianmarco De Francisci Morales, Aristides Gionis, Michael Mathioudakis

    Abstract: We study the evolution of long-lived controversial debates as manifested on Twitter from 2011 to 2016. Specifically, we explore how the structure of interactions and content of discussion varies with the level of collective attention, as evidenced by the number of users discussing a topic. Spikes in the volume of users typically correspond to external events that increase the public attention on t… ▽ More

    Submitted 16 May, 2017; originally announced May 2017.

    Comments: accepted at ACM WebScience 2017

  47. arXiv:1703.10934  [pdf, other

    cs.SI

    Exposing Twitter Users to Contrarian News

    Authors: Kiran Garimella, Gianmarco De Francisci Morales, Aristides Gionis, Michael Mathioudakis

    Abstract: Polarized topics often spark discussion and debate on social media. Recent studies have shown that polarized debates have a specific clustered structure in the endorsement net- work, which indicates that users direct their endorsements mostly to ideas they already agree with. Understanding these polarized discussions and exposing social media users to content that broadens their views is of paramo… ▽ More

    Submitted 31 March, 2017; originally announced March 2017.

    Comments: Accepted as a demo at WWW 2017

  48. arXiv:1703.05994  [pdf, other

    cs.SI physics.soc-ph

    The Ebb and Flow of Controversial Debates on Social Media

    Authors: Kiran Garimella, Gianmarco De Francisci Morales, Aristides Gionis, Michael Mathioudakis

    Abstract: We explore how the polarization around controversial topics evolves on Twitter - over a long period of time (2011 to 2016), and also as a response to major external events that lead to increased related activity. We find that increased activity is typically associated with increased polarization; however, we find no consistent long-term trend in polarization over time among the topics we study.

    Submitted 17 March, 2017; originally announced March 2017.

    Comments: Accepted as a short paper at ICWSM 2017. Please cite the ICWSM version and not the ArXiv version

  49. arXiv:1611.00172  [pdf, other

    cs.SI physics.soc-ph

    Reducing Controversy by Connecting Opposing Views

    Authors: Kiran Garimella, Gianmarco De Francisci Morales, Aristides Gionis, Michael Mathioudakis

    Abstract: Society is often polarized by controversial issues, that split the population into groups of opposing views. When such issues emerge on social media, we often observe the creation of 'echo chambers', i.e., situations where like-minded people reinforce each other's opinion, but do not get exposed to the views of the opposing side. In this paper we study algorithmic techniques for bridging these cha… ▽ More

    Submitted 24 May, 2018; v1 submitted 1 November, 2016; originally announced November 2016.

    Comments: Proceedings of the Tenth ACM International Conference on Web Search and Data Mining (WSDM 2017)

  50. Fully Dynamic Algorithm for Top-$k$ Densest Subgraphs

    Authors: Muhammad Anis Uddin Nasir, Aristides Gionis, Gianmarco De Francisci Morales, Sarunas Girdzijauskas

    Abstract: Given a large graph, the densest-subgraph problem asks to find a subgraph with maximum average degree. When considering the top-$k$ version of this problem, a naïve solution is to iteratively find the densest subgraph and remove it in each iteration. However, such a solution is impractical due to high processing cost. The problem is further complicated when dealing with dynamic graphs, since addin… ▽ More

    Submitted 29 August, 2017; v1 submitted 19 October, 2016; originally announced October 2016.

    Comments: 10 pages, 8 figures, accepted at CIKM 2017