Showing 1–15 of 15 results for author: Flöck, F

  1. arXiv:2109.07022  [pdf, other]

    cs.CY

    How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs?

    Authors: Indira Sen, Mattia Samory, Fabian Floeck, Claudia Wagner, Isabelle Augenstein

    Abstract: As NLP models are increasingly deployed in socially situated settings such as online abusive content detection, it is crucial to ensure that these models are robust. One way of improving model robustness is to generate counterfactually augmented data (CAD) for training models that can better learn to distinguish between core features and data artifacts. While models trained on this type of data ha…

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: Preprint of a paper accepted to EMNLP 2021

  2. arXiv:2010.03083  [pdf]

    cs.CY cs.DL

    'I Updated the <ref>': The Evolution of References in the English Wikipedia and the Implications for Altmetrics

    Authors: Olga Zagovora, Roberto Ulloa, Katrin Weller, Fabian Flöck

    Abstract: With this work, we present a publicly available dataset of the history of all the references (more than 55 million) ever used in the English Wikipedia until June 2019. We have applied a new method for identifying and monitoring references in Wikipedia, so that for each reference we can provide data about associated actions: creation, modifications, deletions, and reinsertions. The high accuracy of…

    Submitted 6 October, 2020; originally announced October 2020.

  3. arXiv:2004.12764  [pdf, other]

    cs.CY cs.CL cs.SI

    "Call me sexist, but...": Revisiting Sexism Detection Using Psychological Scales and Adversarial Samples

    Authors: Mattia Samory, Indira Sen, Julian Kohne, Fabian Floeck, Claudia Wagner

    Abstract: Research has focused on automated methods to effectively detect sexism online. Although overt sexism seems easy to spot, its subtle forms and manifold expressions are not. In this paper, we outline the different dimensions of sexism by grounding them in their implementation in psychological scales. From the scales, we derive a codebook for sexism in social media, which we use to annotate existing…

    Submitted 2 June, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: Indira Sen and Julian Kohne contributed equally to this work

    Journal ref: Proceedings of the 15th International AAAI Conference on Web and Social Media (ICWSM), 2021

  4. arXiv:1907.08228  [pdf, other]

    cs.CY cs.HC cs.SI

    TED-On: A Total Error Framework for Digital Traces of Human Behavior on Online Platforms

    Authors: Indira Sen, Fabian Floeck, Katrin Weller, Bernd Weiss, Claudia Wagner

    Abstract: People's activities and opinions recorded as digital traces online, especially on social media and other web-based platforms, offer increasingly informative pictures of the public. They promise to allow inferences about populations beyond the users of the platforms on which the traces are recorded, representing real potential for the Social Sciences and a complement to survey-based research. But t…

    Submitted 3 June, 2021; v1 submitted 18 July, 2019; originally announced July 2019.

    Comments: 20 pages, 2 figures, Longer version of paper set to appear in Public Opinion Quarterly. Updating terminology

  5. arXiv:1905.05961  [pdf, other]

    cs.CY cs.CL cs.CV cs.LG

    Demographic Inference and Representative Population Estimates from Multilingual Social Media Data

    Authors: Zijian Wang, Scott A. Hale, David Adelani, Przemyslaw A. Grabowicz, Timo Hartmann, Fabian Flöck, David Jurgens

    Abstract: Social media provide access to behavioural data at an unprecedented scale and granularity. However, using these data to understand phenomena in a broader population is difficult due to their non-representativeness and the bias of statistical inference tools towards dominant languages and groups. While demographic attribute inference could be used to mitigate such bias, current techniques are almos…

    Submitted 15 May, 2019; originally announced May 2019.

    Comments: 12 pages, 10 figures, Proceedings of the 2019 World Wide Web Conference (WWW '19)

    Journal ref: Proceedings of the 2019 World Wide Web Conference (WWW '19), May 13--17, 2019, San Francisco, CA, USA

  6. Characterizing the Global Crowd Workforce: A Cross-Country Comparison of Crowdworker Demographics

    Authors: Lisa Posch, Arnim Bleier, Fabian Flöck, Clemens M. Lechner, Katharina Kinder-Kurlanda, Denis Helic, Markus Strohmaier

    Abstract: Since its emergence roughly a decade ago, microtask crowdsourcing has been attracting a heterogeneous set of workers from all over the globe. This paper sets out to explore the characteristics of the international crowd workforce and offers a cross-national comparison of crowdworker populations from ten countries. We provide an analysis and comparison of demographic characteristics and shed light…

    Submitted 3 November, 2022; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: 36 pages, 20 figures, final version as published in Human Computation

    ACM Class: K.4

    Journal ref: Human Computation, 9(1), 22-57 (2022)

  7. Query for Architecture, Click through Military: Comparing the Roles of Search and Navigation on Wikipedia

    Authors: Dimitar Dimitrov, Florian Lemmerich, Fabian Flöck, Markus Strohmaier

    Abstract: As one of the richest sources of encyclopedic information on the Web, Wikipedia generates an enormous amount of traffic. In this paper, we study large-scale article access data of the English Wikipedia in order to compare articles with respect to the two main paradigms of information seeking, i.e., search by formulating a query, and navigation by following hyperlinks. To this end, we propose and e…

    Submitted 10 May, 2018; originally announced May 2018.

  8. arXiv:1711.03115  [pdf, other]

    cs.SI cs.CY cs.HC

    A Cross-Country Comparison of Crowdworker Motivations

    Authors: Lisa Posch, Arnim Bleier, Fabian Flöck, Markus Strohmaier

    Abstract: Crowd employment is a new form of short term employment that has been rapidly becoming a source of income for a vast number of people around the globe. It differs considerably from more traditional forms of work, yet similar ethical and optimization issues arise. One key to tackle such challenges is to understand what motivates the international crowd workforce. In this work, we study the motivati…

    Submitted 8 November, 2017; originally announced November 2017.

    Comments: 3rd Annual International Conference on Computational Social Science (IC2S2), 2017

  9. "(Weitergeleitet von Journalistin)": The Gendered Presentation of Professions on Wikipedia

    Authors: Olga Zagovora, Fabian Flöck, Claudia Wagner

    Abstract: Previous research has shown the existence of gender biases in the depiction of professions and occupations in search engine results. Such an unbalanced presentation might just as likely occur on Wikipedia, one of the most popular knowledge resources on the Web, since the encyclopedia has already been found to exhibit such tendencies in past studies. Under this premise, our work assesses gender bia…

    Submitted 12 June, 2017; originally announced June 2017.

    Comments: In the 9th International ACM Web Science Conference 2017 (WebSci'17), June 25-28, 2017, Troy, NY, USA. Based on the results of the thesis: arXiv:1702.00829

  10. arXiv:1703.08244  [pdf, other]

    cs.CL

    TokTrack: A Complete Token Provenance and Change Tracking Dataset for the English Wikipedia

    Authors: Fabian Flöck, Kenan Erdogan, Maribel Acosta

    Abstract: We present a dataset that contains every instance of all tokens (~ words) ever written in undeleted, non-redirect English Wikipedia articles until October 2016, in total 13,545,349,787 instances. Each token is annotated with (i) the article revision it was originally created in, and (ii) lists with all the revisions in which the token was ever deleted and (potentially) re-added and re-deleted from…

    Submitted 23 March, 2017; originally announced March 2017.

  11. arXiv:1702.01661  [pdf, other]

    cs.SI cs.CY cs.HC

    Measuring Motivations of Crowdworkers: The Multidimensional Crowdworker Motivation Scale

    Authors: Lisa Posch, Arnim Bleier, Clemens Lechner, Daniel Danner, Fabian Flöck, Markus Strohmaier

    Abstract: Crowd employment is a new form of short-term and flexible employment which has emerged during the past decade. In order to understand this new form of employment, it is crucial to illuminate the underlying motivations of the workforce involved in it. This paper introduces the Multidimensional Crowdworker Motivation Scale (MCMS), a scale for measuring the motivation of crowdworkers on micro-task pl…

    Submitted 15 March, 2019; v1 submitted 6 February, 2017; originally announced February 2017.

    Comments: 33 pages; added section; additional validation; corrected typos

  12. arXiv:1612.00985  [pdf, other]

    cs.HC cs.CY

    Wikiwhere: An interactive tool for studying the geographical provenance of Wikipedia references

    Authors: Martin Körner, Tatiana Sennikova, Florian Windhäuser, Claudia Wagner, Fabian Flöck

    Abstract: Wikipedia articles about the same topic in different language editions are built around different sources of information. For example, one can find very different news articles linked as references in the English Wikipedia article titled "Annexation of Crimea by the Russian Federation" than in its German counterpart (determined via Wikipedia's language links). Some of this difference can of course…

    Submitted 16 December, 2016; v1 submitted 3 December, 2016; originally announced December 2016.

    Comments: 4 pages, 2 tables, 1 figure

  13. arXiv:1503.02911  [pdf, other]

    cs.DB

    RDF-Hunter: Automatically Crowdsourcing the Execution of Queries Against RDF Data Sets

    Authors: Maribel Acosta, Elena Simperl, Fabian Flöck, Maria-Esther Vidal, Rudi Studer

    Abstract: In recent years, a large number of RDF data sets have become available on the Web. However, due to the semi-structured nature of RDF data, missing values affect answer completeness of queries that are posed against this data. To overcome this limitation, we propose RDF-Hunter, a novel hybrid query processing approach that brings together machine and human computation to execute queries against RD…

    Submitted 10 March, 2015; originally announced March 2015.

  14. arXiv:1411.4484  [pdf, other]

    cs.SI cs.CY physics.soc-ph

    Mining cross-cultural relations from Wikipedia - A study of 31 European food cultures

    Authors: Paul Laufer, Claudia Wagner, Fabian Flöck, Markus Strohmaier

    Abstract: For many people, Wikipedia represents one of the primary sources of knowledge about foreign cultures. Yet, different Wikipedia language editions offer different descriptions of cultural practices. Unveiling diverging representations of cultures provides an important insight, since they may foster the formation of cross-cultural stereotypes, misunderstandings and potentially even conflict. In this…

    Submitted 12 July, 2015; v1 submitted 17 November, 2014; originally announced November 2014.

  15. arXiv:1402.1386  [pdf, other]

    cs.SI cs.CY physics.soc-ph

    Evolution of Reddit: From the Front Page of the Internet to a Self-referential Community?

    Authors: Philipp Singer, Fabian Flöck, Clemens Meinhart, Elias Zeitfogel, Markus Strohmaier

    Abstract: In the past few years, Reddit -- a community-driven platform for submitting, commenting and rating links and text posts -- has grown exponentially, from a small community of users into one of the largest online communities on the Web. To the best of our knowledge, this work represents the most comprehensive longitudinal study of Reddit's evolution to date, studying both (i) how user submissions ha…

    Submitted 23 June, 2014; v1 submitted 6 February, 2014; originally announced February 2014.

    Comments: Published in the proceedings of WWW'14 companion

    ACM Class: H.3.5