
Showing 1–18 of 18 results for author: Bleier, A

  1. arXiv:2307.01918  [pdf, other]

    cs.CY

    Computational Reproducibility in Computational Social Science

    Authors: David Schoch, Chung-hong Chan, Claudia Wagner, Arnim Bleier

    Abstract: Replication crises have shaken the scientific landscape during the last decade. As potential solutions, open science practices were heavily discussed and have been implemented with varying success in different disciplines. We argue that computational-x disciplines such as computational social science, are also susceptible for the symptoms of the crises, but in terms of reproducibility. We expand t…

    Submitted 4 October, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: v1: Working Paper; v2: fixed missing citation in text; v3: fixed some minor errors and formatting; v4: shortened paper

  2. arXiv:2303.18200  [pdf, other]

    cs.CR cs.DC cs.LG

    PADME-SoSci: A Platform for Analytics and Distributed Machine Learning for the Social Sciences

    Authors: Zeyd Boukhers, Arnim Bleier, Yeliz Ucer Yediel, Mio Hienstorfer-Heitmann, Mehrshad Jaberansary, Adamantios Koumpis, Oya Beyan

    Abstract: Data privacy and ownership are significant in social data science, raising legal and ethical concerns. Sharing and analyzing data is difficult when different parties own different parts of it. An approach to this challenge is to apply de-identification or anonymization techniques to the data before collecting it for analysis. However, this can reduce data utility and increase the risk of re-identi…

    Submitted 3 April, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: accepted to be published @ ACM/IEEE JCDL 2023 - Joint Conference on Digital Libraries

  3. Characterizing the Global Crowd Workforce: A Cross-Country Comparison of Crowdworker Demographics

    Authors: Lisa Posch, Arnim Bleier, Fabian Flöck, Clemens M. Lechner, Katharina Kinder-Kurlanda, Denis Helic, Markus Strohmaier

    Abstract: Since its emergence roughly a decade ago, microtask crowdsourcing has been attracting a heterogeneous set of workers from all over the globe. This paper sets out to explore the characteristics of the international crowd workforce and offers a cross-national comparison of crowdworker populations from ten countries. We provide an analysis and comparison of demographic characteristics and shed light…

    Submitted 3 November, 2022; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: 36 pages, 20 figures, final version as published in Human Computation

    ACM Class: K.4

    Journal ref: Human Computation, 9(1), 22-57 (2022)

  4. arXiv:1805.11404  [pdf, other]

    cs.IR cs.CL

    iLCM - A Virtual Research Infrastructure for Large-Scale Qualitative Data

    Authors: Andreas Niekler, Arnim Bleier, Christian Kahmann, Lisa Posch, Gregor Wiedemann, Kenan Erdogan, Gerhard Heyer, Markus Strohmaier

    Abstract: The iLCM project pursues the development of an integrated research environment for the analysis of structured and unstructured data in a "Software as a Service" architecture (SaaS). The research environment addresses requirements for the quantitative evaluation of large amounts of qualitative data with text mining methods as well as requirements for the reproducibility of data-driven research desi…

    Submitted 11 May, 2018; originally announced May 2018.

    Comments: 11th edition of the Language Resources and Evaluation Conference (LREC)

  5. Systematically Monitoring Social Media: The case of the German federal election 2017

    Authors: Sebastian Stier, Arnim Bleier, Malte Bonart, Fabian Mörsheim, Mahdi Bohlouli, Margarita Nizhegorodov, Lisa Posch, Jürgen Maier, Tobias Rothmund, Steffen Staab

    Abstract: It is a considerable task to collect digital trace data at a large scale and at the same time adhere to established academic standards. In the context of political communication, important challenges are (1) defining the social media accounts and posts relevant to the campaign (content validity), (2) operationalizing the venues where relevant social media activity takes place (construct validity),…

    Submitted 9 April, 2018; originally announced April 2018.

    Comments: PID: http://nbn-resolving.de/urn:nbn:de:0168-ssoar-56149-4, GESIS Papers 2018|4

  6. arXiv:1801.08825  [pdf, other]

    cs.CY

    Election campaigning on social media: Politicians, audiences and the mediation of political communication on Facebook and Twitter

    Authors: Sebastian Stier, Arnim Bleier, Haiko Lietz, Markus Strohmaier

    Abstract: Although considerable research has concentrated on online campaigning, it is still unclear how politicians use different social media platforms in political communication. Focusing on the German federal election campaign 2013, this article investigates whether election candidates address the topics most important to the mass audience and to which extent their communication is shaped by the charact…

    Submitted 26 January, 2018; originally announced January 2018.

  7. arXiv:1711.03115  [pdf, other]

    cs.SI cs.CY cs.HC

    A Cross-Country Comparison of Crowdworker Motivations

    Authors: Lisa Posch, Arnim Bleier, Fabian Flöck, Markus Strohmaier

    Abstract: Crowd employment is a new form of short term employment that has been rapidly becoming a source of income for a vast number of people around the globe. It differs considerably from more traditional forms of work, yet similar ethical and optimization issues arise. One key to tackle such challenges is to understand what motivates the international crowd workforce. In this work, we study the motivati…

    Submitted 8 November, 2017; originally announced November 2017.

    Comments: 3rd Annual International Conference on Computational Social Science (IC2S2), 2017

  8. arXiv:1702.01661  [pdf, other]

    cs.SI cs.CY cs.HC

    Measuring Motivations of Crowdworkers: The Multidimensional Crowdworker Motivation Scale

    Authors: Lisa Posch, Arnim Bleier, Clemens Lechner, Daniel Danner, Fabian Flöck, Markus Strohmaier

    Abstract: Crowd employment is a new form of short-term and flexible employment which has emerged during the past decade. In order to understand this new form of employment, it is crucial to illuminate the underlying motivations of the workforce involved in it. This paper introduces the Multidimensional Crowdworker Motivation Scale (MCMS), a scale for measuring the motivation of crowdworkers on micro-task pl…

    Submitted 15 March, 2019; v1 submitted 6 February, 2017; originally announced February 2017.

    Comments: 33 pages; added section; additional validation; corrected typos

  9. arXiv:1701.03743  [pdf, other]

    cs.LG stat.ML

    Truncation-free Hybrid Inference for DPMM

    Authors: Arnim Bleier

    Abstract: Dirichlet process mixture models (DPMM) are a cornerstone of Bayesian non-parametrics. While these models free from choosing the number of components a-priori, computationally attractive variational inference often reintroduces the need to do so, via a truncation on the variational distribution. In this paper we present a truncation-free hybrid inference for DPMM, combining the advantages of sampl…

    Submitted 13 January, 2017; originally announced January 2017.

    Comments: NIPS 2016 Workshop: Advances in Approximate Bayesian Inference

  10. A System for Probabilistic Linking of Thesauri and Classification Systems

    Authors: Lisa Posch, Philipp Schaer, Arnim Bleier, Markus Strohmaier

    Abstract: This paper presents a system which creates and visualizes probabilistic semantic links between concepts in a thesaurus and classes in a classification system. For creating the links, we build on the Polylingual Labeled Topic Model (PLL-TM). PLL-TM identifies probable thesaurus descriptors for each class in the classification system by using information from the natural language text of documents,…

    Submitted 21 March, 2016; originally announced March 2016.

    Journal ref: KI - Künstliche Intelligenz, 2015

  11. The Polylingual Labeled Topic Model

    Authors: Lisa Posch, Arnim Bleier, Philipp Schaer, Markus Strohmaier

    Abstract: In this paper, we present the Polylingual Labeled Topic Model, a model which combines the characteristics of the existing Polylingual Topic Model and Labeled LDA. The model accounts for multiple languages with separate topic distributions for each language while restricting the permitted topics of a document to a set of predefined labels. We explore the properties of the model in a two-language se…

    Submitted 24 July, 2015; originally announced July 2015.

    Comments: Accepted for publication at KI 2015 (38th edition of the German Conference on Artificial Intelligence)

    ACM Class: G.3; I.2.7

  12. arXiv:1405.6824  [pdf, other]

    cs.SI cs.CY physics.soc-ph

    When Politicians Talk: Assessing Online Conversational Practices of Political Parties on Twitter

    Authors: Haiko Lietz, Claudia Wagner, Arnim Bleier, Markus Strohmaier

    Abstract: Assessing political conversations in social media requires a deeper understanding of the underlying practices and styles that drive these conversations. In this paper, we present a computational approach for assessing online conversational practices of political parties. Following a deductive approach, we devise a number of quantitative measures from a discussion of theoretical constructs in socio…

    Submitted 27 May, 2014; originally announced May 2014.

    Comments: 10 pages, 2 figures, 3 tables, Proc. 8th International AAAI Conference on Weblogs and Social Media (ICWSM 2014)

  13. arXiv:1312.4476  [pdf]

    cs.SI cs.CY

    Social Media Monitoring of the Campaigns for the 2013 German Bundestag Elections on Facebook and Twitter

    Authors: Lars Kaczmirek, Philipp Mayr, Ravi Vatrapu, Arnim Bleier, Manuela Blumenberg, Tobias Gummer, Abid Hussain, Katharina Kinder-Kurlanda, Kaveh Manshaei, Mark Thamm, Katrin Weller, Alexander Wenz, Christof Wolf

    Abstract: As more and more people use social media to communicate their view and perception of elections, researchers have increasingly been collecting and analyzing data from social media platforms. Our research focuses on social media communication related to the 2013 election of the German parliament [translation: Bundestagswahl 2013]. We constructed several social media datasets using data from Faceboo…

    Submitted 1 April, 2014; v1 submitted 16 December, 2013; originally announced December 2013.

    Comments: 29 pages, 2 figures, GESIS-Working Papers No. 31

  14. arXiv:1312.0412  [pdf, other]

    cs.LG

    Practical Collapsed Stochastic Variational Inference for the HDP

    Authors: Arnim Bleier

    Abstract: Recent advances have made it feasible to apply the stochastic variational paradigm to a collapsed representation of latent Dirichlet allocation (LDA). While the stochastic variational paradigm has successfully been applied to an uncollapsed representation of the hierarchical Dirichlet process (HDP), no attempts to apply this type of inference in a collapsed setting of non-parametric topic modeling…

    Submitted 2 December, 2013; originally announced December 2013.

    Comments: NIPS Workshop; Topic Models: Computation, Application, and Evaluation

  15. arXiv:1309.5256  [pdf]

    cs.DL

    Author Name Co-Mention Analysis: Testing a Poor Man's Author Co-Citation Analysis Method

    Authors: Andreas Strotmann, Arnim Bleier

    Abstract: As a social science information service for the German language countries, we document research projects, publications, and data in relevant fields. At the same time, we aim to provide well-founded bibliometric studies of these fields. Performing a citation analysis on an area of the German social sciences is, however, a serious challenge given the low and likely significantly biased coverage of t…

    Submitted 20 September, 2013; originally announced September 2013.

    Comments: 14th International Society of Scientometrics and Informetrics Conference

  16. arXiv:1305.1734  [pdf]

    cs.SI physics.soc-ph

    When Politicians Tweet: A Study on the Members of the German Federal Diet

    Authors: Mark Thamm, Arnim Bleier

    Abstract: In this preliminary study we compare the characteristics of retweets and replies on more than 350,000 messages collected by following members of the German Federal Diet on Twitter. We find significant differences in the characteristics pointing to distinct types of usages for retweets and replies. Using time series and regression analysis we observe that the likelihood of a politician using replie…

    Submitted 8 May, 2013; originally announced May 2013.

    Comments: 6 pages, ACM Web Science 2013

    ACM Class: H.1.2

  17. arXiv:1305.1343  [pdf, other]

    cs.DL cs.CL cs.IR

    Towards an Author-Topic-Term-Model Visualization of 100 Years of German Sociological Society Proceedings

    Authors: Arnim Bleier, Andreas Strotmann

    Abstract: Author co-citation studies employ factor analysis to reduce high-dimensional co-citation matrices to low-dimensional and possibly interpretable factors, but these studies do not use any information from the text bodies of publications. We hypothesise that term frequencies may yield useful information for scientometric analysis. In our work we ask if word features in combination with Bayesian analy…

    Submitted 6 May, 2013; originally announced May 2013.

    Comments: Accepted: 14th International Society of Scientometrics and Informetrics Conference, Vienna Austria 15-19th July 2013

  18. arXiv:1211.6248  [pdf, ps, other]

    cs.LG stat.ML

    A simple non-parametric Topic Mixture for Authors and Documents

    Authors: Arnim Bleier

    Abstract: This article reviews the Author-Topic Model and presents a new non-parametric extension based on the Hierarchical Dirichlet Process. The extension is especially suitable when no prior information about the number of components necessary is available. A blocked Gibbs sampler is described and focus put on staying as close as possible to the original model with only the minimum of theoretical and imp…

    Submitted 4 December, 2012; v1 submitted 27 November, 2012; originally announced November 2012.