Skip to main content

Showing 1–4 of 4 results for author: Sebők, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:1605.02951  [pdf, other

    physics.soc-ph cs.SI

    Race, Religion and the City: Twitter Word Frequency Patterns Reveal Dominant Demographic Dimensions in the United States

    Authors: Eszter Bokányi, Dániel Kondor, László Dobos, Tamás Sebők, József Stéger, István Csabai, Gábor Vattay

    Abstract: Recently, numerous approaches have emerged in the social sciences to exploit the opportunities made possible by the vast amounts of data generated by online social networks (OSNs). Having access to information about users on such a scale opens up a range of possibilities, all without the limitations associated with often slow and expensive paper-based polls. A question that remains to be satisfact… ▽ More

    Submitted 11 May, 2016; v1 submitted 10 May, 2016; originally announced May 2016.

  2. arXiv:1311.1484  [pdf, ps, other

    physics.soc-ph cs.SI

    Regional properties of global communication as reflected in aggregated Twitter data

    Authors: Zsofia Kallus, Norbert Barankai, Daniel Kondor, Laszlo Dobos, Tamas Hanyecz, Janos Szule, Jozsef Steger, Tamas Sebok, Gabor Vattay, Istvan Csabai

    Abstract: Twitter is a popular public conversation platform with world-wide audience and diverse forms of connections between users. In this paper we introduce the concept of aggregated regional Twitter networks in order to characterize communication between geopolitical regions. We present the study of a follower and a mention graph created from an extensive data set collected during the second half of the… ▽ More

    Submitted 6 November, 2013; originally announced November 2013.

    Comments: 13 pages, 12 figures

  3. arXiv:1311.1169  [pdf, other

    cs.CL

    Using Robust PCA to estimate regional characteristics of language use from geo-tagged Twitter messages

    Authors: Dániel Kondor, István Csabai, László Dobos, János Szüle, Norbert Barankai, Tamás Hanyecz, Tamás Sebők, Zsófia Kallus, Gábor Vattay

    Abstract: Principal component analysis (PCA) and related techniques have been successfully employed in natural language processing. Text mining applications in the age of the online social media (OSM) face new challenges due to properties specific to these use cases (e.g. spelling issues specific to texts posted by users, the presence of spammers and bots, service announcements, etc.). In this paper, we emp… ▽ More

    Submitted 5 November, 2013; originally announced November 2013.

  4. arXiv:1311.0841  [pdf, other

    cs.DB

    A multi-terabyte relational database for geo-tagged social network data

    Authors: László Dobos, János Szüle, Tamás Bodnár, Tamás Hanyecz, Tamás Sebők, Dániel Kondor, Zsófia Kallus, József Stéger, István Csabai, Gábor Vattay

    Abstract: Despite their relatively low sampling factor, the freely available, randomly sampled status streams of Twitter are very useful sources of geographically embedded social network data. To statistically analyze the information Twitter provides via these streams, we have collected a year's worth of data and built a multi-terabyte relational database from it. The database is designed for fast data load… ▽ More

    Submitted 5 November, 2013; v1 submitted 4 November, 2013; originally announced November 2013.