
Showing 1–41 of 41 results for author: Stumme, G

Searching in archive cs.
  1. arXiv:2405.02342  [pdf, other]

    cs.DM cs.AI cs.LO

    The Birkhoff completion of finite lattices

    Authors: Mohammad Abdulla, Johannes Hirth, Gerd Stumme

    Abstract: We introduce the Birkhoff completion as the smallest distributive lattice in which a given finite lattice can be embedded as a semi-lattice. We discuss its relationship to implicational theories, in particular to R. Wille's simply-implicational theories. By an example, we show how the Birkhoff completion can be used as a tool for ordinal data science.

    Submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2404.18940  [pdf, ps, other]

    cs.SI cs.AI

    Conceptual Mapping of Controversies

    Authors: Claude Draude, Dominik Dürrschnabel, Johannes Hirth, Viktoria Horn, Jonathan Kropf, Jörn Lamla, Gerd Stumme, Markus Uhlmann

    Abstract: With our work, we contribute towards a qualitative analysis of the discourse on controversies in online news media. For this, we employ Formal Concept Analysis and the economics of conventions to derive conceptual controversy maps. In our experiments, we analyze two maps from different news journals with methods from ordinal data science. We show how these methods can be used to assess the diversi…

    Submitted 25 April, 2024; originally announced April 2024.

  3. arXiv:2307.09477  [pdf, other]

    cs.AI cs.DM cs.LG

    Towards Ordinal Data Science

    Authors: Gerd Stumme, Dominik Dürrschnabel, Tom Hanika

    Abstract: Order is one of the main instruments to measure the relationship between objects in (empirical) data. However, compared to methods that use numerical properties of objects, the amount of ordinal methods developed is rather small. One reason for this is the limited availability of computational resources in the last century that would have been required for ordinal computations. Another reason -- p…

    Submitted 6 December, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: 40 pages, 7 figures, Transactions on Graph Data and Knowledge (TGDK)

    MSC Class: 03G10; 06A15; 68T27; 68T30 ACM Class: G.2.3; F.4.1; H.5.0; I.2.4

    Journal ref: TGDK 1(1): 6:1-6:39 (2023)

  4. Automatic Textual Explanations of Concept Lattices

    Authors: Johannes Hirth, Viktoria Horn, Gerd Stumme, Tom Hanika

    Abstract: Lattices and their order diagrams are an essential tool for communicating knowledge and insights about data. This is in particular true when applying Formal Concept Analysis. Such representations, however, are difficult to comprehend by untrained users and in general in cases where lattices are large. We tackle this problem by automatically generating textual explanations for lattices using standa…

    Submitted 17 April, 2023; originally announced April 2023.

    MSC Class: 06A15 03G10 68T30 68T27 ACM Class: F.4.1; I.2.6

    Journal ref: ICCS 2023: 138-152

  5. Ordinal Motifs in Lattices

    Authors: Johannes Hirth, Viktoria Horn, Gerd Stumme, Tom Hanika

    Abstract: Lattices are a commonly used structure for the representation and analysis of relational and ontological knowledge. In particular, the analysis of these requires a decomposition of a large and high-dimensional lattice into a set of understandably large parts. With the present work we propose ordinal motifs as analytical units of meaning. We study these ordinal substructures (or standard scales)…

    Submitted 10 April, 2023; originally announced April 2023.

    MSC Class: 06A15 03G10 68T30 68T27 ACM Class: F.4.1; I.2.6

  6. arXiv:2304.03338  [pdf, ps, other]

    cs.AI cs.LG cs.LO math.CO

    Maximal Ordinal Two-Factorizations

    Authors: Dominik Dürrschnabel, Gerd Stumme

    Abstract: Given a formal context, an ordinal factor is a subset of its incidence relation that forms a chain in the concept lattice, i.e., a part of the dataset that corresponds to a linear order. To visualize the data in a formal context, Ganter and Glodeanu proposed a biplot based on two ordinal factors. For the biplot to be useful, it is important that these factors comprise as much data points as possib…

    Submitted 20 June, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: 15 pages, 6 figures, 2 algorithms, 28th International Conference on Conceptual Structures

    MSC Class: 06-08; 03G10; 90C27; 68R10; 06A07 ACM Class: I.2.4; G.2.1; F.2.2

  7. arXiv:2302.11554  [pdf, ps, other]

    cs.LG cs.AI cs.LO

    Greedy Discovery of Ordinal Factors

    Authors: Dominik Dürrschnabel, Gerd Stumme

    Abstract: In large datasets, it is hard to discover and analyze structure. It is thus common to introduce tags or keywords for the items. In applications, such datasets are then filtered based on these tags. Still, even medium-sized datasets with a few tags result in complex and for humans hard-to-navigate systems. In this work, we adopt the method of ordinal factor analysis to address this problem. An ordi…

    Submitted 19 February, 2023; originally announced February 2023.

    Comments: 11 pages, 6 figures, 2 tables, 3 algorithms

    MSC Class: 68T30; 03G10 ACM Class: I.2.6

  8. arXiv:2212.10208  [pdf, ps, other]

    cs.DM

    Factorizing Lattices by Interval Relations

    Authors: Maren Koyda, Gerd Stumme

    Abstract: This work investigates the factorization of finite lattices to implode selected intervals while preserving the remaining order structure. We examine how complete congruence relations and complete tolerance relations can be utilized for this purpose and answer the question of finding the finest of those relations to implode a given interval in the generated factor lattice. To overcome the limitatio…

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: 23 pages, 13 figures

    MSC Class: 06B05; 06B10; 68R01 ACM Class: G.2

  9. arXiv:2211.10446  [pdf, other]

    cs.AI math.CO

    Discovering Locally Maximal Bipartite Subgraphs

    Authors: Dominik Dürrschnabel, Tom Hanika, Gerd Stumme

    Abstract: Induced bipartite subgraphs of maximal vertex cardinality are an essential concept for the analysis of graphs. Yet, discovering them in large graphs is known to be computationally hard. Therefore, we consider in this work a weaker notion of this problem, where we discard the maximality constraint in favor of inclusion maximality. Thus, we aim to discover locally maximal bipartite subgraphs. For th…

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 12 pages, 3 figures, 3 tables

    MSC Class: 90C27 ACM Class: G.2.1; F.2.2

  10. Attribute Exploration with Multiple Contradicting Partial Experts

    Authors: Maximilian Felde, Gerd Stumme

    Abstract: Attribute exploration is a method from Formal Concept Analysis (FCA) that helps a domain expert discover structural dependencies in knowledge domains which can be represented as formal contexts (cross tables of objects and attributes). In this paper we present an extension of attribute exploration that allows for a group of domain experts and explores their shared views. Each expert has their own…

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: 22 pages (14 pages + 8 pages appendix)

  11. arXiv:2204.11859  [pdf, other]

    cs.DL cs.IR cs.LG

    Mapping Research Trajectories

    Authors: Bastian Schäfermeier, Gerd Stumme, Tom Hanika

    Abstract: Steadily growing amounts of information, such as annually published scientific papers, have become so large that they elude an extensive manual analysis. Hence, to maintain an overview, automated methods for the mapping and visualization of knowledge domains are necessary and important, e.g., for scientific decision makers. Of particular interest in this field is the development of research topics…

    Submitted 25 April, 2022; originally announced April 2022.

    MSC Class: 68U35; 68U15

  12. The Mont Blanc of Twitter: Identifying Hierarchies of Outstanding Peaks in Social Networks

    Authors: Maximilian Stubbemann, Gerd Stumme

    Abstract: The investigation of social networks is often hindered by their size as such networks often consist of at least thousands of vertices and edges. Hence, it is of major interest to derive compact structures that represent important connections of the original network. In this work, we derive such structures with orometric methods that are originally designed to identify outstanding mountain peaks an…

    Submitted 27 September, 2023; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: 15 pages, 3 figures, 2 tables. Accepted to ECML/PKDD 2023. Final version available at https://link.springer.com/chapter/10.1007/978-3-031-43418-1_11

  13. arXiv:2109.11343  [pdf, other]

    cs.IR cs.AI

    Towards Explainable Scientific Venue Recommendations

    Authors: Bastian Schäfermeier, Gerd Stumme, Tom Hanika

    Abstract: Selecting the best scientific venue (i.e., conference/journal) for the submission of a research article constitutes a multifaceted challenge. Important aspects to consider are the suitability of research topics, a venue's prestige, and the probability of acceptance. The selection problem is exacerbated through the continuous emergence of additional venues. Previously proposed approaches for suppor…

    Submitted 21 September, 2021; originally announced September 2021.

  14. LG4AV: Combining Language Models and Graph Neural Networks for Author Verification

    Authors: Maximilian Stubbemann, Gerd Stumme

    Abstract: The automatic verification of document authorships is important in various settings. Researchers are for example judged and compared by the amount and impact of their publications and public figures are confronted by their posts on social media platforms. Therefore, it is important that authorship information in frequently used web services and platforms is correct. The question whether a given do…

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: 9 pages, 1 figure

    ACM Class: I.2.7; I.2.6

  15. arXiv:2106.10978  [pdf, other]

    cs.AI cs.LG

    Attribute Selection using Contranominal Scales

    Authors: Dominik Dürrschnabel, Maren Koyda, Gerd Stumme

    Abstract: Formal Concept Analysis (FCA) allows to analyze binary data by deriving concepts and ordering them in lattices. One of the main goals of FCA is to enable humans to comprehend the information that is encapsulated in the data; however, the large size of concept lattices is a limiting factor for the feasibility of understanding the underlying structural properties. The size of such a lattice depends…

    Submitted 1 July, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: 17 pages, 2 figures, 3 tables, 1 algorithm, 26th International Conference on Conceptual Structures

    MSC Class: 68T30; 03G10 ACM Class: I.2.6

  16. arXiv:2106.09789  [pdf, other]

    cs.NI cs.LG eess.SP

    Topological Indoor Mapping through WiFi Signals

    Authors: Bastian Schaefermeier, Gerd Stumme, Tom Hanika

    Abstract: The ubiquitous presence of WiFi access points and mobile devices capable of measuring WiFi signal strengths allow for real-world applications in indoor localization and mapping. In particular, no additional infrastructure is required. Previous approaches in this field were, however, often hindered by problems such as effortful map-building processes, changing environments and hardware differences.…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 18 pages

  17. arXiv:2104.07159  [pdf, ps, other]

    cs.DM cs.LO

    Boolean Substructures in Formal Concept Analysis

    Authors: Maren Koyda, Gerd Stumme

    Abstract: It is known that a (concept) lattice contains an n-dimensional Boolean suborder if and only if the context contains an n-dimensional contra-nominal scale as subcontext. In this work, we investigate more closely the interplay between the Boolean subcontexts of a given finite context and the Boolean suborders of its concept lattice. To this end, we define mappings from the set of subcontexts of a co…

    Submitted 14 April, 2021; originally announced April 2021.

    MSC Class: 03G10 (Primary) 06B05 (Secondary) ACM Class: E.1; D.2

  18. arXiv:2102.02684  [pdf, other]

    cs.CG math.CO

    Force-Directed Layout of Order Diagrams using Dimensional Reduction

    Authors: Dominik Dürrschnabel, Gerd Stumme

    Abstract: Order diagrams allow human analysts to understand and analyze structural properties of ordered data. While an experienced expert can create easily readable order diagrams, the automatic generation of those remains a hard task. In this work, we adapt force-directed approaches, which are known to generate aesthetically-pleasing drawings of graphs, to the realm of order diagrams. Our algorithm ReDraw…

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: 16 pages, 6 figures, 4 algorithms, for source code refer to https://github.com/domduerr/redraw

    MSC Class: 68R10; 06A07 ACM Class: G.2.2

  19. Triadic Exploration and Exploration with Multiple Experts

    Authors: Maximilian Felde, Gerd Stumme

    Abstract: Formal Concept Analysis (FCA) provides a method called attribute exploration which helps a domain expert discover structural dependencies in knowledge domains that can be represented by a formal context (a cross table of objects and attributes). Triadic Concept Analysis is an extension of FCA that incorporates the notion of conditions. Many extensions and variants of attribute exploration have bee…

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: 16 pages, 5 figures

  20. Topic Space Trajectories: A case study on machine learning literature

    Authors: Bastian Schäfermeier, Gerd Stumme, Tom Hanika

    Abstract: The annual number of publications at scientific venues, for example, conferences and journals, is growing quickly. Hence, even for researchers it becomes harder and harder to keep track of research topics and their progress. In this task, researchers can be supported by automated publication analysis. Yet, many such methods result in uninterpretable, purely numerical representations. As an attempt…

    Submitted 18 May, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: 41 pages, 8 figures

    ACM Class: I.2.7; I.2.6; H.3.1; H.3.7

    Journal ref: Scientometrics (2021)

  21. Interactive Collaborative Exploration using Incomplete Contexts

    Authors: Maximilian Felde, Gerd Stumme

    Abstract: A well-known knowledge acquisition method in the field of Formal Concept Analysis (FCA) is attribute exploration. It is used to reveal dependencies in a set of attributes with help of a domain expert. In most applications no single expert is capable (time- and knowledge-wise) of exploring the knowledge domain alone. However, there is up to now no theory that models the interaction of multiple expe…

    Submitted 31 January, 2020; v1 submitted 23 August, 2019; originally announced August 2019.

    Comments: 38 pages (31 pages + 7 pages appendix), 16 figures

  22. Orometric Methods in Bounded Metric Data

    Authors: Maximilian Stubbemann, Tom Hanika, Gerd Stumme

    Abstract: A large amount of data accommodated in knowledge graphs (KG) is actually metric. For example, the Wikidata KG contains a plenitude of metric facts about geographic entities like cities, chemical compounds or celestial objects. In this paper, we propose a novel approach that transfers orometric (topographic) measures to bounded metric spaces. While these methods were originally designed to identify…

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: 8 Pages, 1 figure

    MSC Class: 68T99 ACM Class: I.5.2

  23. arXiv:1906.06208  [pdf, other]

    cs.CG math.CO

    Drawing Order Diagrams Through Two-Dimension Extension

    Authors: Dominik Dürrschnabel, Tom Hanika, Gerd Stumme

    Abstract: Order diagrams are an important tool to visualize the complex structure of ordered sets. Favorable drawings of order diagrams, i.e., easily readable for humans, are hard to come by, even for small ordered sets. Many attempts were made to transfer classical graph drawing approaches to order diagrams. Although these methods produce satisfying results for some ordered sets, they unfortunately perform…

    Submitted 14 June, 2019; originally announced June 2019.

    Comments: 16 pages, 12 Figures

    MSC Class: 68R10 06A07

    Journal ref: J. Graph Algorithms Appl. 27(9): 783-802 (2023)

  24. arXiv:1905.07264  [pdf, ps, other]

    cs.LG cs.AI

    Collaborative Interactive Learning -- A clarification of terms and a differentiation from other research fields

    Authors: Tom Hanika, Marek Herde, Jochen Kuhn, Jan Marco Leimeister, Paul Lukowicz, Sarah Oeste-Reiß, Albrecht Schmidt, Bernhard Sick, Gerd Stumme, Sven Tomforde, Katharina Anna Zweig

    Abstract: The field of collaborative interactive learning (CIL) aims at developing and investigating the technological foundations for a new generation of smart systems that support humans in their everyday life. While the concept of CIL has already been carved out in detail (including the fields of dedicated CIL and opportunistic CIL) and many research objectives have been stated, there is still the need t…

    Submitted 16 May, 2019; originally announced May 2019.

  25. arXiv:1903.00686  [pdf, other]

    cs.CG cs.AI cs.DM

    DimDraw -- A novel tool for drawing concept lattices

    Authors: Dominik Dürrschnabel, Tom Hanika, Gerd Stumme

    Abstract: Concept lattice drawings are an important tool to visualize complex relations in data in a simple manner to human readers. Many attempts were made to transfer classical graph drawing approaches to order diagrams. Although those methods are satisfying for some lattices they unfortunately perform poorly in general. In this work we present a novel tool to draw concept lattices that is purely motivate…

    Submitted 2 March, 2019; originally announced March 2019.

    Comments: 4 pages, 4 figures

    MSC Class: 68R10 68T30 03G10 68T05

  26. Discovering Implicational Knowledge in Wikidata

    Authors: Tom Hanika, Maximilian Marx, Gerd Stumme

    Abstract: Knowledge graphs have recently become the state-of-the-art tool for representing the diverse and complex knowledge of the world. Examples include the proprietary knowledge graphs of companies such as Google, Facebook, IBM, or Microsoft, but also freely available ones such as YAGO, DBpedia, and Wikidata. A distinguishing feature of Wikidata is that the knowledge is collaboratively edited and curate…

    Submitted 3 February, 2019; originally announced February 2019.

    MSC Class: 68T30 03G10 68T27

  27. Relevant Attributes in Formal Contexts

    Authors: Tom Hanika, Maren Koyda, Gerd Stumme

    Abstract: Computing conceptual structures, like formal concept lattices, is in the age of massive data sets a challenging task. There are various approaches to deal with this, e.g., random sampling, parallelization, or attribute extraction. A so far not investigated method in the realm of formal concept analysis is attribute selection, as done in machine learning. Building up on this we introduce a method f…

    Submitted 20 December, 2018; originally announced December 2018.

    Comments: 14 pages, 5 figures

    MSC Class: 68T30 03G10 ACM Class: I.2.6

  28. arXiv:1809.07405  [pdf, other]

    cs.LG cs.CV stat.ML

    Distances for WiFi Based Topological Indoor Mapping

    Authors: Bastian Schäfermeier, Tom Hanika, Gerd Stumme

    Abstract: For localization and mapping of indoor environments through WiFi signals, locations are often represented as likelihoods of the received signal strength indicator. In this work we compare various measures of distance between such likelihoods in combination with different methods for estimation and representation. In particular, we show that among the considered distance measures the Earth Mover's…

    Submitted 19 September, 2018; originally announced September 2018.

    Comments: 10 pages, 6 figures

  29. arXiv:1805.05714  [pdf, other]

    cs.AI

    Intrinsic dimension and its application to association rules

    Authors: Tom Hanika, Friedrich Martin Schneider, Gerd Stumme

    Abstract: The curse of dimensionality in the realm of association rules is twofold. Firstly, we have the well known exponential increase in computational complexity with increasing item set size. Secondly, there is a related curse concerned with the distribution of (sparse) data itself in high dimension. The former problem is often coped with by projection, i.e., feature selection, whereas the best kn…

    Submitted 15 May, 2018; originally announced May 2018.

    Comments: 4 pages, 1 figure

    MSC Class: 68T01 68T05 ACM Class: I.2.6

  30. Clones in Graphs

    Authors: Stephan Doerfel, Tom Hanika, Gerd Stumme

    Abstract: Finding structural similarities in graph data, like social networks, is a far-ranging task in data mining and knowledge discovery. A (conceptually) simple reduction would be to compute the automorphism group of a graph. However, this approach is ineffective in data mining since real world data does not exhibit enough structural regularity. Here we step in with a novel approach based on mappings th…

    Submitted 30 July, 2018; v1 submitted 21 February, 2018; originally announced February 2018.

    Comments: 11 pages, 2 figures, 1 table

    MSC Class: 03G10 91D30 ACM Class: G.2.1; G.2.2

  31. arXiv:1801.07985  [pdf, other]

    cs.AI cs.LG math.MG

    Intrinsic Dimension of Geometric Data Sets

    Authors: Tom Hanika, Friedrich Martin Schneider, Gerd Stumme

    Abstract: The curse of dimensionality is a phenomenon frequently observed in machine learning (ML) and knowledge discovery (KD). There is a large body of literature investigating its origin and impact, using methods from mathematics as well as from computer science. Among the mathematical insights into data dimensionality, there is an intimate link between the dimension curse and the phenomenon of measure c…

    Submitted 26 October, 2020; v1 submitted 24 January, 2018; originally announced January 2018.

    Comments: v3: 33 pages, 3 figures, 2 tables

    MSC Class: 03G10 51F99 68P05 68T01 ACM Class: I.2.6

    Journal ref: Tohoku Math. J. (2) 74 (2022) 23-52

  32. Adaptive kNN using Expected Accuracy for Classification of Geo-Spatial Data

    Authors: Mark Kibanov, Martin Becker, Juergen Mueller, Martin Atzmueller, Andreas Hotho, Gerd Stumme

    Abstract: The k-Nearest Neighbor (kNN) classification approach is conceptually simple - yet widely applied since it often performs well in practical applications. However, using a global constant k does not always provide an optimal solution, e.g., for datasets with an irregular density distribution of data points. This paper proposes an adaptive kNN classifier where k is chosen dynamically for each instanc…

    Submitted 14 December, 2017; originally announced January 2018.

  33. Mining Social Media to Inform Peatland Fire and Haze Disaster Management

    Authors: Mark Kibanov, Gerd Stumme, Imaduddin Amin, Jong Gun Lee

    Abstract: Peatland fires and haze events are disasters with national, regional and international implications. The phenomena lead to direct damage to local assets, as well as broader economic and environmental losses. Satellite imagery is still the main and often the only available source of information for disaster management. In this article, we test the potential of social media to assist disaster manage…

    Submitted 2 August, 2017; v1 submitted 16 June, 2017; originally announced June 2017.

  34. Predicting Rising Follower Counts on Twitter Using Profile Information

    Authors: Juergen Mueller, Gerd Stumme

    Abstract: When evaluating the cause of one's popularity on Twitter, one thing is considered to be the main driver: Many tweets. There is debate about the kind of tweet one should publish, but little beyond tweets. Of particular interest is the information provided by each Twitter user's profile page. One of the features are the given names on those profiles. Studies on psychology and economics identified co…

    Submitted 9 May, 2017; originally announced May 2017.

    Comments: 10 pages, 3 figures, 8 tables, WebSci '17, June 25--28, 2017, Troy, NY, USA

  35. Gender Inference using Statistical Name Characteristics in Twitter

    Authors: Juergen Mueller, Gerd Stumme

    Abstract: Much attention has been given to the task of gender inference of Twitter users. Although names are strong gender indicators, the names of Twitter users are rarely used as a feature; probably due to the high number of ill-formed names, which cannot be found in any name dictionary. Instead of relying solely on a name database, we propose a novel name classifier. Our approach extracts characteristics…

    Submitted 1 July, 2016; v1 submitted 17 June, 2016; originally announced June 2016.

    Comments: 9 pages (8 pages in actual proceedings), 2 figures, 8 tables, conference: MISNC, SI, DS '16, August 15 - 17, 2016, Union, NJ, USA

  36. arXiv:1407.2161  [pdf, other]

    cs.SI physics.soc-ph

    Link Prediction and the Role of Stronger Ties in Networks of Face-to-Face Proximity

    Authors: Christoph Scholz, Martin Atzmueller, Gerd Stumme

    Abstract: Understanding the structures why links are formed is an important and prominent research topic. In this paper, we therefore consider the link prediction problem in face-to-face contact networks, and analyze the predictability of new and recurring links. Furthermore, we study additional influence factors, and the role of stronger ties in these networks. Specifically, we compare neighborhood-based a…

    Submitted 8 July, 2014; originally announced July 2014.

  37. arXiv:1407.0613  [pdf, other]

    cs.SI physics.soc-ph

    On the Predictability of Talk Attendance at Academic Conferences

    Authors: Christoph Scholz, Jens Illig, Martin Atzmueller, Gerd Stumme

    Abstract: This paper focuses on the prediction of real-world talk attendances at academic conferences with respect to different influence factors. We study the predictability of talk attendances using real-world tracked face-to-face contacts. Furthermore, we investigate and discuss the predictive power of user interests extracted from the users' previous publications. We apply Hybrid Rooted PageRank, a stat…

    Submitted 2 July, 2014; originally announced July 2014.

  38. arXiv:1309.3888  [pdf, other]

    cs.SI physics.soc-ph

    User-Relatedness and Community Structure in Social Interaction Networks

    Authors: Folke Mitzlaff, Martin Atzmueller, Dominik Benz, Andreas Hotho, Gerd Stumme

    Abstract: With social media and the according social and ubiquitous applications finding their way into everyday life, there is a rapidly growing amount of user generated content yielding explicit and implicit network structures. We consider social activities and phenomena as proxies for user relatedness. Such activities are represented in so-called social interaction networks or evidence networks, with dif…

    Submitted 16 September, 2013; originally announced September 2013.

  39. arXiv:1303.0484  [pdf, other]

    cs.IR cs.SI physics.soc-ph

    Onomastics 2.0 - The Power of Social Co-Occurrences

    Authors: Folke Mitzlaff, Gerd Stumme

    Abstract: Onomastics is "the science or study of the origin and forms of proper names of persons or places." ["Onomastics". Merriam-Webster.com, 2013. http://www.merriam-webster.com (11 February 2013)]. Especially personal names play an important role in daily life, as all over the world future parents are facing the task of finding a suitable given name for their child. This choice is influenced by differe…

    Submitted 3 March, 2013; originally announced March 2013.

    Comments: Historically, this is the first paper on the analysis of names in the context of the name search engine 'nameling'. arXiv admin note: text overlap with arXiv:1302.4412

  40. arXiv:1302.4412  [pdf, other]

    cs.IR cs.SI physics.soc-ph

    Recommending Given Names

    Authors: Folke Mitzlaff, Gerd Stumme

    Abstract: All over the world, future parents are facing the task of finding a suitable given name for their child. This choice is influenced by different factors, such as the social context, language, cultural background and especially personal taste. Although this task is omnipresent, little research has been conducted on the analysis and application of interrelations among given names from a data mining p…

    Submitted 19 February, 2013; v1 submitted 18 February, 2013; originally announced February 2013.

    Comments: Baseline results for the ECML PKDD Discovery Challenge 2013

  41. arXiv:0805.2045  [pdf, other]

    cs.DL cs.IR

    Semantic Analysis of Tag Similarity Measures in Collaborative Tagging Systems

    Authors: Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme

    Abstract: Social bookmarking systems allow users to organise collections of resources on the Web in a collaborative fashion. The increasing popularity of these systems as well as first insights into their emergent semantics have made them relevant to disciplines like knowledge extraction and ontology learning. The problem of devising methods to measure the semantic relatedness between tags and characteriz…

    Submitted 14 May, 2008; originally announced May 2008.

    Comments: 5 pages, 2 figures

    ACM Class: H.3.5; G.2.2; H.1.2; H.1.m; H.5.3