Skip to main content

Showing 1–40 of 40 results for author: Kajdanowicz, T

.
  1. arXiv:2405.10745  [pdf, other

    cs.LG cs.AI cs.CL

    Empowering Small-Scale Knowledge Graphs: A Strategy of Leveraging General-Purpose Knowledge Graphs for Enriched Embeddings

    Authors: Albert Sawczyn, Jakub Binkowski, Piotr Bielak, Tomasz Kajdanowicz

    Abstract: Knowledge-intensive tasks pose a significant challenge for Machine Learning (ML) techniques. Commonly adopted methods, such as Large Language Models (LLMs), often exhibit limitations when applied to such tasks. Nevertheless, there have been notable endeavours to mitigate these challenges, with a significant emphasis on augmenting LLMs through Knowledge Graphs (KGs). While KGs provide many advantag… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted for LREC-COLING 2024

  2. arXiv:2402.17906  [pdf, other

    cs.LG cs.SI

    Representation learning in multiplex graphs: Where and how to fuse information?

    Authors: Piotr Bielak, Tomasz Kajdanowicz

    Abstract: In recent years, unsupervised and self-supervised graph representation learning has gained popularity in the research community. However, most proposed methods are focused on homogeneous networks, whereas real-world graphs often contain multiple node and edge types. Multiplex graphs, a special type of heterogeneous graphs, possess richer information, provide better modeling capabilities and integr… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  3. arXiv:2310.18080  [pdf, other

    cs.LG

    Unveiling the Potential of Probabilistic Embeddings in Self-Supervised Learning

    Authors: Denis Janiak, Jakub Binkowski, Piotr Bielak, Tomasz Kajdanowicz

    Abstract: In recent years, self-supervised learning has played a pivotal role in advancing machine learning by allowing models to acquire meaningful representations from unlabeled data. An intriguing research avenue involves develo** self-supervised models within an information-theoretic framework, but many studies often deviate from the stochasticity assumptions made when deriving their objectives. To ga… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Under review by AISTATS 2024

  4. Similarity-based Memory Enhanced Joint Entity and Relation Extraction

    Authors: Witold Kosciukiewicz, Mateusz Wojcik, Tomasz Kajdanowicz, Adam Gonczarek

    Abstract: Document-level joint entity and relation extraction is a challenging information extraction problem that requires a unified approach where a single neural network performs four sub-tasks: mention detection, coreference resolution, entity classification, and relation extraction. Existing methods often utilize a sequential multi-task learning approach, in which the arbitral decomposition causes the… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  5. arXiv:2307.07007  [pdf, other

    cs.CL

    Electoral Agitation Data Set: The Use Case of the Polish Election

    Authors: Mateusz Baran, Mateusz Wójcik, Piotr Kolebski, Michał Bernaczyk, Krzysztof Rajda, Łukasz Augustyniak, Tomasz Kajdanowicz

    Abstract: The popularity of social media makes politicians use it for political advertisement. Therefore, social media is full of electoral agitation (electioneering), especially during the election campaigns. The election administration cannot track the spread and quantity of messages that count as agitation under the election code. It addresses a crucial problem, while also uncovering a niche that has not… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 5 pages, 3 figures, Language Resources and Evaluation Conference

  6. arXiv:2307.05399  [pdf, other

    cs.LG cs.AI

    Domain-Agnostic Neural Architecture for Class Incremental Continual Learning in Document Processing Platform

    Authors: Mateusz Wójcik, Witold Kościukiewicz, Mateusz Baran, Tomasz Kajdanowicz, Adam Gonczarek

    Abstract: Production deployments in complex systems require ML architectures to be highly efficient and usable against multiple tasks. Particularly demanding are classification problems in which data arrives in a streaming fashion and each class is presented separately. Recent methods with stochastic gradient learning have been shown to struggle in such setups or have limitations like memory buffers, and be… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2211.14963

  7. arXiv:2306.07902  [pdf, other

    cs.CL cs.AI

    Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark

    Authors: Łukasz Augustyniak, Szymon Woźniak, Marcin Gruza, Piotr Gramacki, Krzysztof Rajda, Mikołaj Morzy, Tomasz Kajdanowicz

    Abstract: Despite impressive advancements in multilingual corpora collection and model training, develo** large-scale deployments of multilingual models still presents a significant challenge. This is particularly true for language tasks that are culture-dependent. One such example is the area of multilingual sentiment analysis, where affective markers can be subtle and deeply ensconced in culture. This w… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: submitted to NeurIPS 2023 Datasets and Benchmarks track. Dataset: https://huggingface.co/datasets/Brand24/mms Code: https://github.com/Brand24-AI/mms_benchmark

  8. arXiv:2303.02023  [pdf, other

    cs.LG

    Graph-level representations using ensemble-based readout functions

    Authors: Jakub Binkowski, Albert Sawczyn, Denis Janiak, Piotr Bielak, Tomasz Kajdanowicz

    Abstract: Graph machine learning models have been successfully deployed in a variety of application areas. One of the most prominent types of models - Graph Neural Networks (GNNs) - provides an elegant way of extracting expressive node-level representation vectors, which can be used to solve node-related problems, such as classifying users in a social network. However, many tasks require representations at… ▽ More

    Submitted 20 April, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: International Conference on Computational Science (ICCS) 2023

  9. arXiv:2303.01926  [pdf, other

    cs.LG

    RAFEN -- Regularized Alignment Framework for Embeddings of Nodes

    Authors: Kamil Tagowski, Piotr Bielak, Jakub Binkowski, Tomasz Kajdanowicz

    Abstract: Learning representations of nodes has been a crucial area of the graph machine learning research area. A well-defined node embedding model should reflect both node features and the graph structure in the final embedding. In the case of dynamic graphs, this problem becomes even more complex as both features and structure may change over time. The embeddings of particular nodes should remain compara… ▽ More

    Submitted 19 April, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: ICCS 2023

  10. arXiv:2211.14963  [pdf, other

    cs.LG

    Neural Architecture for Online Ensemble Continual Learning

    Authors: Mateusz Wójcik, Witold Kościukiewicz, Tomasz Kajdanowicz, Adam Gonczarek

    Abstract: Continual learning with an increasing number of classes is a challenging task. The difficulty rises when each example is presented exactly once, which requires the model to learn online. Recent methods with classic parameter optimization procedures have been shown to struggle in such setups or have limitations like non-differentiable components or memory buffers. For this reason, we present the fu… ▽ More

    Submitted 21 August, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Journal ref: https://meta-learn.github.io/2022/

  11. arXiv:2211.13112  [pdf, other

    cs.CL cs.IR cs.LG

    This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish

    Authors: Łukasz Augustyniak, Kamil Tagowski, Albert Sawczyn, Denis Janiak, Roman Bartusiak, Adrian Szymczak, Marcin Wątroba, Arkadiusz Janz, Piotr Szymański, Mikołaj Morzy, Tomasz Kajdanowicz, Maciej Piasecki

    Abstract: The availability of compute and data to train larger and larger language models increases the demand for robust methods of benchmarking the true progress of LM training. Recent years witnessed significant progress in standardized benchmarking for English. Benchmarks such as GLUE, SuperGLUE, or KILT have become de facto standard tools to compare large language models. Following the trend to replica… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 10 pages, 8 pages appendix

    Journal ref: Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS 2022) - https://lepiszcze.ml

  12. arXiv:2204.04937  [pdf, other

    cs.CL cs.LG

    Assessment of Massively Multilingual Sentiment Classifiers

    Authors: Krzysztof Rajda, Łukasz Augustyniak, Piotr Gramacki, Marcin Gruza, Szymon Woźniak, Tomasz Kajdanowicz

    Abstract: Models are increasing in size and complexity in the hunt for SOTA. But what if those 2\% increase in performance does not make a difference in a production use case? Maybe benefits from a smaller, faster model outweigh those slight performance gains. Also, equally good performance across languages in multilingual tasks is more important than SOTA results on a single one. We present the biggest, un… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted for WASSA at ACL 2022

  13. Dynamic pricing and discounts by means of interactive presentation systems in stationary point of sales

    Authors: Marcin Lewicki, Tomasz Kajdanowicz, Piotr Bródka, Janusz Sobecki

    Abstract: The main purpose of this article was to create a model and simulate the profitability conditions of an interactive presentation system (IPS) with the recommender system (RS) used in the kiosk. 90 million simulations have been run in Python with SymPy to address the problem of discount recommendation offered to the clients according to their usage of the IPS.

    Submitted 28 January, 2022; originally announced January 2022.

    Journal ref: In: ICCS 2021. Lecture Notes in Computer Science, vol 12745. Springer, Cham (2021)

  14. arXiv:2110.05573  [pdf, other

    cs.SI cs.CL cs.LG

    Spatial Data Mining of Public Transport Incidents reported in Social Media

    Authors: Kamil Raczycki, Marcin Szymański, Yahor Yeliseyenka, Piotr Szymański, Tomasz Kajdanowicz

    Abstract: Public transport agencies use social media as an essential tool for communicating mobility incidents to passengers. However, while the short term, day-to-day information about transport phenomena is usually posted in social media with low latency, its availability is short term as the content is rarely made an aggregated form. Social media communication of transport phenomena usually lacks GIS ann… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: Preprint, accepted to IWCTS at SIGSPATIAL'21

  15. Graph Barlow Twins: A self-supervised representation learning framework for graphs

    Authors: Piotr Bielak, Tomasz Kajdanowicz, Nitesh V. Chawla

    Abstract: The self-supervised learning (SSL) paradigm is an essential exploration area, which tries to eliminate the need for expensive data labeling. Despite the great success of SSL methods in computer vision and natural language processing, most of them employ contrastive learning objectives that require negative samples, which are hard to define. This becomes even more challenging in the case of graphs… ▽ More

    Submitted 12 September, 2023; v1 submitted 4 June, 2021; originally announced June 2021.

    Journal ref: Knowledge-Based Systems, Volume 256, 28 November 2022, 109631

  16. arXiv:2012.14727  [pdf, other

    cs.LG

    AttrE2vec: Unsupervised Attributed Edge Representation Learning

    Authors: Piotr Bielak, Tomasz Kajdanowicz, Nitesh V. Chawla

    Abstract: Representation learning has overcome the often arduous and manual featurization of networks through (unsupervised) feature learning as it results in embeddings that can apply to a variety of downstream learning tasks. The focus of representation learning on graphs has focused mainly on shallow (node-centric) or deep (graph-based) learning approaches. While there have been approaches that work on h… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

  17. arXiv:2006.10207  [pdf, other

    cs.CL cs.CC cs.IR cs.LG

    Political Advertising Dataset: the use case of the Polish 2020 Presidential Elections

    Authors: Łukasz Augustyniak, Krzysztof Rajda, Tomasz Kajdanowicz, Michał Bernaczyk

    Abstract: Political campaigns are full of political ads posted by candidates on social media. Political advertisements constitute a basic form of campaigning, subjected to various social requirements. We present the first publicly open dataset for detecting specific text chunks and categories of political advertising in the Polish language. It contains 1,705 human-annotated tweets tagged with nine categorie… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: ACL 2020 WiNLP Workshop - accepted

  18. arXiv:2006.09102  [pdf, other

    cs.CV cs.LG

    UCSG-Net -- Unsupervised Discovering of Constructive Solid Geometry Tree

    Authors: Kacper Kania, Maciej Zięba, Tomasz Kajdanowicz

    Abstract: Signed distance field (SDF) is a prominent implicit representation of 3D meshes. Methods that are based on such representation achieved state-of-the-art 3D shape reconstruction quality. However, these methods struggle to reconstruct non-convex shapes. One remedy is to incorporate a constructive solid geometry framework (CSG) that represents a shape as a decomposition into primitives. It allows to… ▽ More

    Submitted 20 October, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Accepted to Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020). Project page: https://kacperkan.github.io/ucsgnet. Project video: https://www.youtube.com/watch?v=s1p4UHtUG3g&feature=emb_title. Comments: 13 pages, 7 figures; apply reviewers' remarks, fix the reference to the CSG-Net work

  19. arXiv:1909.04917  [pdf, other

    cs.CL

    Comprehensive Analysis of Aspect Term Extraction Methods using Various Text Embeddings

    Authors: Łukasz Augustyniak, Tomasz Kajdanowicz, Przemysław Kazienko

    Abstract: Recently, a variety of model designs and methods have blossomed in the context of the sentiment analysis domain. However, there is still a lack of wide and comprehensive studies of aspect-based sentiment analysis (ABSA). We want to fill this gap and propose a comparison with ablation analysis of aspect term extraction using various text embedding methods. We particularly focused on architectures b… ▽ More

    Submitted 10 December, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

  20. Extracting Aspects Hierarchies using Rhetorical Structure Theory

    Authors: Łukasz Augustyniak, Tomasz Kajdanowicz, Przemysław Kazienko

    Abstract: We propose a novel approach to generate aspect hierarchies that proved to be consistently correct compared with human-generated hierarchies. We present an unsupervised technique using Rhetorical Structure Theory and graph analysis. We evaluated our approach based on 100,000 reviews from Amazon and achieved an astonishing 80% coverage compared with human-generated hierarchies coded in ConceptNet. T… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: ACAI 2018 MLNLP

    Journal ref: ACAI 2018 MLNLP

  21. Aspect Detection using Word and Char Embeddings with (Bi)LSTM and CRF

    Authors: Łukasz Augustyniak, Tomasz Kajdanowicz, Przemysław Kazienko

    Abstract: We proposed a~new accurate aspect extraction method that makes use of both word and character-based embeddings. We have conducted experiments of various models of aspect extraction using LSTM and BiLSTM including CRF enhancement on five different pre-trained word embeddings extended with character embeddings. The results revealed that BiLSTM outperforms regular LSTM, but also word embedding covera… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: IEEE AIKE

    Journal ref: 2019 IEEE Second International Conference on Artificial Intelligence and Knowledge Engineering (AIKE), Sardinia, Italy, 2019, pp. 43-50

  22. arXiv:1904.03423  [pdf, other

    stat.ML cs.LG cs.SI

    FILDNE: A Framework for Incremental Learning of Dynamic Networks Embeddings

    Authors: Piotr Bielak, Kamil Tagowski, Maciej Falkiewicz, Tomasz Kajdanowicz, Nitesh V. Chawla

    Abstract: Representation learning on graphs has emerged as a powerful mechanism to automate feature vector generation for downstream machine learning tasks. The advances in representation on graphs have centered on both homogeneous and heterogeneous graphs, where the latter presenting the challenges associated with multi-typed nodes and/or edges. In this paper, we consider the additional challenge of evolvi… ▽ More

    Submitted 19 November, 2020; v1 submitted 6 April, 2019; originally announced April 2019.

  23. arXiv:1812.02956  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    LNEMLC: Label Network Embeddings for Multi-Label Classification

    Authors: Piotr Szymański, Tomasz Kajdanowicz, Nitesh Chawla

    Abstract: Multi-label classification aims to classify instances with discrete non-exclusive labels. Most approaches on multi-label classification focus on effective adaptation or transformation of existing binary and multi-class learning approaches but fail in modelling the joint probability of labels or do not preserve generalization abilities for unseen label combinations. To address these issues we propo… ▽ More

    Submitted 1 January, 2019; v1 submitted 7 December, 2018; originally announced December 2018.

    Comments: submitted to TPAMI

  24. arXiv:1809.00094  [pdf, other

    cs.SI physics.soc-ph

    Graph Energies of Egocentric Networks and Their Correlation with Vertex Centrality Measures

    Authors: Mikołaj Morzy, Tomasz Kajdanowicz

    Abstract: Graph energy is the energy of the matrix representation of the graph, where the energy of a matrix is the sum of singular values of the matrix. Depending on the definition of a matrix, one can contemplate graph energy, Randić energy, Laplacian energy, distance energy, and many others. Although theoretical properties of various graph energies have been investigated in the past in the areas of mathe… ▽ More

    Submitted 12 November, 2018; v1 submitted 31 August, 2018; originally announced September 2018.

    Journal ref: Entropy 2018, 20(12), 916

  25. arXiv:1801.03327  [pdf, other

    cs.SI physics.soc-ph

    Priority Attachment: a Comprehensive Mechanism for Generating Networks

    Authors: Mikołaj Morzy, Tomasz Kajdanowicz, Przemysław Kazienko, Grzegorz Miebs, Arkadiusz Rusin

    Abstract: We claim that networks are created according to the priority attachment mechanism and we show a simple model which uses the priority attachment to generate both synthetic and close to empirical networks. Priority attachment is a mechanism which generalizes previously proposed mechanisms, such as small world creation or preferential attachment, but we also observe its presence in a range of real-wo… ▽ More

    Submitted 20 June, 2018; v1 submitted 10 January, 2018; originally announced January 2018.

    Journal ref: Scientific Reportsvolume 9, Article number: 3383 (2019)

  26. Method for Aspect-Based Sentiment Annotation Using Rhetorical Analysis

    Authors: Łukasz Augustyniak, Krzysztof Rajda, Tomasz Kajdanowicz

    Abstract: This paper fills a gap in aspect-based sentiment analysis and aims to present a new method for preparing and analysing texts concerning opinion and generating user-friendly descriptive reports in natural language. We present a comprehensive set of techniques derived from Rhetorical Structure Theory and sentiment analysis to extract aspects from textual opinions and then build an abstractive summar… ▽ More

    Submitted 13 September, 2017; originally announced September 2017.

    MSC Class: 68T50 ACM Class: I.2.7

  27. arXiv:1707.07913  [pdf, other

    cs.SI cs.CY physics.soc-ph

    Spatio-temporal profiling of public transport delays based on large scale vehicle positioning data from GPS in Wrocław

    Authors: Piotr Szymański, Michał Żołnieruk, Piotr Oleszczyk, Igor Gisterek, Tomasz Kajdanowicz

    Abstract: In recent years many studies of urban mobility based on large data sets have been published: most of them based on crowdsourced GPS data or smart-card data. We present, what is to our knowledge the first, exploration of public transport delay data harvested from a large-scale, official public transport positioning system, provided by the Wrocław Municipality. We evaluate the characteristics of del… ▽ More

    Submitted 25 July, 2017; originally announced July 2017.

    Comments: accepted to KnowME2017

  28. arXiv:1704.08756  [pdf, other

    stat.ML cs.LG stat.ME

    A Network Perspective on Stratification of Multi-Label Data

    Authors: Piotr Szymański, Tomasz Kajdanowicz

    Abstract: In the recent years, we have witnessed the development of multi-label classification methods which utilize the structure of the label space in a divide and conquer approach to improve classification performance and allow large data sets to be classified efficiently. Yet most of the available data sets have been provided in train/test splits that did not account for maintaining a distribution of hi… ▽ More

    Submitted 27 April, 2017; originally announced April 2017.

    Comments: submitted for ECML2017

  29. arXiv:1702.04013  [pdf, ps, other

    cs.LG stat.ML

    Is a Data-Driven Approach still Better than Random Choice with Naive Bayes classifiers?

    Authors: Piotr Szymański, Tomasz Kajdanowicz

    Abstract: We study the performance of data-driven, a priori and random approaches to label space partitioning for multi-label classification with a Gaussian Naive Bayes classifier. Experiments were performed on 12 benchmark data sets and evaluated on 5 established measures of classification quality: micro and macro averaged F1 score, Subset Accuracy and Hamming loss. Data-driven methods are significantly be… ▽ More

    Submitted 13 February, 2017; originally announced February 2017.

  30. arXiv:1702.01460  [pdf, other

    cs.LG cs.MS

    A scikit-based Python environment for performing multi-label classification

    Authors: Piotr Szymański, Tomasz Kajdanowicz

    Abstract: scikit-multilearn is a Python library for performing multi-label classification. The library is compatible with the scikit/scipy ecosystem and uses sparse matrices for all internal operations. It provides native Python implementations of popular multi-label classification methods alongside a novel framework for label space partitioning and division. It includes modern algorithm adaptation methods,… ▽ More

    Submitted 10 December, 2018; v1 submitted 5 February, 2017; originally announced February 2017.

  31. arXiv:1609.07526  [pdf, ps, other

    cs.SI physics.soc-ph

    Balancing Speed and Coverage by Sequential Seeding in Complex Networks

    Authors: Jarosław Jankowski, Piotr Bródka, Przemysław Kazienko, Boleslaw Szymanski, Radosław Michalski, Tomasz Kajdanowicz

    Abstract: Information spreading in complex networks is often modeled as diffusing information with certain probability from nodes that possess it to their neighbors that do not. Information cascades are triggered when the activation of a set of initial nodes (seeds) results in diffusion to large number of nodes. Here, several novel approaches for seed initiation that replace the commonly used activation of… ▽ More

    Submitted 12 January, 2017; v1 submitted 23 September, 2016; originally announced September 2016.

    Journal ref: Scientific Reports 7:891, April 18, 2017

  32. arXiv:1606.03335  [pdf, other

    cs.CL cs.AI cs.DC

    WordNet2Vec: Corpora Agnostic Word Vectorization Method

    Authors: Roman Bartusiak, Łukasz Augustyniak, Tomasz Kajdanowicz, Przemysław Kazienko, Maciej Piasecki

    Abstract: A complex nature of big data resources demands new methods for structuring especially for textual content. WordNet is a good knowledge source for comprehensive abstraction of natural language as its good implementations exist for many languages. Since WordNet embeds natural language in the form of a complex network, a transformation mechanism WordNet2Vec is proposed in the paper. It creates vector… ▽ More

    Submitted 10 June, 2016; originally announced June 2016.

    Comments: 29 pages, 16 figures, submitted to journal

  33. arXiv:1606.02346  [pdf, other

    cs.LG cs.PF cs.SI stat.ML

    How is a data-driven approach better than random choice in label space division for multi-label classification?

    Authors: Piotr Szymański, Tomasz Kajdanowicz, Kristian Kersting

    Abstract: We propose using five data-driven community detection approaches from social networks to partition the label space for the task of multi-label classification as an alternative to random partitioning into equal subsets as performed by RAkELd: modularity-maximizing fastgreedy and leading eigenvector, infomap, walktrap and label propagation algorithms. We construct a label co-occurence graph (both we… ▽ More

    Submitted 7 June, 2016; originally announced June 2016.

  34. arXiv:1510.01270  [pdf, other

    stat.ML cs.LG cs.SI

    Learning in Unlabeled Networks - An Active Learning and Inference Approach

    Authors: Tomasz Kajdanowicz, Radosław Michalski, Katarzyna Musiał, Przemysław Kazienko

    Abstract: The task of determining labels of all network nodes based on the knowledge about network structure and labels of some training subset of nodes is called the within-network classification. It may happen that none of the labels of the nodes is known and additionally there is no information about number of classes to which nodes can be assigned. In such a case a subset of nodes has to be selected for… ▽ More

    Submitted 5 October, 2015; originally announced October 2015.

    Journal ref: AI Communications, Vol. 29, No. 1, 2016, IOS Press

  35. arXiv:1405.0538  [pdf, other

    cs.SI physics.soc-ph

    Seed Selection for Spread of Influence in Social Networks: Temporal vs. Static Approach

    Authors: Radosław Michalski, Tomasz Kajdanowicz, Piotr Bródka, Przemysław Kazienko

    Abstract: The problem of finding optimal set of users for influencing others in the social network has been widely studied. Because it is NP-hard, some heuristics were proposed to find sub-optimal solutions. Still, one of the commonly used assumption is the one that seeds are chosen on the static network, not the dynamic one. This static approach is in fact far from the real-world networks, where new nodes… ▽ More

    Submitted 21 November, 2014; v1 submitted 2 May, 2014; originally announced May 2014.

    Journal ref: New Generation Computing, Vol. 32, Issue 3-4, pp. 213-235, 2014

  36. arXiv:1306.0326  [pdf, other

    cs.DC

    Parallel Processing of Large Graphs

    Authors: Tomasz Kajdanowicz, Przemyslaw Kazienko, Wojciech Indyk

    Abstract: More and more large data collections are gathered worldwide in various IT systems. Many of them possess the networked nature and need to be processed and analysed as graph structures. Due to their size they require very often usage of parallel paradigm for efficient computation. Three parallel techniques have been compared in the paper: MapReduce, its map-side join extension and Bulk Synchronous P… ▽ More

    Submitted 3 June, 2013; originally announced June 2013.

    Comments: Preprint submitted to Future Generation Computer Systems

    MSC Class: 65Y05 ACM Class: D.1.3

  37. arXiv:1304.1877  [pdf

    cs.DB cs.CR

    Privacy-preserving Data Mining, Sharing and Publishing

    Authors: Katarzyna Pasierb, Tomasz Kajdanowicz, Przemyslaw Kazienko

    Abstract: The goal of the paper is to present different approaches to privacy-preserving data sharing and publishing in the context of e-health care systems. In particular, the literature review on technical issues in privacy assurance and current real-life high complexity implementation of medical system that assumes proper data sharing mechanisms are presented in the paper.

    Submitted 6 April, 2013; originally announced April 2013.

    Journal ref: Journal of Medical Informatics & Technologies, Vol. 18, pp. 69-76, 2011

  38. arXiv:1303.0284  [pdf

    cs.SI cs.IR physics.soc-ph

    Social Recommendations within the Multimedia Sharing Systems

    Authors: Katarzyna Musial, Przemyslaw Kazienkol, Tomasz Kajdanowicz

    Abstract: The social recommender system that supports the creation of new relations between users in the multimedia sharing system is presented in the paper. To generate suggestions the new concept of the multirelational social network was introduced. It covers both direct as well as object-based relationships that reflect social and semantic links between users. The main goal of the new method is to create… ▽ More

    Submitted 1 March, 2013; originally announced March 2013.

    Comments: recommender system, multirelational social network, multimedia sharing system, social network analysis, Best Paper Award. arXiv admin note: text overlap with arXiv:1303.0093

    MSC Class: 91D30 ACM Class: H.3.4

    Journal ref: Musial K., Kazienko P., Kajdanowicz T.: Social Recommendations within the Multimedia Sharing Systems. The First World Summit on the Knowledge Society, WSKS'08, Lecture Notes in Computer Science LNCS 5288, 2008, pp. 364-372

  39. Label-dependent Feature Extraction in Social Networks for Node Classification

    Authors: Tomasz Kajdanowicz, Przemyslaw Kazienko, Piotr Doskocz

    Abstract: A new method of feature extraction in the social network for within-network classification is proposed in the paper. The method provides new features calculated by combination of both: network structure information and class labels assigned to nodes. The influence of various features on classification performance has also been studied. The experiments on real-world data have shown that features cr… ▽ More

    Submitted 1 March, 2013; originally announced March 2013.

    Comments: feature extraction, label-dependent features, classification, social network analysis, AMD social network

    MSC Class: 91D30; 68T05; 68T10 ACM Class: I.2.8; I.2.11

    Journal ref: Kajdanowicz T., Kazienko P., Doskocz P.: Label-dependent Feature Extraction in Social Networks for Node Classification. Lecture Notes in Artificial Intelligence LNAI 6430, Springer, 2010, pp. 89-102

  40. arXiv:1303.0093  [pdf

    cs.SI cs.IR physics.soc-ph

    Multidimensional Social Network in the Social Recommender System

    Authors: Przemyslaw Kazienko, Katarzyna Musial, Tomasz Kajdanowicz

    Abstract: All online sharing systems gather data that reflects users' collective behaviour and their shared activities. This data can be used to extract different kinds of relationships, which can be grouped into layers, and which are basic components of the multidimensional social network proposed in the paper. The layers are created on the basis of two types of relations between humans, i.e. direct and ob… ▽ More

    Submitted 1 March, 2013; originally announced March 2013.

    Comments: social recommender system;Multidimensional social network (MSN);Web 2.0;multi-layered social network;multimedia sharing system (MSS);recommender system;social network analysis

    MSC Class: 91D30 ACM Class: H.3.4; H.3.5

    Journal ref: Kazienko, P.; Musial, K.; Kajdanowicz, T.; , "Multidimensional Social Network in the Social Recommender System," Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on , vol.41, no.4, pp.746-759, July 2011