Skip to main content

Showing 1–22 of 22 results for author: Paprzycki, M

.
  1. arXiv:2407.02122  [pdf, ps, other

    cs.CL

    Fake News Detection: It's All in the Data!

    Authors: Soveatin Kuntur, Anna Wróblewska, Marcin Paprzycki, Maria Ganzha

    Abstract: This comprehensive survey serves as an indispensable resource for researchers embarking on the journey of fake news detection. By highlighting the pivotal role of dataset quality and diversity, it underscores the significance of these elements in the effectiveness and robustness of detection models. The survey meticulously outlines the key features of datasets, various labeling systems employed, a… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.14266  [pdf, other

    cs.AI cs.HC

    Intelligent Interface: Enhancing Lecture Engagement with Didactic Activity Summaries

    Authors: Anna Wróblewska, Marcel Witas, Kinga Frańczak, Arkadiusz Kniaź, Siew Ann Cheong, Tan Seng Chee, Janusz Hołyst, Marcin Paprzycki

    Abstract: Recently, multiple applications of machine learning have been introduced. They include various possibilities arising when image analysis methods are applied to, broadly understood, video streams. In this context, a novel tool, developed for academic educators to enhance the teaching process by automating, summarizing, and offering prompt feedback on conducting lectures, has been developed. The imp… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 9 pages, 6 figures

  3. arXiv:2311.14540  [pdf, other

    cs.DB cs.AI

    RDF Stream Taxonomy: Systematizing RDF Stream Types in Research and Practice

    Authors: Piotr Sowinski, Pawel Szmeja, Maria Ganzha, Marcin Paprzycki

    Abstract: Over the years, RDF streaming was explored in research and practice from many angles, resulting in a wide range of RDF stream definitions. This variety presents a major challenge in discussing and integrating streaming systems, due to the lack of a common language. This work attempts to address this critical research gap, by systematizing RDF stream types present in the literature in a novel taxon… ▽ More

    Submitted 27 June, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

  4. arXiv:2305.06226  [pdf, other

    cs.DB

    RiverBench: an Open RDF Streaming Benchmark Suite

    Authors: Piotr Sowinski, Maria Ganzha, Marcin Paprzycki

    Abstract: RDF streaming has been explored by the Semantic Web community from many angles, resulting in multiple task formulations and streaming methods. However, for many existing formulations of the problem, reliably benchmarking streaming solutions has been challenging due to the lack of well-described and appropriately diverse benchmark datasets. Existing datasets and evaluations, except a few notable ca… ▽ More

    Submitted 27 November, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: RiverBench is available online here: https://w3id.org/riverbench

  5. Application of genetic algorithm to load balancing in networks with a homogeneous traffic flow

    Authors: Marek Bolanowski, Alicja Gerka, Andrzej Paszkiewicz, Maria Ganzha, Marcin Paprzycki

    Abstract: The concept of extended cloud requires efficient network infrastructure to support ecosystems reaching form the edge to the cloud(s). Standard approaches to network load balancing deliver static solutions that are insufficient for the extended clouds, where network loads change often. To address this issue, a genetic algorithm based load optimizer is proposed and implemented. Next, its performance… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: Accepted for the conference -- The International Conference on Computational Science ICCS2023

    ACM Class: C.2.0

  6. Towards Edge-Cloud Architectures for Personal Protective Equipment Detection

    Authors: Jaroslaw Legierski, Kajetan Rachwal, Piotr Sowinski, Wojciech Niewolski, Przemyslaw Ratuszek, Zbigniew Kopertowski, Marcin Paprzycki, Maria Ganzha

    Abstract: Detecting Personal Protective Equipment in images and video streams is a relevant problem in ensuring the safety of construction workers. In this contribution, an architecture enabling live image recognition of such equipment is proposed. The solution is deployable in two settings -- edge-cloud and edge-only. The system was tested on an active construction site, as a part of a larger scenario, wit… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: Presented on the 4th International Conference on Information Management and Machine Intelligence (ICIMMI 2022). In print

    Journal ref: ICIMMI 2022: Proceedings of the 4th International Conference on Information Management & Machine Intelligence

  7. arXiv:2208.00682  [pdf

    cs.PF cs.DC cs.NI

    Eficiency of REST and gRPC realizing communication tasks in microservice-based ecosystems

    Authors: Marek Bolanowski, Kamil Żak, Andrzej Paszkiewicz, Maria Ganzha, Marcin Paprzycki, Piotr Sowiński, Ignacio Lacalle, Carlos E. Palau

    Abstract: The aim of this contribution is to analyse practical aspects of the use of REST APIs and gRPC to realize communication tasks in applications in microservice-based ecosystems. On the basis of performed experiments, classes of communication tasks, for which given technology performs data transfer more efficiently, have been established. This, in turn, allows formulation of criteria for the selection… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted for the conference -- The 21st International Conference on Intelligent Software Methodologies, Tools, and Techniques (SOMET 2022)

    ACM Class: C.2.4; C.4; D.4.4

  8. arXiv:2207.07700  [pdf, other

    cs.LG

    Introducing Federated Learning into Internet of Things ecosystems -- preliminary considerations

    Authors: Karolina Bogacka, Katarzyna Wasielewska-Michniewska, Marcin Paprzycki, Maria Ganzha, Anastasiya Danilenka, Lambis Tassakos, Eduardo Garro

    Abstract: Federated learning (FL) was proposed to facilitate the training of models in a distributed environment. It supports the protection of (local) data privacy and uses local resources for model training. Until now, the majority of research has been devoted to "core issues", such as adaptation of machine learning algorithms to FL, data privacy protection, or dealing with the effects of uneven data dist… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: Conference IEEE 8th World Forum on Internet of Things submission

  9. Efficient RDF Streaming for the Edge-Cloud Continuum

    Authors: Piotr Sowinski, Katarzyna Wasielewska-Michniewska, Maria Ganzha, Wieslaw Pawlowski, Pawel Szmeja, Marcin Paprzycki

    Abstract: With the ongoing, gradual shift of large-scale distributed systems towards the edge-cloud continuum, the need arises for software solutions that are universal, scalable, practical, and grounded in well-established technologies. Simultaneously, semantic technologies, especially in the streaming context, are becoming increasingly important for enabling interoperability in edge-cloud systems. However… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: In review -- submitted to the IEEE 8th World Forum on Internet of Things

    Journal ref: 2022 IEEE 8th World Forum on Internet of Things (WF-IoT)

  10. arXiv:2207.04103  [pdf, other

    cs.LG cs.CV

    StatMix: Data augmentation method that relies on image statistics in federated learning

    Authors: Dominik Lewy, Jacek Mańdziuk, Maria Ganzha, Marcin Paprzycki

    Abstract: Availability of large amount of annotated data is one of the pillars of deep learning success. Although numerous big datasets have been made available for research, this is often not the case in real life applications (e.g. companies are not able to share data due to GDPR or concerns related to intellectual property rights protection). Federated learning (FL) is a potential solution to this proble… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  11. arXiv:2206.08124  [pdf, other

    cs.LG

    Using adversarial images to improve outcomes of federated learning for non-IID data

    Authors: Anastasiya Danilenka, Maria Ganzha, Marcin Paprzycki, Jacek Mańdziuk

    Abstract: One of the important problems in federated learning is how to deal with unbalanced data. This contribution introduces a novel technique designed to deal with label skewed non-IID data, using adversarial inputs, created by the I-FGSM method. Adversarial inputs guide the training process and allow the Weighted Federated Averaging to give more importance to clients with 'selected' local label distrib… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  12. arXiv:2205.02892  [pdf, ps, other

    cs.SE cs.AI cs.CL

    Ontology Reuse: the Real Test of Ontological Design

    Authors: Piotr Sowinski, Katarzyna Wasielewska-Michniewska, Maria Ganzha, Marcin Paprzycki, Costin Badica

    Abstract: Reusing ontologies in practice is still very challenging, especially when multiple ontologies are (jointly) involved. Moreover, despite recent advances, the realization of systematic ontology quality assurance remains a difficult problem. In this work, the quality of thirty biomedical ontologies, and the Computer Science Ontology are investigated, from the perspective of a practical use case. Spec… ▽ More

    Submitted 6 July, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: Accepted into SOMET 2022 conference

    Journal ref: Frontiers in Artificial Intelligence and Applications 355 (2022) 631-645

  13. An Energy Aware Clustering Scheme for 5G-enabled Edge Computing based IoMT Framework

    Authors: Jitendra Kumar Samriya, Mohit Kumar, Maria Ganzha, Marcin Paprzycki, Marek Bolanowski, Andrzej Paszkiewicz

    Abstract: In recent years, 5G network systems start to offer communication infrastructure for Internet of Things (IoT) applications, especially for health care service pro-viders. In smart health care systems, edge computing enabled Internet of Medical Things (IoMT) is an innovative technology to provide online health care monitor-ing facility to patients. Here, energy consumption, along with extending the… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    ACM Class: C.2.0

  14. arXiv:2204.04515  [pdf, other

    cs.LG cs.AI cs.IR

    Applying machine learning to predict behavior of bus transport in Warsaw, Poland

    Authors: Łukasz Pałys, Maria Ganzha, Marcin Paprzycki

    Abstract: Nowadays, it is possible to collect precise data describing movements of public transport. Specifically, for each bus (or tram) geoposition data can be regularly collected. This includes data for all buses in Warsaw, Poland. Moreover, this data can be downloaded and analyzed. In this context, one of the simplest questions is: can a model be build to represent behavior of busses, and predict their… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: 18 pages, full version of paper for ICCS conference

  15. arXiv:2203.15158  [pdf, ps, other

    cs.LG cs.AI cs.CV

    Practical Aspects of Zero-Shot Learning

    Authors: Elie Saad, Marcin Paprzycki, Maria Ganzha

    Abstract: One of important areas of machine learning research is zero-shot learning. It is applied when properly labeled training data set is not available. A number of zero-shot algorithms have been proposed and experimented with. However, none of them seems to be the "overall winner". In situations like this, it may be possible to develop a meta-classifier that would combine "best aspects" of individual c… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

  16. Topical Classification of Food Safety Publications with a Knowledge Base

    Authors: Piotr Sowinski, Katarzyna Wasielewska-Michniewska, Maria Ganzha, Marcin Paprzycki

    Abstract: The vast body of scientific publications presents an increasing challenge of finding those that are relevant to a given research question, and making informed decisions on their basis. This becomes extremely difficult without the use of automated tools. Here, one possible area for improvement is automatic classification of publication abstracts according to their topic. This work introduces a nove… ▽ More

    Submitted 4 January, 2022; v1 submitted 2 January, 2022; originally announced January 2022.

  17. Exploring usability of Reddit in data science and knowledge processing

    Authors: Jan Sawicki, Maria Ganzha, Marcin Paprzycki, Amelia Bădică

    Abstract: This contribution argues that Reddit, as a massive, categorized, open-access dataset, is a useful data source, for "almost any topic". Hence, it can be used in data science, e.g. for knowledge exploration. This statement is backed-up with presented analysis, based on 180 manually annotated papers, related to Reddit itself, and data acquired from popular databases of scientific papers. Finally, an… ▽ More

    Submitted 14 April, 2023; v1 submitted 5 October, 2021; originally announced October 2021.

  18. Semantic Access Control for Privacy Management of Personal Sensing in Smart Cities

    Authors: Michał Drozdowicz, Maria Ganzha, Marcin Paprzycki

    Abstract: Personal and home sensors generate valuable information that could be used in Smart Cities. Unfortunately, typically, this data is locked out and used only by application/system developer. While vendors are to blame, one should consider also the "binary nature" of data access. Specifically, either owner has full control over her data (e.g. in a "closed system"), or she completely looses control, w… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

    Comments: in IEEE Transactions on Emerging Topics in Computing (Early Access), 2020

  19. A Review of Platforms for the Development of Agent Systems

    Authors: Constantin-Valentin Pal, Florin Leon, Marcin Paprzycki, Maria Ganzha

    Abstract: Agent-based computing is an active field of research with the goal of building autonomous software of hardware entities. This task is often facilitated by the use of dedicated, specialized frameworks. For almost thirty years, many such agent platforms have been developed. Meanwhile, some of them have been abandoned, others continue their development and new platforms are released. This paper prese… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: 40 pages, 2 figures, 9 tables, 83 references

    MSC Class: 68T42

    Journal ref: Information, vol. 14, no. 6, article number 348, 47 pag., 2023

  20. arXiv:1111.4545  [pdf

    cs.CR

    Grid Security and Integration with Minimal Performance Degradation

    Authors: Sugata Sanyal, Rangarajan A. Vasudevan, Ajith Abraham, Marcin Paprzycki

    Abstract: Computational grids are believed to be the ultimate framework to meet the growing computational needs of the scientific community. Here, the processing power of geographically distributed resources working under different ownerships, having their own access policy, cost structure and the likes, is logically coupled to make them perform as a unified resource. The continuous increase of availability… ▽ More

    Submitted 19 November, 2011; originally announced November 2011.

    Comments: 8 Pages, 1 Figure

    Journal ref: Journal of Digital Information Management, Vol. 2, Issue 2, September, 2004

  21. arXiv:cs/0405050  [pdf

    cs.AI

    Traffic Accident Analysis Using Decision Trees and Neural Networks

    Authors: Miao M. Chong, Ajith Abraham, Marcin Paprzycki

    Abstract: The costs of fatalities and injuries due to traffic accident have a great impact on society. This paper presents our research to model the severity of injury resulting from traffic accidents using artificial neural networks and decision trees. We have applied them to an actual data set obtained from the National Automotive Sampling System (NASS) General Estimates System (GES). Experiment results… ▽ More

    Submitted 15 May, 2004; originally announced May 2004.

    ACM Class: I.2.0

    Journal ref: IADIS International Conference on Applied Computing, Portugal, IADIS Press, Pedro Isaias et al. (Eds.), ISBN: 9729894736, Volume 2, pp. 39-42, 2004

  22. arXiv:cs/0405017  [pdf

    cs.AI

    Data Mining Approach for Analyzing Call Center Performance

    Authors: Marcin Paprzycki, Ajith Abraham, Ruiyuan Guo

    Abstract: The aim of our research was to apply well-known data mining techniques (such as linear neural networks, multi-layered perceptrons, probabilistic neural networks, classification and regression trees, support vector machines and finally a hybrid decision tree neural network approach) to the problem of predicting the quality of service in call centers; based on the performance data actually collect… ▽ More

    Submitted 4 May, 2004; originally announced May 2004.

    ACM Class: I.2.0

    Journal ref: The 17th International Conference on Industrial & Engineering Applications of Artificial Intelligence and Expert Systems, Canada, Springer Verlag, Germany, 2004 (forth coming)