Search | arXiv e-print repository

FullBrain: a Social E-learning Platform

Authors: Mirko Biasini, Vittorio Carmignani, Nicola Ferro, Panagiotis Filianos, Maria Maistro, Giorgio Maria di Nunzio

Abstract: We present FullBrain, a social e-learning platform where students share and track their knowledge. FullBrain users can post notes, ask questions and share learning resources in dedicated course and concept spaces. We detail two components of FullBrain: a SIR system equipped with query autocomplete and query autosuggestion, and a Leaderboard module to improve user experience. We analyzed the day-to… ▽ More We present FullBrain, a social e-learning platform where students share and track their knowledge. FullBrain users can post notes, ask questions and share learning resources in dedicated course and concept spaces. We detail two components of FullBrain: a SIR system equipped with query autocomplete and query autosuggestion, and a Leaderboard module to improve user experience. We analyzed the day-to-day users' usage of the SIR system, measuring a time-to-complete a request below 0.11s, matching or exceeding our UX targets. Moreover, we performed stress tests which lead the way for more detailed analysis. Through a preliminary user study and log data analysis, we observe that 97% of the users' activity is directed to the top 4 positions in the leaderboard. △ Less

Submitted 1 December, 2022; originally announced December 2022.

arXiv:2112.06562 [pdf, other]

One Size Fits All: A Conceptual Data Model for Any Approach to Terminology

Authors: Giorgio Maria Di Nunzio, Federica Vezzani

Abstract: In this paper, we want to speculate about the possibility to model all the currently known/proposed approaches to terminology into a single schema. We will use the Entity-Relationship (ER) diagram as our tool for the conceptual data model of the problem and to express the associations between the objects of the study. We will analyse the onomasiological and semasiological approaches, the ontotermi… ▽ More In this paper, we want to speculate about the possibility to model all the currently known/proposed approaches to terminology into a single schema. We will use the Entity-Relationship (ER) diagram as our tool for the conceptual data model of the problem and to express the associations between the objects of the study. We will analyse the onomasiological and semasiological approaches, the ontoterminology paradigm, and the frame-based model, and we will draw the consequences in terms of the conceptual data model. The result of this discussion will be used as the basis of the next step of the data organization in terms of standardized terminological records and Linked Data. △ Less

Submitted 13 December, 2021; originally announced December 2021.

arXiv:2110.15683 [pdf, other]

Incentives for Item Duplication under Fair Ranking Policies

Authors: Giorgio Maria Di Nunzio, Alessandro Fabris, Gianmaria Silvello, Gian Antonio Susto

Abstract: Ranking is a fundamental operation in information access systems, to filter information and direct user attention towards items deemed most relevant to them. Due to position bias, items of similar relevance may receive significantly different exposure, raising fairness concerns for item providers and motivating recent research into fair ranking. While the area has progressed dramatically over rece… ▽ More Ranking is a fundamental operation in information access systems, to filter information and direct user attention towards items deemed most relevant to them. Due to position bias, items of similar relevance may receive significantly different exposure, raising fairness concerns for item providers and motivating recent research into fair ranking. While the area has progressed dramatically over recent years, no study to date has investigated the potential problem posed by duplicated items. Duplicates and near-duplicates are common in several domains, including marketplaces and document collections available to search engines. In this work, we study the behaviour of different fair ranking policies in the presence of duplicates, quantifying the extra-exposure gained by redundant items. We find that fairness-aware ranking policies may conflict with diversity, due to their potential to incentivize duplication more than policies solely focused on relevance. This fact poses a problem for system owners who, as a result of this incentive, may have to deal with increased redundancy, which is at odds with user satisfaction. Finally, we argue that this aspect represents a blind spot in the normative reasoning underlying common fair ranking metrics, as rewarding providers who duplicate their items with increased exposure seems unfair for the remaining providers. △ Less

Submitted 29 October, 2021; originally announced October 2021.

arXiv:1912.08582 [pdf]

Towards an automatic recognition of mixed languages: The Ukrainian-Russian hybrid language Surzhyk

Authors: Nataliya Sira, Giorgio Maria Di Nunzio, Viviana Nosilia

Abstract: Language interference is common in today's multilingual societies where more languages are being in contact and as a global final result leads to the creation of hybrid languages. These, together with doubts on their right to be officially recognised made emerge in the area of computational linguistics the problem of their automatic identification and further elaboration. In this paper, we propose… ▽ More Language interference is common in today's multilingual societies where more languages are being in contact and as a global final result leads to the creation of hybrid languages. These, together with doubts on their right to be officially recognised made emerge in the area of computational linguistics the problem of their automatic identification and further elaboration. In this paper, we propose a first attempt to identify the elements of a Ukrainian-Russian hybrid language, Surzhyk, through the adoption of the example-based rules created with the instruments of programming language R. Our example-based study consists of: 1) analysis of spoken samples of Surzhyk registered by Del Gaudio (2010) in Kyiv area and creation of the written corpus; 2) production of specific rules on the identification of Surzhyk patterns and their implementation; 3) testing the code and analysing the effectiveness. △ Less

Submitted 18 December, 2019; originally announced December 2019.

arXiv:1905.01257 [pdf, other]

A Relation Extraction Approach for Clinical Decision Support

Authors: Maristella Agosti, Giorgio Maria Di Nunzio, Stefano Marchesin, Gianmaria Silvello

Abstract: In this paper, we investigate how semantic relations between concepts extracted from medical documents can be employed to improve the retrieval of medical literature. Semantic relations explicitly represent relatedness between concepts and carry high informative power that can be leveraged to improve the effectiveness of retrieval functionalities of clinical decision support systems. We present pr… ▽ More In this paper, we investigate how semantic relations between concepts extracted from medical documents can be employed to improve the retrieval of medical literature. Semantic relations explicitly represent relatedness between concepts and carry high informative power that can be leveraged to improve the effectiveness of retrieval functionalities of clinical decision support systems. We present preliminary results and show how relations are able to provide a sizable increase of the precision for several topics, albeit having no impact on others. We then discuss some future directions to minimize the impact of negative results while maximizing the impact of good results. △ Less

Submitted 3 May, 2019; originally announced May 2019.

Comments: 4 pages, 1 figure, DTMBio-KMH 2018, in conjunction with ACM 27th Conference on Information and Knowledge Management (CIKM), October 22-26 2018, Lingotto, Turin, Italy

ACM Class: H.3.1; H.3.3

arXiv:1605.04144 [pdf, other]

Estimating the number of receiving nodes in 802.11 networks via machine learning techniques

Authors: Davide Del Desta, Matteo Danieletto, Giorgio Maria Di Nunzio, Michele Zorzi

Abstract: Nowadays, most mobile devices are equipped with multiple wireless interfaces, causing an emerging research interest in device to device (D2D) communication: the idea behind the D2D paradigm is to exploit the proper interface to directly communicate with another user, without traversing any network infrastructure. A first issue related to this paradigm consists in the need for a coordinator, called… ▽ More Nowadays, most mobile devices are equipped with multiple wireless interfaces, causing an emerging research interest in device to device (D2D) communication: the idea behind the D2D paradigm is to exploit the proper interface to directly communicate with another user, without traversing any network infrastructure. A first issue related to this paradigm consists in the need for a coordinator, called controller, able to decide when activating a D2D connection is appropriate and eventually able to manage such connection. In this view, the paradigm of Software Defined Networking (SDN), can be exploited both to handle the data flows among the devices and to interact directly with every device. This work is focused on a scenario where a device is selected by the SDN controller, in order to become the master node of a WiFi-Direct network. The remaining nodes, called clients, can exchange data with other nodes through the master. The objective is to infer, through different Machine Learning approaches, the number of nodes actively involved in receiving data, exploiting only the information available at the client side and without modifying any standard communication protocol. The information about the number of client nodes is crucial when, e.g., a user desires a precise prediction of the transmission estimated time of arrival (ETA) while downloading a file. △ Less

Submitted 13 May, 2016; originally announced May 2016.

Showing 1–6 of 6 results for author: Di Nunzio, G M