Showing 1–50 of 60 results for author: Giunchiglia, F

Searching in archive cs.
  1. Artificial Intelligence in Everyday Life 2.0: Educating University Students from Different Majors

    Authors: Maria Kasinidou, Styliani Kleanthous, Matteo Busso, Marcelo Rodas, Jahna Otterbacher, Fausto Giunchiglia

    Abstract: With the surge in data-centric AI and its increasing capabilities, AI applications have become a part of our everyday lives. However, misunderstandings regarding their capabilities, limitations, and associated advantages and disadvantages are widespread. Consequently, in the university setting, there is a crucial need to educate not only computer science majors but also students from various disci…

    Submitted 12 April, 2024; originally announced June 2024.

    Comments: 7 pages, ITiCSE conference

  2. arXiv:2405.12434

    cs.CL

    Resolving Word Vagueness with Scenario-guided Adapter for Natural Language Inference

    Authors: Yonghao Liu, Mengyu Li, Di Liang, Ximing Li, Fausto Giunchiglia, Lan Huang, Xiaoyue Feng, Renchu Guan

    Abstract: Natural Language Inference (NLI) is a crucial task in natural language processing that involves determining the relationship between two sentences, typically referred to as the premise and the hypothesis. However, traditional NLI models solely rely on the semantic information inherent in independent sentences and lack relevant situational visual information, which can hinder a complete understandi…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: IJCAI24

  3. arXiv:2405.11524

    cs.CL

    Simple-Sampling and Hard-Mixup with Prototypes to Rebalance Contrastive Learning for Text Classification

    Authors: Mengyu Li, Yonghao Liu, Fausto Giunchiglia, Xiaoyue Feng, Renchu Guan

    Abstract: Text classification is a crucial and fundamental task in natural language processing. Compared with the previous learning paradigm of pre-training and fine-tuning by cross entropy loss, the recently proposed supervised contrastive learning approach has received tremendous attention due to its powerful feature learning capability and robustness. Although several studies have incorporated this techn…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 12 pages, 9 figures

  4. arXiv:2405.04054

    cs.HC

    What Impacts the Quality of the User Answers when Asked about the Current Context?

    Authors: Ivano Bison, Haonan Zhao, Fausto Giunchiglia

    Abstract: Sensor data provide an objective view of reality but fail to capture the subjective motivations behind an individual's behavior. This latter information is crucial for learning about the various dimensions of the personal context, thus increasing predictability. The main limitation is the human input, which is often not of the quality that is needed. The work so far has focused on the usually high…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 25 pages, 16 figures, under review by Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies

  5. arXiv:2405.01783

    cs.CL

    Layers of technology in pluriversal design. Decolonising language technology with the LiveLanguage initiative

    Authors: Gertraud Koch, Gábor Bella, Paula Helm, Fausto Giunchiglia

    Abstract: Language technology has the potential to facilitate intercultural communication through meaningful translations. However, the current state of language technology is deeply entangled with colonial knowledge due to path dependencies and neo-colonial tendencies in the global governance of artificial intelligence (AI). Language technology is a complex and emerging field that presents challenges for c…

    Submitted 2 May, 2024; originally announced May 2024.

  6. arXiv:2404.17602

    cs.HC

    A Methodology and System For Big-Thick Data Collection

    Authors: Ivan Kayongo, Haonan Zhao, Leonardo Malcotti, Fausto Giunchiglia

    Abstract: Pervasive sensors have become essential in research for gathering real-world data. However, current studies often focus solely on objective data, neglecting subjective human contributions. We introduce an approach and system for collecting big-thick data, combining extensive sensor data (big data) with qualitative human feedback (thick data). This fusion enables effective collaboration between hum…

    Submitted 1 July, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures, accepted by Aduous workshop

  7. arXiv:2403.20215

    cs.CL

    Advancing the Arabic WordNet: Elevating Content Quality

    Authors: Abed Alhakim Freihat, Hadi Khalilia, Gábor Bella, Fausto Giunchiglia

    Abstract: High-quality WordNets are crucial for achieving high-quality results in NLP applications that rely on such resources. However, the wordnets of most languages suffer from serious issues of correctness and completeness with respect to the words and word meanings they define, such as incorrect lemmas, missing glosses and example sentences, or an inadequate, Western-centric representation of the morph…

    Submitted 29 March, 2024; originally announced March 2024.

  8. arXiv:2401.11753

    cs.AI cs.DL

    From Knowledge Organization to Knowledge Representation and Back

    Authors: Fausto Giunchiglia, Mayukh Bagchi, Subhashis Das

    Abstract: Knowledge Organization (KO) and Knowledge Representation (KR) have been the two mainstream methodologies of knowledge modelling in the Information Science community and the Artificial Intelligence community, respectively. The facet-analytical tradition of KO has developed an exhaustive set of guiding canons for ensuring quality in organising and managing knowledge but has remained limited in terms…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted @ Annals of Library and Information Studies (ALIS) Journal - Ranganathan Commemorative Issue (2024)

    Report number: DISI22012024

  9. arXiv:2312.17263

    cs.CL

    TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification

    Authors: Rui Song, Fausto Giunchiglia, Yingji Li, Mingjie Tian, Hao Xu

    Abstract: Cross-domain text classification aims to transfer models from label-rich source domains to label-poor target domains, giving it a wide range of practical applications. Many approaches promote cross-domain generalization by capturing domain-invariant features. However, these methods rely on unlabeled samples provided by the target domains, which renders the model ineffective when the target domain…

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI-2024

  10. arXiv:2312.07302

    cs.AI cs.DL

    From Knowledge Representation to Knowledge Organization and Back

    Authors: Fausto Giunchiglia, Mayukh Bagchi

    Abstract: Knowledge Representation (KR) and facet-analytical Knowledge Organization (KO) have been the two most prominent methodologies of data and knowledge modelling in the Artificial Intelligence community and the Information Science community, respectively. KR boasts of a robust and scalable ecosystem of technologies to support knowledge modelling while, often, underemphasizing the quality of its models…

    Submitted 8 January, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: International Conference on Information (iConference) 2024 - Wisdom, Well-being, Win-win - Springer LNCS, Springer Cham Switzerland

    Report number: KNOWDIVEDISI012024

  11. arXiv:2311.12465

    cs.AI

    Towards a Gateway for Knowledge Graph Schemas Collection, Analysis, and Embedding

    Authors: Mattia Fumagalli, Marco Boffo, Daqian Shi, Mayukh Bagchi, Fausto Giunchiglia

    Abstract: One of the significant barriers to the training of statistical models on knowledge graphs is the difficulty that scientists have in finding the best input data to address their prediction goal. In addition to this, a key challenge is to determine how to manipulate these relational data, which are often in the form of particular triples (i.e., subject, predicate, object), to enable the learning pro…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: Ontology Showcase and Demonstrations Track, 9th Joint Ontology Workshops (JOWO 2023), Co-located with FOIS 2023, 19-20 July, 2023, Sherbrooke, Québec, Canada. arXiv admin note: substantial text overlap with arXiv:2207.06112

    Report number: DISIKNOWDIVE21112023

  12. arXiv:2309.10681

    cs.CY

    Social Interactions Mediated by the Internet and the Big-Five: a Cross-Country Analysis

    Authors: Andrea Mercado, Alethia Hume, Ivanno Bison, Fausto Giunchiglia, Amarsanaa Ganbold, Luca Cernuzzi

    Abstract: This study analyzes the possible relationship between personality traits, in terms of Big Five (extraversion, agreeableness, responsibility, emotional stability and openness to experience), and social interactions mediated by digital platforms in different socioeconomic and cultural contexts. We considered data from a questionnaire and the experience of using a chatbot, as a means of requesting and…

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 5 pages

    MSC Class: 68U99 ACM Class: J.4

  13. arXiv:2308.13056

    cs.CL

    Lexical Diversity in Kinship Across Languages and Dialects

    Authors: Hadi Khalilia, Gábor Bella, Abed Alhakim Freihat, Shandy Darma, Fausto Giunchiglia

    Abstract: Languages are known to describe the world in diverse ways. Across lexicons, diversity is pervasive, appearing through phenomena such as lexical gaps and untranslatability. However, in computational resources, such as multilingual lexical databases, diversity is hardly ever represented. In this paper, we introduce a method to enrich computational lexicons with content relating to linguistic diversi…

    Submitted 26 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  14. arXiv:2307.14119

    cs.CV cs.AI cs.MM

    A semantics-driven methodology for high-quality image annotation

    Authors: Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao

    Abstract: Recent work in Machine Learning and Computer Vision has highlighted the presence of various types of systematic flaws inside ground truth object recognition benchmark datasets. Our basic tenet is that these flaws are rooted in the many-to-many mappings which exist between the visual information encoded in images and the intended semantics of the labels annotating them. The net consequence is that…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Accepted @ 26th European Conference on Artificial Intelligence (ECAI) 2023, Kraków, Poland

    Report number: KDECAI23

  15. arXiv:2307.13714

    cs.CY cs.CL

    Diversity and Language Technology: How Techno-Linguistic Bias Can Cause Epistemic Injustice

    Authors: Paula Helm, Gábor Bella, Gertraud Koch, Fausto Giunchiglia

    Abstract: It is well known that AI-based language technology -- large language models, machine translation systems, multilingual dictionaries, and corpora -- is currently limited to 2 to 3 percent of the world's most widely spoken and/or financially and politically best supported languages. In response, recent research efforts have sought to extend the reach of AI technology to "underserved languages." In…

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2307.13405

  16. arXiv:2307.13405

    cs.CL cs.AI

    Towards Bridging the Digital Language Divide

    Authors: Gábor Bella, Paula Helm, Gertraud Koch, Fausto Giunchiglia

    Abstract: It is a well-known fact that current AI-based language technology -- language models, machine translation systems, multilingual dictionaries and corpora -- focuses on the world's 2-3% most widely spoken languages. Recent research efforts have attempted to expand the coverage of AI technology to 'under-resourced languages.' The goal of our paper is to bring attention to a phenomenon that we call li…

    Submitted 25 July, 2023; originally announced July 2023.

    ACM Class: I.2.7; K.4.2

  17. arXiv:2307.01214

    cs.CL cs.AI

    Automatic Counterfactual Augmentation for Robust Text Classification Based on Word-Group Search

    Authors: Rui Song, Fausto Giunchiglia, Yingji Li, Hao Xu

    Abstract: Although large-scale pre-trained language models have achieved striking results for text classification, recent work has raised concerns about the challenge of shortcut learning. In general, a keyword is regarded as a shortcut if it creates a superficial association with the label, resulting in a false prediction. Conversely, shortcut learning can be mitigated if the model relies on robust causal fe…

    Submitted 30 June, 2023; originally announced July 2023.

    Comments: 13 pages, 7 figures

  18. arXiv:2306.08561

    physics.soc-ph cs.CY

    Adaptation of Student Behavioural Routines during COVID-19: A Multimodal Approach

    Authors: Nicolò A. Girardini, Simone Centellegher, Andrea Passerini, Ivano Bison, Fausto Giunchiglia, Bruno Lepri

    Abstract: One population group that had to significantly adapt and change their behaviour during the COVID-19 pandemic is students. While previous studies have extensively investigated the impact of the pandemic on their psychological well-being and academic performance, limited attention has been given to their activity routines. In this work, we analyze students' behavioural changes by examining qualitati…

    Submitted 14 June, 2023; originally announced June 2023.

  19. arXiv:2306.05884

    cs.CY

    Social interactions mediated by the Internet and the Big-Five: a cross-country analysis

    Authors: Andrea Mercado, Alethia Hume, Ivano Bison, Fausto Giunchiglia, Amarsanaa Ganbold, Luca Cernuzzi

    Abstract: This study analyzes the possible relationship between personality traits, in terms of Big Five (extraversion, agreeableness, responsibility, emotional stability and openness to experience), and social interactions mediated by digital platforms in different socioeconomic and cultural contexts. We considered data from a questionnaire and the experience of using a chatbot, as a means of requesting and…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Diversity-aware Hybrid Human-Artificial Intelligence (DHHAI)2023 workshop, 5 pages

  20. arXiv:2305.06088

    cs.AI cs.DB

    Building Interoperable Electronic Health Records as Purpose-Driven Knowledge Graphs

    Authors: Simone Bocca, Alessio Zamboni, Gabor Bella, Yamini Chandrashekar, Mayukh Bagchi, Gabriel Kuper, Paolo Bouquet, Fausto Giunchiglia

    Abstract: When building a new application we are increasingly confronted with the need of reusing and integrating pre-existing knowledge. Nevertheless, it is a fact that this prior knowledge is virtually impossible to reuse as-is. This is true also in domains, e.g., eHealth, where a lot of effort has been put into developing high-quality standards and reference ontologies, e.g. FHIR1. In this paper, we prop…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: DSAI SPRINGER BOOK. arXiv admin note: text overlap with arXiv:2105.09418

    Report number: DISIDSAI2023

    Journal ref: DSAI SPRINGER BOOK, 2023

  21. arXiv:2305.05422

    cs.AI cs.CV

    Egocentric Hierarchical Visual Semantics

    Authors: Luca Erculiani, Andrea Bontempelli, Andrea Passerini, Fausto Giunchiglia

    Abstract: We are interested in aligning how people think about objects and what machines perceive, meaning by this the fact that object recognition, as performed by a machine, should follow a process which resembles that followed by humans when thinking of an object associated with a certain concept. The ultimate goal is to build systems which can meaningfully interact with their users, describing what they…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 10 pages, 5 figures, Accepted for publication at The second International Conference on Hybrid Human-Artificial Intelligence (HHAI2023)

  22. arXiv:2304.08989

    cs.CV

    Incremental Image Labeling via Iterative Refinement

    Authors: Fausto Giunchiglia, Xiaolei Diao, Mayukh Bagchi

    Abstract: Data quality is critical for multimedia tasks, while various types of systematic flaws are found in image benchmark datasets, as discussed in recent work. In particular, the existence of the semantic gap problem leads to a many-to-many mapping between the information extracted from an image and its linguistic description. This unavoidable bias further leads to poor performance on current computer…

    Submitted 18 April, 2023; originally announced April 2023.

    Journal ref: IWCIM@ICASSP 2023

  23. arXiv:2304.07910

    cs.AI

    Recognizing Entity Types via Properties

    Authors: Daqian Shi, Fausto Giunchiglia

    Abstract: The mainstream approach to the development of ontologies is merging ontologies encoding different information, where one of the major difficulties is that the heterogeneity motivates the ontology merging but also limits high-quality merging performance. Thus, the entity type (etype) recognition task is proposed to deal with such heterogeneity, aiming to infer the class of entities and etypes by ex…

    Submitted 24 April, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

    Comments: FOIS 2023 conference paper

  24. arXiv:2302.13591

    cs.AI

    Towards Ranking Schemas by Focus

    Authors: Mattia Fumagalli, Daqian Shi, Fausto Giunchiglia

    Abstract: The main goal of this paper is to evaluate knowledge base schemas, modeled as a set of entity types, each such type being associated with a set of properties, according to their focus. We intuitively model the notion of focus as "the state or quality of being relevant in storing and retrieving information". This definition of focus is adapted from the notion of "categorization purpose", as fir…

    Submitted 27 February, 2023; originally announced February 2023.

  25. arXiv:2302.08591

    cs.HC cs.CY cs.MM

    Complex Daily Activities, Country-Level Diversity, and Smartphone Sensing: A Study in Denmark, Italy, Mongolia, Paraguay, and UK

    Authors: Karim Assi, Lakmal Meegahapola, William Droz, Peter Kun, Amalia de Gotzen, Miriam Bidoglia, Sally Stares, George Gaskell, Altangerel Chagnaa, Amarsanaa Ganbold, Tsolmon Zundui, Carlo Caprini, Daniele Miorandi, Alethia Hume, Jose Luis Zarza, Luca Cernuzzi, Ivano Bison, Marcelo Dario Rodas Britez, Matteo Busso, Ronald Chenu-Abente, Fausto Giunchiglia, Daniel Gatica-Perez

    Abstract: Smartphones enable understanding human behavior with activity recognition to support people's daily lives. Prior studies focused on using inertial sensors to detect simple activities (sitting, walking, running, etc.) and were mostly conducted in homogeneous populations within a country. However, people are more sedentary in the post-pandemic world with the prevalence of remote/hybrid work/study se…

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: ACM CHI 2023

  26. arXiv:2301.09169

    cs.CL cs.AI

    Representing Interlingual Meaning in Lexical Databases

    Authors: Fausto Giunchiglia, Gabor Bella, Nandu Chandran Nair, Yang Chi, Hao Xu

    Abstract: In today's multilingual lexical databases, the majority of the world's languages are under-represented. Beyond a mere issue of resource incompleteness, we show that existing lexical databases have structural limitations that result in a reduced expressivity on culturally-specific words and in mapping them across languages. In particular, the lexical meaning space of dominant languages, such as Eng…

    Submitted 22 January, 2023; originally announced January 2023.

    ACM Class: I.2.4; I.2.7

  27. arXiv:2212.06629

    cs.CV cs.AI

    Aligning Visual and Lexical Semantics

    Authors: Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao

    Abstract: We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on how humans build concepts of the same target reality through the use of language. The lack of coincidence between visual and lexical semantics, in tur…

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: iConference 2023, Barcelona, March 27 - 29, 2023

  28. arXiv:2211.03009

    cs.HC cs.CY cs.MM

    Generalization and Personalization of Mobile Sensing-Based Mood Inference Models: An Analysis of College Students in Eight Countries

    Authors: Lakmal Meegahapola, William Droz, Peter Kun, Amalia de Gotzen, Chaitanya Nutakki, Shyam Diwakar, Salvador Ruiz Correa, Donglei Song, Hao Xu, Miriam Bidoglia, George Gaskell, Altangerel Chagnaa, Amarsanaa Ganbold, Tsolmon Zundui, Carlo Caprini, Daniele Miorandi, Alethia Hume, Jose Luis Zarza, Luca Cernuzzi, Ivano Bison, Marcelo Rodas Britez, Matteo Busso, Ronald Chenu-Abente, Can Gunel, Fausto Giunchiglia, et al. (2 additional authors not shown)

    Abstract: Mood inference with mobile sensing data has been studied in ubicomp literature over the last decade. This inference enables context-aware and personalized user experiences in general mobile apps and valuable feedback and interventions in mobile health apps. However, even though model generalization issues have been highlighted in many studies, the focus has always been on improving the accuracies…

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: ACM IMWUT 2022, To be presented at ACM Ubicomp 2023

  29. arXiv:2209.14049

    cs.AI cs.DB cs.DL

    Popularity Driven Data Integration

    Authors: Fausto Giunchiglia, Simone Bocca, Mattia Fumagalli, Mayukh Bagchi, Alessio Zamboni

    Abstract: More and more, with the growing focus on large scale analytics, we are confronted with the need of integrating data from multiple sources. The problem is that these data are impossible to reuse as-is. The net result is high cost, with the further drawback that the resulting integrated data will again be hardly reusable as-is. iTelos is a general purpose methodology aiming at minimizing the effects…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: KGSWC 2022. Fourth Ibero-American Knowledge Graph and Semantic Web Conference joint with Third Indo-American Knowledge Graph and Semantic Web Conference 21-23 November 2022, Universidad Camilo José Cela, Madrid, Spain. arXiv admin note: substantial text overlap with arXiv:2105.09418

  30. arXiv:2207.06112

    cs.AI cs.DB

    LiveSchema: A Gateway Towards Learning on Knowledge Graph Schemas

    Authors: Mattia Fumagalli, Marco Boffo, Daqian Shi, Mayukh Bagchi, Fausto Giunchiglia

    Abstract: One of the major barriers to the training of algorithms on knowledge graph schemas, such as vocabularies or ontologies, is the difficulty that scientists have in finding the best input resource to address the target prediction tasks. In addition to this, a key challenge is to determine how to manipulate (and embed) these data, which are often in the form of particular triples (i.e., subject, predi…

    Submitted 13 July, 2022; originally announced July 2022.

  31. arXiv:2207.01091

    cs.AI cs.DB

    Representation Heterogeneity

    Authors: Fausto Giunchiglia, Mayukh Bagchi

    Abstract: Semantic Heterogeneity is conventionally understood as the existence of variance in the representation of a target reality when modelled, by independent parties, in different databases, schemas and/or data. We argue that the mere encoding of variance, while being necessary, is not sufficient to deal with the problem of representational heterogeneity, given that it is also necessary to enco…

    Submitted 3 July, 2022; originally announced July 2022.

  32. arXiv:2206.10212

    cs.HC

    A Context Model for Personal Data Streams

    Authors: Fausto Giunchiglia, Xiaoyue Li, Matteo Busso, Marcelo Rodas-Britez

    Abstract: We propose a model of the situational context of a person and show how it can be used to organize and, consequently, reason about massive streams of sensor data and annotations, as they can be collected from mobile devices, e.g. smartphones, smartwatches or fitness trackers. The proposed model is validated on a very large dataset about the everyday life of one hundred and fifty-eight people over f…

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: 8 pages, 3 figures, APWeb WAIM Conference

  33. arXiv:2206.07615

    cs.CL

    The SIGMORPHON 2022 Shared Task on Morpheme Segmentation

    Authors: Khuyagbaatar Batsuren, Gábor Bella, Aryaman Arora, Viktor Martinović, Kyle Gorman, Zdeněk Žabokrtský, Amarsanaa Ganbold, Šárka Dohnalová, Magda Ševčíková, Kateřina Pelegrinová, Fausto Giunchiglia, Ryan Cotterell, Ekaterina Vylomova

    Abstract: The SIGMORPHON 2022 shared task on morpheme segmentation challenged systems to decompose a word into a sequence of morphemes and covered most types of morphology: compounds, derivations, and inflections. Subtask 1, word-level morpheme segmentation, covered 5 million words in 9 languages (Czech, English, Spanish, Hungarian, French, Italian, Russian, Latin, Mongolian) and received 13 system submissi…

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: The 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology

  34. arXiv:2205.15769

    cs.LG cs.CV

    Concept-level Debugging of Part-Prototype Networks

    Authors: Andrea Bontempelli, Stefano Teso, Katya Tentori, Fausto Giunchiglia, Andrea Passerini

    Abstract: Part-prototype Networks (ProtoPNets) are concept-based classifiers designed to achieve the same performance as black-box models without compromising transparency. ProtoPNets compute predictions based on similarity to class-specific part-prototypes learned to recognize parts of training examples, making it easy to faithfully determine what examples are responsible for any target prediction and why.…

    Submitted 23 January, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at ICLR 2023

  35. arXiv:2205.10123

    cs.AI cs.LG

    Lifelong Personal Context Recognition

    Authors: Andrea Bontempelli, Marcelo Rodas Britez, Xiaoyue Li, Haonan Zhao, Luca Erculiani, Stefano Teso, Andrea Passerini, Fausto Giunchiglia

    Abstract: We focus on the development of AIs which live in lifelong symbiosis with a human. The key prerequisite for this task is that the AI understands - at any moment in time - the personal situational context that the human is in. We outline the key challenges that this task brings forth, namely (i) handling the human-like and ego-centric nature of the user's context, necessary for understanding and…

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: 8 pages

  36. arXiv:2205.03608

    cs.CL

    UniMorph 4.0: Universal Morphology

    Authors: Khuyagbaatar Batsuren, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Kyle Gorman, Yustinus Ghanggo Ate, Maria Ryskina, Sabrina J. Mielke, Elena Budianskaya, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Lane, Mohit Raj, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Benoît Sagot, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay, et al. (71 additional authors not shown)

    Abstract: The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This pa…

    Submitted 19 June, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: LREC 2022; The first two authors made equal contributions

  37. arXiv:2204.05049

    cs.CL

    Using Linguistic Typology to Enrich Multilingual Lexicons: the Case of Lexical Gaps in Kinship

    Authors: Temuulen Khishigsuren, Gábor Bella, Khuyagbaatar Batsuren, Abed Alhakim Freihat, Nandu Chandran Nair, Amarsanaa Ganbold, Hadi Khalilia, Yamini Chandrashekar, Fausto Giunchiglia

    Abstract: This paper describes a method to enrich lexical resources with content relating to linguistic diversity, based on knowledge from the field of lexical typology. We capture the phenomenon of diversity through the notions of lexical gap and language-specific word and use a systematic method to infer gaps semi-automatically on a large scale. As a first result obtained for the domain of kinship termino…

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: LREC 2022

  38. arXiv:2203.04723

    cs.CL

    Language Diversity: Visible to Humans, Exploitable by Machines

    Authors: Gábor Bella, Erdenebileg Byambadorj, Yamini Chandrashekar, Khuyagbaatar Batsuren, Danish Ashgar Cheema, Fausto Giunchiglia

    Abstract: The Universal Knowledge Core (UKC) is a large multilingual lexical database with a focus on language diversity and covering over a thousand languages. The aim of the database, as well as its tools and data catalogue, is to make the somewhat abstract notion of diversity visually understandable for humans and formally exploitable by machines. The UKC website lets users explore millions of individual…

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted for publication in ACL 2022

  39. The Theory, Practice, and Ethical Challenges of Designing a Diversity-Aware Platform for Social Relations

    Authors: Laura Schelenz, Ivano Bison, Matteo Busso, Amalia de Götzen, Daniel Gatica-Perez, Fausto Giunchiglia, Lakmal Meegahapola, Salvador Ruiz-Correa

    Abstract: Diversity-aware platform design is a paradigm that responds to the ethical challenges of existing social media platforms. Available platforms have been criticized for minimizing users' autonomy, marginalizing minorities, and exploiting users' data for profit maximization. This paper presents a design solution that centers the well-being of users. It presents the theory and practice of designing a…

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: AAAI/ACM Conference on AI, Ethics, and Society (AIES) 2021

  40. arXiv:2202.08512

    cs.CV cs.AI

    Visual Ground Truth Construction as Faceted Classification

    Authors: Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao

    Abstract: Recent work in Machine Learning and Computer Vision has provided evidence of systematic design flaws in the development of major object recognition benchmark datasets. One such example is ImageNet, wherein, for several categories of images, there are incongruences between the objects they represent and the labels used to annotate them. The consequences of this problem are major, in particular cons…

    Submitted 17 February, 2022; originally announced February 2022.

  41. arXiv:2112.10531  [pdf]

    cs.CV cs.AI

    Object Recognition as Classification via Visual Properties

    Authors: Fausto Giunchiglia, Mayukh Bagchi

    Abstract: We base our work on the teleosemantic modelling of concepts as abilities implementing the distinct functions of recognition and classification. Accordingly, we model two types of concepts - substance concepts suited for object recognition exploiting visual properties, and classification concepts suited for classification of substance concepts exploiting linguistically grounded properties. The goal…

    Submitted 28 December, 2021; v1 submitted 20 December, 2021; originally announced December 2021.

  42. arXiv:2109.11160  [pdf, ps, other]

    cs.LG cs.SE

    Toward a Unified Framework for Debugging Concept-based Models

    Authors: Andrea Bontempelli, Fausto Giunchiglia, Andrea Passerini, Stefano Teso

    Abstract: In this paper, we tackle interactive debugging of "gray-box" concept-based models (CBMs). These models learn task-relevant concepts appearing in the inputs and then compute a prediction by aggregating the concept activations. Our work stems from the observation that in CBMs both the concepts and the aggregation function can be affected by different kinds of bugs, and that fixing these bugs require…

    Submitted 17 February, 2022; v1 submitted 23 September, 2021; originally announced September 2021.

    Comments: 11 pages, 1 figure. Accepted at the AAAI-22 Workshop on Interactive Machine Learning

  43. arXiv:2109.09140  [pdf, other]

    cs.IT

    Property-based Entity Type Graph Matching

    Authors: Fausto Giunchiglia, Daqian Shi

    Abstract: We are interested in dealing with the heterogeneity of Knowledge bases (KBs), e.g., ontologies and schemas, modeled as sets of entity types (etypes), e.g., person, where each etype is associated with a set of properties, e.g., age or height, via an inheritance hierarchy. A huge literature exists on this topic. A common approach is to model KBs as graphs decorated with labels and reduce the problem…

    Submitted 19 September, 2021; originally announced September 2021.

  44. arXiv:2108.08234  [pdf, other]

    cs.AI

    Streaming and Learning the Personal Context

    Authors: Fausto Giunchiglia, Marcelo Rodas Britez, Andrea Bontempelli, Xiaoyue Li

    Abstract: The representation of the personal context is complex and essential to improve the help machines can give to humans for making sense of the world, and the help humans can give to machines to improve their efficiency. We aim to design a novel model representation of the personal context and design a learning process for better integration with machine learning. We aim to implement these elements in…

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: 9 pages, 4 figures

  45. arXiv:2106.03922  [pdf, other]

    cs.LG stat.ML

    Interactive Label Cleaning with Example-based Explanations

    Authors: Stefano Teso, Andrea Bontempelli, Fausto Giunchiglia, Andrea Passerini

    Abstract: We tackle sequential learning under label noise in applications where a human supervisor can be queried to relabel suspicious examples. Existing approaches are flawed, in that they only relabel incoming examples that look "suspicious" to the model. As a consequence, those mislabeled examples that elude (or don't undergo) this cleaning step end up tainting the training data and the model with no fu…

    Submitted 15 December, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: main article + supplementary material, Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  46. arXiv:2105.09432  [pdf, other]

    cs.DB cs.AI

    Stratified Data Integration

    Authors: Fausto Giunchiglia, Alessio Zamboni, Mayukh Bagchi, Simone Bocca

    Abstract: We propose a novel approach to the problem of semantic heterogeneity where data are organized into a set of stratified and independent representation layers, namely: conceptual (where a set of unique alinguistic identifiers are connected inside a graph codifying their meaning), language (where sets of synonyms, possibly from multiple languages, annotate concepts), knowledge (in the form of a graph wh…

    Submitted 19 May, 2021; originally announced May 2021.

  47. arXiv:2105.09422  [pdf, other]

    cs.AI cs.CV

    Classifying concepts via visual properties

    Authors: Fausto Giunchiglia, Mayukh Bagchi

    Abstract: We assume that substances in the world are represented by two types of concepts, namely substance concepts and classification concepts, the former instrumental to (visual) perception, the latter to (language based) classification. Based on this distinction, we introduce a general methodology for building lexico-semantic hierarchies of substance concepts, where nodes are annotated with the media, e…

    Submitted 19 May, 2021; originally announced May 2021.

  48. arXiv:2105.09418  [pdf, other]

    cs.DB cs.AI

    iTelos -- Purpose Driven Knowledge Graph Generation

    Authors: Fausto Giunchiglia, Simone Bocca, Mattia Fumagalli, Mayukh Bagchi, Alessio Zamboni

    Abstract: When building a new application we are more and more confronted with the need of reusing and integrating pre-existing knowledge, e.g., ontologies, schemas, data of any kind, from multiple sources. Nevertheless, it is a fact that this prior knowledge is virtually impossible to reuse as-is. This difficulty is the cause of high costs, with the further drawback that the resulting application will agai…

    Submitted 15 December, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

  49. arXiv:2104.12379  [pdf, other]

    cs.AI

    Towards Visual Semantics

    Authors: Fausto Giunchiglia, Luca Erculiani, Andrea Passerini

    Abstract: Lexical Semantics is concerned with how words encode mental representations of the world, i.e., concepts. We call this type of concepts, classification concepts. In this paper, we focus on Visual Semantics, namely on how humans build concepts representing what they perceive visually. We call this second type of concepts, substance concepts. As shown in the paper, these two types of concepts ar…

    Submitted 14 September, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

  50. arXiv:2104.05658  [pdf, other]

    cs.CY cs.AI

    Towards Algorithmic Transparency: A Diversity Perspective

    Authors: Fausto Giunchiglia, Jahna Otterbacher, Styliani Kleanthous, Khuyagbaatar Batsuren, Veronika Bogin, Tsvi Kuflik, Avital Shulner Tal

    Abstract: As the role of algorithmic systems and processes increases in society, so does the risk of bias, which can result in discrimination against individuals and social groups. Research on algorithmic bias has exploded in recent years, highlighting both the problems of bias, and the potential solutions, in terms of algorithmic transparency (AT). Transparency is important for facilitating fairness manage…

    Submitted 12 April, 2021; originally announced April 2021.