Skip to main content

Showing 1–50 of 87 results for author: Silvestri, F

.
  1. arXiv:2406.14972  [pdf, other

    cs.CL cs.IR

    A Tale of Trust and Accuracy: Base vs. Instruct LLMs in RAG Systems

    Authors: Florin Cuconasu, Giovanni Trappolini, Nicola Tonellotto, Fabrizio Silvestri

    Abstract: Retrieval Augmented Generation (RAG) represents a significant advancement in artificial intelligence combining a retrieval phase with a generative phase, with the latter typically being powered by large language models (LLMs). The current common practices in RAG involve using "instructed" LLMs, which are fine-tuned with supervised training to enhance their ability to follow instructions and are al… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2406.12535  [pdf, other

    cs.IR

    Predicting Award Winning Research Papers at Publication Time

    Authors: Riccardo Vella, Andrea Vitaletti, Fabrizio Silvestri

    Abstract: In recent years, many studies have been focusing on predicting the scientific impact of research papers. Most of these predictions are based on citations count or rely on features obtainable only from already published papers. In this study, we predict the likelihood for a research paper of winning an award only relying on information available at publication time. For each paper, we build the cit… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.11720  [pdf, other

    cs.IR

    Graph Neural Re-Ranking via Corpus Graph

    Authors: Andrea Giuseppe Di Francesco, Christian Giannetti, Nicola Tonellotto, Fabrizio Silvestri

    Abstract: Re-ranking systems aim to reorder an initial list of documents to satisfy better the information needs associated with a user-provided query. Modern re-rankers predominantly rely on neural network models, which have proven highly effective in representing samples from various modalities. However, these models typically evaluate query-document pairs in isolation, neglecting the underlying document… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: This preprint is the result of work in progress, therefore it should still be considered a draft

  4. arXiv:2405.19749  [pdf, other

    cs.IR

    Generating Query Recommendations via LLMs

    Authors: Andrea Bacciu, Enrico Palumbo, Andreas Damianou, Nicola Tonellotto, Fabrizio Silvestri

    Abstract: Query recommendation systems are ubiquitous in modern search engines, assisting users in producing effective queries to meet their information needs. However, these systems require a large amount of data to produce good recommendations, such as a large collection of documents to index and query logs. In particular, query logs and user data are not available in cold start scenarios. Query logs are… ▽ More

    Submitted 4 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: Generating Query Recommendations via LLMs

    MSC Class: H.3.3

  5. arXiv:2404.15760  [pdf, other

    cs.LG cs.AI stat.ML

    Debiasing Machine Unlearning with Counterfactual Examples

    Authors: Ziheng Chen, Jia Wang, Jun Zhuang, Abbavaram Gowtham Reddy, Fabrizio Silvestri, ** Huang, Kaushiki Nag, Kun Kuang, Xin Ning, Gabriele Tolomei

    Abstract: The right to be forgotten (RTBF) seeks to safeguard individuals from the enduring effects of their historical actions by implementing machine-learning techniques. These techniques facilitate the deletion of previously acquired knowledge without requiring extensive model retraining. However, they often overlook a critical issue: unlearning processes bias. This bias emerges from two main sources: (1… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  6. arXiv:2403.14339  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    $\nabla τ$: Gradient-based and Task-Agnostic machine Unlearning

    Authors: Daniel Trippa, Cesare Campagnano, Maria Sofia Bucarelli, Gabriele Tolomei, Fabrizio Silvestri

    Abstract: Machine Unlearning, the process of selectively eliminating the influence of certain data examples used during a model's training, has gained significant attention as a means for practitioners to comply with recent data protection regulations. However, existing unlearning methods face critical drawbacks, including their prohibitively high cost, often associated with a large number of hyperparameter… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 14 pages, 2 figures

  7. arXiv:2403.05358  [pdf, other

    cs.CY cs.LG cs.SI stat.ML

    Variational Inference of Parameters in Opinion Dynamics Models

    Authors: Jacopo Lenti, Fabrizio Silvestri, Gianmarco De Francisci Morales

    Abstract: Despite the frequent use of agent-based models (ABMs) for studying social phenomena, parameter estimation remains a challenge, often relying on costly simulation-based heuristics. This work uses variational inference to estimate the parameters of an opinion dynamics ABM, by transforming the estimation problem into an optimization task that can be solved directly. Our proposal relies on probabili… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  8. arXiv:2403.05185  [pdf, other

    cs.IR cs.LG

    Personalized Audiobook Recommendations at Spotify Through Graph Neural Networks

    Authors: Marco De Nadai, Francesco Fabbri, Paul Gigioli, Alice Wang, Ang Li, Fabrizio Silvestri, Laura Kim, Shawn Lin, Vladan Radosavljevic, Sandeep Ghael, David Nyhan, Hugues Bouchard, Mounia Lalmas-Roelleke, Andreas Damianou

    Abstract: In the ever-evolving digital audio landscape, Spotify, well-known for its music and talk content, has recently introduced audiobooks to its vast user base. While promising, this move presents significant challenges for personalized recommendations. Unlike music and podcasts, audiobooks, initially available for a fee, cannot be easily skimmed before purchase, posing higher stakes for the relevance… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: To appear in The Web Conference 2024 proceedings

  9. arXiv:2402.14802  [pdf, other

    cs.LG cs.IR cs.SI

    Link Prediction under Heterophily: A Physics-Inspired Graph Neural Network Approach

    Authors: Andrea Giuseppe Di Francesco, Francesco Caso, Maria Sofia Bucarelli, Fabrizio Silvestri

    Abstract: In the past years, Graph Neural Networks (GNNs) have become the `de facto' standard in various deep learning domains, thanks to their flexibility in modeling real-world phenomena represented as graphs. However, the message-passing mechanism of GNNs faces challenges in learnability and expressivity, hindering high performance on heterophilic graphs, where adjacent nodes frequently have different la… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 7 pages, 1 figure

  10. The Power of Noise: Redefining Retrieval for RAG Systems

    Authors: Florin Cuconasu, Giovanni Trappolini, Federico Siciliano, Simone Filice, Cesare Campagnano, Yoelle Maarek, Nicola Tonellotto, Fabrizio Silvestri

    Abstract: Retrieval-Augmented Generation (RAG) has recently emerged as a method to extend beyond the pre-trained knowledge of Large Language Models by augmenting the original prompt with relevant passages or documents retrieved by an Information Retrieval (IR) system. RAG has become increasingly important for Generative AI solutions, especially in enterprise settings or in any domain in which knowledge is c… ▽ More

    Submitted 1 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  11. arXiv:2401.03824  [pdf, ps, other

    cs.LG stat.ML

    A topological description of loss surfaces based on Betti Numbers

    Authors: Maria Sofia Bucarelli, Giuseppe Alessio D'Inverno, Monica Bianchini, Franco Scarselli, Fabrizio Silvestri

    Abstract: In the context of deep learning models, attention has recently been paid to studying the surface of the loss function in order to better understand training with methods based on gradient descent. This search for an appropriate description, both analytical and topological, has led to numerous efforts to identify spurious minima and characterize gradient dynamics. Our work aims to contribute to thi… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  12. arXiv:2312.17506  [pdf, other

    q-bio.QM cs.LG

    A graph neural network-based model with Out-of-Distribution Robustness for enhancing Antiretroviral Therapy Outcome Prediction for HIV-1

    Authors: Giulia Di Teodoro, Federico Siciliano, Valerio Guarrasi, Anne-Mieke Vandamme, Valeria Ghisetti, Anders Sönnerborg, Maurizio Zazzi, Fabrizio Silvestri, Laura Palagi

    Abstract: Predicting the outcome of antiretroviral therapies for HIV-1 is a pressing clinical challenge, especially when the treatment regimen includes drugs for which limited effectiveness data is available. This scarcity of data can arise either due to the introduction of a new drug to the market or due to limited use in clinical settings. To tackle this issue, we introduce a novel joint fusion model, whi… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: 32 pages, 2 figures

    MSC Class: 68 ACM Class: I.2.6

  13. arXiv:2312.15228  [pdf, other

    cs.LG cs.CL cs.CR

    Adversarial Data Poisoning for Fake News Detection: How to Make a Model Misclassify a Target News without Modifying It

    Authors: Federico Siciliano, Luca Maiano, Lorenzo Papa, Federica Baccini, Irene Amerini, Fabrizio Silvestri

    Abstract: Fake news detection models are critical to countering disinformation but can be manipulated through adversarial attacks. In this position paper, we analyze how an attacker can compromise the performance of an online learning detector on specific news content without being able to manipulate the original target news. In some contexts, such as social networks, where the attacker cannot exert complet… ▽ More

    Submitted 4 January, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

  14. arXiv:2312.02401  [pdf, other

    stat.ML cs.LG cs.SI

    Harmonizing Global Voices: Culturally-Aware Models for Enhanced Content Moderation

    Authors: Alex J. Chan, José Luis Redondo García, Fabrizio Silvestri, Colm O'Donnel, Konstantina Palla

    Abstract: Content moderation at scale faces the challenge of considering local cultural distinctions when assessing content. While global policies aim to maintain decision-making consistency and prevent arbitrary rule enforcement, they often overlook regional variations in interpreting natural language as expressed in content. In this study, we are looking into how moderation systems can tackle this issue b… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 12 pages, 8 Figures. Supplementary material

  15. arXiv:2311.17815  [pdf, other

    cs.AR cs.AI

    A Survey on Design Methodologies for Accelerating Deep Learning on Heterogeneous Architectures

    Authors: Fabrizio Ferrandi, Serena Curzel, Leandro Fiorin, Daniele Ielmini, Cristina Silvano, Francesco Conti, Alessio Burrello, Francesco Barchi, Luca Benini, Luciano Lavagno, Teodoro Urso, Enrico Calore, Sebastiano Fabio Schifano, Cristian Zambelli, Maurizio Palesi, Giuseppe Ascia, Enrico Russo, Nicola Petra, Davide De Caro, Gennaro Di Meo, Valeria Cardellini, Salvatore Filippone, Francesco Lo Presti, Francesco Silvestri, Paolo Palazzari , et al. (1 additional authors not shown)

    Abstract: In recent years, the field of Deep Learning has seen many disruptive and impactful advancements. Given the increasing complexity of deep neural networks, the need for efficient hardware accelerators has become more and more pressing to design heterogeneous HPC platforms. The design of Deep Learning accelerators requires a multidisciplinary approach, combining expertise from several areas, spanning… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  16. arXiv:2310.08909  [pdf, other

    cs.SI cs.AI cs.LG

    Evading Community Detection via Counterfactual Neighborhood Search

    Authors: Andrea Bernini, Fabrizio Silvestri, Gabriele Tolomei

    Abstract: Community detection techniques are useful for social media platforms to discover tightly connected groups of users who share common interests. However, this functionality often comes at the expense of potentially exposing individuals to privacy breaches by inadvertently revealing their tastes or preferences. Therefore, some users may wish to preserve their anonymity and opt out of community detect… ▽ More

    Submitted 7 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  17. arXiv:2310.04875  [pdf, other

    cs.LG cs.CL cs.CY cs.HC cs.OS

    Prompt-to-OS (P2OS): Revolutionizing Operating Systems and Human-Computer Interaction with Integrated AI Generative Models

    Authors: Gabriele Tolomei, Cesare Campagnano, Fabrizio Silvestri, Giovanni Trappolini

    Abstract: In this paper, we present a groundbreaking paradigm for human-computer interaction that revolutionizes the traditional notion of an operating system. Within this innovative framework, user requests issued to the machine are handled by an interconnected ecosystem of generative AI models that seamlessly integrate with or even replace traditional software applications. At the core of this paradigm… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: 5 pages, 1 figure. Accepted at IEEE CogMI 2023 (IEEE International Conference on Cognitive Machine Intelligence)

  18. arXiv:2309.17116  [pdf, other

    cs.LG

    Sheaf Hypergraph Networks

    Authors: Iulia Duta, Giulia Cassarà, Fabrizio Silvestri, Pietro Liò

    Abstract: Higher-order relations are widespread in nature, with numerous phenomena involving complex interactions that extend beyond simple pairwise connections. As a result, advancements in higher-order processing can accelerate the growth of various fields requiring structured data. Current approaches typically represent these interactions using hypergraphs. We enhance this representation by introducing c… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS 2023)

  19. Investigating the Robustness of Sequential Recommender Systems Against Training Data Perturbations

    Authors: Filippo Betello, Federico Siciliano, Pushkar Mishra, Fabrizio Silvestri

    Abstract: Sequential Recommender Systems (SRSs) are widely employed to model user behavior over time. However, their robustness in the face of perturbations in training data remains a largely understudied yet critical issue. A fundamental challenge emerges in previous studies aimed at assessing the robustness of SRSs: the Rank-Biased Overlap (RBO) similarity is not particularly suited for this task as it is… ▽ More

    Submitted 27 December, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Journal ref: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, vol 14609

  20. arXiv:2307.12798  [pdf, other

    cs.CL cs.IR

    RRAML: Reinforced Retrieval Augmented Machine Learning

    Authors: Andrea Bacciu, Florin Cuconasu, Federico Siciliano, Fabrizio Silvestri, Nicola Tonellotto, Giovanni Trappolini

    Abstract: The emergence of large language models (LLMs) has revolutionized machine learning and related fields, showcasing remarkable abilities in comprehending, generating, and manipulating human language. However, their conventional usage through API-based text prompt submissions imposes certain limitations in terms of context constraints and external source availability. To address these challenges, we p… ▽ More

    Submitted 27 July, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Journal ref: CEUR Workshop Proceedings (2023, Vol. 3537, pp. 29-37)

  21. arXiv:2306.14457  [pdf, other

    cs.CL

    Fauno: The Italian Large Language Model that will leave you senza parole!

    Authors: Andrea Bacciu, Giovanni Trappolini, Andrea Santilli, Emanuele Rodolà, Fabrizio Silvestri

    Abstract: This paper presents Fauno, the first and largest open-source Italian conversational Large Language Model (LLM). Our goal with Fauno is to democratize the study of LLMs in Italian, demonstrating that obtaining a fine-tuned conversational bot with a single GPU is possible. In addition, we release a collection of datasets for conversational AI in Italian. The datasets on which we fine-tuned Fauno inc… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  22. arXiv:2306.00707  [pdf, other

    cs.LG physics.data-an

    Renormalized Graph Neural Networks

    Authors: Francesco Caso, Giovanni Trappolini, Andrea Bacciu, Pietro Liò, Fabrizio Silvestri

    Abstract: Graph Neural Networks (GNNs) have become essential for studying complex data, particularly when represented as graphs. Their value is underpinned by their ability to reflect the intricacies of numerous areas, ranging from social to biological networks. GNNs can grapple with non-linear behaviors, emerging patterns, and complex connections; these are also typical characteristics of complex systems.… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  23. Integrating Item Relevance in Training Loss for Sequential Recommender Systems

    Authors: Andrea Bacciu, Federico Siciliano, Nicola Tonellotto, Fabrizio Silvestri

    Abstract: Sequential Recommender Systems (SRSs) are a popular type of recommender system that learns from a user's history to predict the next item they are likely to interact with. However, user interactions can be affected by noise stemming from account sharing, inconsistent preferences, or accidental clicks. To address this issue, we (i) propose a new evaluation protocol that takes multiple future items… ▽ More

    Submitted 10 June, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Journal ref: In Proceedings of the 17th ACM Conference on Recommender Systems (pp. 1114-1119) 2023

  24. arXiv:2305.01447  [pdf, other

    cs.MM cs.CL cs.CV cs.DB cs.IR

    Multimodal Neural Databases

    Authors: Giovanni Trappolini, Andrea Santilli, Emanuele Rodolà, Alon Halevy, Fabrizio Silvestri

    Abstract: The rise in loosely-structured data available through text, images, and other modalities has called for new ways of querying them. Multimedia Information Retrieval has filled this gap and has witnessed exciting progress in recent years. Tasks such as search and retrieval of extensive multimedia archives have undergone massive performance improvements, driven to a large extent by recent development… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Journal ref: SIGIR 2023: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

  25. arXiv:2305.00574  [pdf, other

    cs.IR

    The Dark Side of Explanations: Poisoning Recommender Systems with Counterfactual Examples

    Authors: Ziheng Chen, Fabrizio Silvestri, Jia Wang, Yongfeng Zhang, Gabriele Tolomei

    Abstract: Deep learning-based recommender systems have become an integral part of several online platforms. However, their black-box nature emphasizes the need for explainable artificial intelligence (XAI) approaches to provide human-understandable reasons why a specific item gets recommended to a given user. One such method is counterfactual explanation (CF). While CFs can be highly beneficial for users an… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: To be published in SIGIR2023

  26. arXiv:2304.09097  [pdf, other

    cs.IR cs.LG

    Sheaf4Rec: Sheaf Neural Networks for Graph-based Recommender Systems

    Authors: Antonio Purificato, Giulia Cassarà, Federico Siciliano, Pietro Liò, Fabrizio Silvestri

    Abstract: Recent advancements in Graph Neural Networks (GNN) have facilitated their widespread adoption in various applications, including recommendation systems. GNNs have proven to be effective in addressing the challenges posed by recommendation systems by efficiently modeling graphs in which nodes represent users or items and edges denote preference relationships. However, current GNN techniques represe… ▽ More

    Submitted 16 March, 2024; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: 21 pages, 8 figures

    MSC Class: 55 ACM Class: I.2.6; H.3.3

  27. arXiv:2303.09470  [pdf, other

    cs.LG cs.AI stat.ML

    Learning with Noisy Labels through Learnable Weighting and Centroid Similarity

    Authors: Farooq Ahmad Wani, Maria Sofia Bucarelli, Fabrizio Silvestri

    Abstract: We introduce a novel method for training machine learning models in the presence of noisy labels, which are prevalent in domains such as medical diagnosis and autonomous driving and have the potential to degrade a model's generalization performance. Inspired by established literature that highlights how deep learning models are prone to overfitting to noisy samples in the later epochs of training,… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

  28. arXiv:2303.08288  [pdf, other

    cs.CL cs.LG

    Attention-likelihood relationship in transformers

    Authors: Valeria Ruscio, Valentino Maiorca, Fabrizio Silvestri

    Abstract: We analyze how large language models (LLMs) represent out-of-context words, investigating their reliance on the given context to capture their semantics. Our likelihood-guided text perturbations reveal a correlation between token likelihood and attention values in transformer-based language models. Extensive experiments reveal that unexpected tokens cause the model to attend less to the informatio… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

  29. arXiv:2301.00214  [pdf

    cond-mat.mtrl-sci

    Understanding the Role of Non-Fullerene Acceptors Crystallinity on the Charge Transport Properties and Performance of Organic Solar Cells

    Authors: Pierluigi Mondelli, Pascal Kaienburg, Francesco Silvestri, Rebecca Scatena, Claire Welton, Martine Grandjean, Vincent Lemaur, Eduardo Solano, Mathias Nyman, Peter Horton, Simon Coles, Esther Barrena, Moritz Riede, Paolo Radaelli, David Beljonne, Manjunatha Reddy, Graham Morse

    Abstract: The active layer crystallinity has long been associated with favourable organic solar cells (OSCs) properties such as high mobility and Fill Factor. In particular, this applies to acceptor materials such as fullerene-derivatives and the most recent Non-Fullerene Acceptors (NFAs), which are now surpassing 19% of Power Conversion Efficiency. Despite these advantages are being commonly attributed to… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

    Comments: 16 pages, 8 figures

  30. arXiv:2212.06605  [pdf, other

    cs.DS

    Dimensionality reduction on complex vector spaces for Euclidean distance with dynamic weights

    Authors: Paolo Pellizzoni, Simone Moretti, Francesco Silvestri

    Abstract: The weighted Euclidean norm $\|x\|_w$ of a vector $x\in \mathbb{R}^d$ with weights $w\in \mathbb{R}^d$ is the Euclidean norm where the contribution of each dimension is scaled by a given weight. Approaches to dimensionality reduction that satisfy the Johnson-Lindenstrauss (JL) lemma can be easily adapted to the weighted Euclidean distance if weights are fixed: it suffices to scale each dimension o… ▽ More

    Submitted 14 December, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

  31. arXiv:2209.09688  [pdf, other

    cs.LG cs.CR

    Sparse Vicious Attacks on Graph Neural Networks

    Authors: Giovanni Trappolini, Valentino Maiorca, Silvio Severino, Emanuele Rodolà, Fabrizio Silvestri, Gabriele Tolomei

    Abstract: Graph Neural Networks (GNNs) have proven to be successful in several predictive modeling tasks for graph-structured data. Amongst those tasks, link prediction is one of the fundamental problems for many real-world applications, such as recommender systems. However, GNNs are not immune to adversarial attacks, i.e., carefully crafted malicious examples that are designed to fool the predictive mo… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

  32. arXiv:2208.04222  [pdf, other

    cs.IR cs.AI cs.LG

    GREASE: Generate Factual and Counterfactual Explanations for GNN-based Recommendations

    Authors: Ziheng Chen, Fabrizio Silvestri, Jia Wang, Yongfeng Zhang, Zhenhua Huang, Hongshik Ahn, Gabriele Tolomei

    Abstract: Recently, graph neural networks (GNNs) have been widely used to develop successful recommender systems. Although powerful, it is very difficult for a GNN-based recommender system to attach tangible explanations of why a specific item ends up in the list of suggestions for a given user. Indeed, explaining GNN-based recommendations is unique, and existing GNN explanation methods are inappropriate fo… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

  33. arXiv:2207.13586  [pdf, other

    cs.LG cs.AI cs.LO

    Encoding Concepts in Graph Neural Networks

    Authors: Lucie Charlotte Magister, Pietro Barbiero, Dmitry Kazhdan, Federico Siciliano, Gabriele Ciravegna, Fabrizio Silvestri, Mateja Jamnik, Pietro Lio

    Abstract: The opaque reasoning of Graph Neural Networks induces a lack of human trust. Existing graph network explainers attempt to address this issue by providing post-hoc explanations, however, they fail to make the model itself more interpretable. To fill this gap, we introduce the Concept Encoder Module, the first differentiable concept-discovery approach for graph networks. The proposed approach makes… ▽ More

    Submitted 7 August, 2022; v1 submitted 27 July, 2022; originally announced July 2022.

  34. arXiv:2205.04274  [pdf, other

    cs.CL cs.AI cs.CV

    Detecting and Understanding Harmful Memes: A Survey

    Authors: Shivam Sharma, Firoj Alam, Md. Shad Akhtar, Dimitar Dimitrov, Giovanni Da San Martino, Hamed Firooz, Alon Halevy, Fabrizio Silvestri, Preslav Nakov, Tanmoy Chakraborty

    Abstract: The automatic identification of harmful content online is of major concern for social media platforms, policymakers, and society. Researchers have studied textual, visual, and audio content, but typically in isolation. Yet, harmful content often combines multiple modalities, as in the case of memes, which are of particular interest due to their viral nature. With this in mind, here we offer a comp… ▽ More

    Submitted 29 May, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: Accepted at IJCAI-ECAI 2022 (Survey Track) - Editorial Feedback Revised, 9 pages (7 main + 2 reference pages)

  35. arXiv:2202.05868  [pdf, other

    cs.DC

    Blocking Techniques for Sparse Matrix Multiplication on Tensor Accelerators

    Authors: Paolo Sylos Labini, Massimo Bernaschi, Francesco Silvestri, Flavio Vella

    Abstract: Tensor accelerators have gained popularity because they provide a cheap and efficient solution for speeding up computational-expensive tasks in Deep Learning and, more recently, in other Scientific Computing applications. However, since their features are specifically designed for tensor algebra (typically dense matrix-product), it is commonly assumed that they are not suitable for applications wi… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: 12 pages, 14 images

  36. ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models

    Authors: Ziheng Chen, Fabrizio Silvestri, Jia Wang, He Zhu, Hongshik Ahn, Gabriele Tolomei

    Abstract: Counterfactual examples (CFs) are one of the most popular methods for attaching post-hoc explanations to machine learning (ML) models. However, existing CF generation methods either exploit the internals of specific models or depend on each sample's neighborhood, thus they are hard to generalize for complex models and inefficient for large datasets. This work aims to overcome these limitations and… ▽ More

    Submitted 8 August, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

  37. arXiv:2110.02775  [pdf, other

    cs.NE cs.AI cs.LG

    NEWRON: A New Generalization of the Artificial Neuron to Enhance the Interpretability of Neural Networks

    Authors: Federico Siciliano, Maria Sofia Bucarelli, Gabriele Tolomei, Fabrizio Silvestri

    Abstract: In this work, we formulate NEWRON: a generalization of the McCulloch-Pitts neuron structure. This new framework aims to explore additional desirable properties of artificial neurons. We show that some specializations of NEWRON allow the network to be interpretable with no change in their expressiveness. By just inspecting the models produced by our NEWRON-based networks, we can understand the rule… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

  38. arXiv:2109.08013  [pdf, other

    cs.CV cs.CL cs.LG cs.MM

    Detecting Propaganda Techniques in Memes

    Authors: Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino

    Abstract: Propaganda can be defined as a form of communication that aims to influence the opinions or the actions of people towards a specific goal; this is achieved by means of well-defined rhetorical and psychological devices. Propaganda, in the form we know it today, can be dated back to the beginning of the 17th century. However, it is with the advent of the Internet and the social media that it has sta… ▽ More

    Submitted 7 August, 2021; originally announced September 2021.

    Comments: propaganda, disinformation, fake news, memes, multimodality. arXiv admin note: text overlap with arXiv:2105.09284

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: ACL-2021

  39. arXiv:2107.00761  [pdf, other

    cs.DS cs.CE cs.LG cs.SI math.OC

    On the Bike Spreading Problem

    Authors: Elia Costa, Francesco Silvestri

    Abstract: A free-floating bike-sharing system (FFBSS) is a dockless rental system where an individual can borrow a bike and returns it anywhere, within the service area. To improve the rental service, available bikes should be distributed over the entire service area: a customer leaving from any position is then more likely to find a near bike and then to use the service. Moreover, spreading bikes among the… ▽ More

    Submitted 18 August, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: Proc. 21st Symposium on Algorithmic Approaches for Transportation Modelling, Optimization, and Systems (ATMOS 2021)

  40. arXiv:2106.01074  [pdf, other

    cs.CL cs.AI cs.DB

    Database Reasoning Over Text

    Authors: James Thorne, Majid Yazdani, Marzieh Saeidi, Fabrizio Silvestri, Sebastian Riedel, Alon Halevy

    Abstract: Neural models have shown impressive performance gains in answering queries from natural language text. However, existing works are unable to support database queries, such as "List/Count all female athletes who were born in 20th century", which require reasoning over sets of relevant facts with operations such as join, filtering and aggregation. We show that while state-of-the-art transformer mode… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: To appear at ACL2021

  41. arXiv:2105.09284  [pdf, other

    cs.MM cs.CL cs.LG

    SemEval-2021 Task 6: Detection of Persuasion Techniques in Texts and Images

    Authors: Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino

    Abstract: We describe SemEval-2021 task 6 on Detection of Persuasion Techniques in Texts and Images: the data, the annotation guidelines, the evaluation setup, the results, and the participating systems. The task focused on memes and had three subtasks: (i) detecting the techniques in the text, (ii) detecting the text spans where the techniques are used, and (iii) detecting techniques in the entire meme, i.… ▽ More

    Submitted 25 April, 2021; originally announced May 2021.

    Comments: propaganda, disinformation, misinformation, fake news, memes, multimodality

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

    Journal ref: SemEval-2021

  42. arXiv:2104.00353  [pdf, other

    eess.AS cs.LG

    CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN

    Authors: Giorgio Barnabò, Giovanni Trappolini, Lorenzo Lastilla, Cesare Campagnano, Angela Fan, Fabio Petroni, Fabrizio Silvestri

    Abstract: The two main research threads in computer-based music generation are: the construction of autonomous music-making systems, and the design of computer-based environments to assist musicians. In the symbolic domain, the key problem of automatically arranging a piece music was extensively studied, while relatively fewer systems tackled this challenge in the audio domain. In this contribution, we prop… ▽ More

    Submitted 9 April, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: 9 pages, 5 figures, submitted to IEEE Transactions on Multimedia, the authors contributed equally to this work

  43. arXiv:2103.12541  [pdf, other

    cs.MM cs.AI cs.CL cs.CR cs.CY cs.LG cs.SI

    A Survey on Multimodal Disinformation Detection

    Authors: Firoj Alam, Stefano Cresci, Tanmoy Chakraborty, Fabrizio Silvestri, Dimiter Dimitrov, Giovanni Da San Martino, Shaden Shaar, Hamed Firooz, Preslav Nakov

    Abstract: Recent years have witnessed the proliferation of offensive content online such as fake news, propaganda, misinformation, and disinformation. While initially this was mostly about textual content, over time images and videos gained popularity, as they are much easier to consume, attract more attention, and spread further than text. As a result, researchers started leveraging different modalities an… ▽ More

    Submitted 28 September, 2022; v1 submitted 13 March, 2021; originally announced March 2021.

    Comments: Accepted at COLING-2022, disinformation, misinformation, factuality, harmfulness, fake news, propaganda, multimodality, text, images, videos, network structure, temporality

    MSC Class: 68T50 ACM Class: I.2.7

  44. arXiv:2102.03322  [pdf, other

    cs.LG cs.AI

    CF-GNNExplainer: Counterfactual Explanations for Graph Neural Networks

    Authors: Ana Lucic, Maartje ter Hoeve, Gabriele Tolomei, Maarten de Rijke, Fabrizio Silvestri

    Abstract: Given the increasing promise of graph neural networks (GNNs) in real-world applications, several methods have been developed for explaining their predictions. Existing methods for interpreting predictions from GNNs have primarily focused on generating subgraphs that are especially relevant for a particular prediction. However, such methods are not counterfactual (CF) in nature: given a prediction,… ▽ More

    Submitted 22 February, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Accepted to AISTATS 2022

  45. arXiv:2101.10905  [pdf, other

    cs.DS cs.DB cs.LG

    Sampling a Near Neighbor in High Dimensions -- Who is the Fairest of Them All?

    Authors: Martin Aumüller, Sariel Har-Peled, Sepideh Mahabadi, Rasmus Pagh, Francesco Silvestri

    Abstract: Similarity search is a fundamental algorithmic primitive, widely used in many computer science disciplines. Given a set of points $S$ and a radius parameter $r>0$, the $r$-near neighbor ($r$-NN) problem asks for a data structure that, given any query point $q$, returns a point $p$ within distance at most $r$ from $q$. In this paper, we study the $r$-NN problem in the light of individual fairness a… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: arXiv admin note: text overlap with arXiv:1906.02640

  46. arXiv:2010.06973  [pdf, other

    cs.CL cs.DB cs.LG

    Neural Databases

    Authors: James Thorne, Majid Yazdani, Marzieh Saeidi, Fabrizio Silvestri, Sebastian Riedel, Alon Halevy

    Abstract: In recent years, neural networks have shown impressive performance gains on long-standing AI problems, and in particular, answering queries from natural language text. These advances raise the question of whether they can be extended to a point where we can relax the fundamental assumption of database management, namely, that our data is represented as fields of a pre-defined schema. This paper… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: Submitted to PVLDB vol 14

  47. arXiv:2009.10311  [pdf, other

    cs.SI cs.AI

    Preserving Integrity in Online Social Networks

    Authors: Alon Halevy, Cristian Canton Ferrer, Hao Ma, Umut Ozertem, Patrick Pantel, Marzieh Saeidi, Fabrizio Silvestri, Ves Stoyanov

    Abstract: Online social networks provide a platform for sharing information and free expression. However, these networks are also used for malicious purposes, such as distributing misinformation and hate speech, selling illegal drugs, and coordinating sex trafficking or child exploitation. This paper surveys the state of the art in kee** online platforms and their users safe from such harm, also known as… ▽ More

    Submitted 25 September, 2020; v1 submitted 22 September, 2020; originally announced September 2020.

  48. arXiv:2006.12608  [pdf, ps, other

    cs.DS cs.DC cs.IR

    Similarity Search with Tensor Core Units

    Authors: Thomas D. Ahle, Francesco Silvestri

    Abstract: Tensor Core Units (TCUs) are hardware accelerators developed for deep neural networks, which efficiently support the multiplication of two dense $\sqrt{m}\times \sqrt{m}$ matrices, where $m$ is a given hardware parameter. In this paper, we show that TCUs can speed up similarity search problems as well. We propose algorithms for the Johnson-Lindenstrauss dimensionality reduction and for similarity… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

  49. arXiv:2006.00937  [pdf, ps, other

    cs.LG cs.IR stat.ML

    Concept Matching for Low-Resource Classification

    Authors: Federico Errica, Ludovic Denoyer, Bora Edizel, Fabio Petroni, Vassilis Plachouras, Fabrizio Silvestri, Sebastian Riedel

    Abstract: We propose a model to tackle classification tasks in the presence of very little training data. To this aim, we approximate the notion of exact match with a theoretically sound mechanism that computes a probability of matching in the input space. Importantly, the model learns to focus on elements of the input that are relevant for the task at hand; by leveraging highlighted portions of the trainin… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

  50. arXiv:2004.06423  [pdf

    physics.optics physics.app-ph

    Color filter arrays based on dielectric metasurface elements

    Authors: Jonas Berzins, Fabrizio Silvestri, Giampiero Gerini, Frank Setzpfandt, Thomas Pertsch, Stefan M. B. Bäumer

    Abstract: Digital imaging has been steadily improving over the past decades and we are moving towards a wide use of multi- and hyperspectral cameras. A key component of such imaging systems are color filter arrays, which define the spectrum of light detected by each camera pixel. Hence, it is essential to develop a variable, robust and scalable way for controlling the transmission of light. Nanostructured s… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

    Journal ref: SPIE Proc. 10671, 106711F (2018)