Skip to main content

Showing 1–50 of 89 results for author: Bianchi, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07496  [pdf, other

    cs.CL cs.AI cs.LG

    TextGrad: Automatic "Differentiation" via Text

    Authors: Mert Yuksekgonul, Federico Bianchi, Joseph Boen, Sheng Liu, Zhi Huang, Carlos Guestrin, James Zou

    Abstract: AI is undergoing a paradigm shift, with breakthroughs achieved by systems orchestrating multiple large language models (LLMs) and other complex components. As a result, develo** principled and automated optimization methods for compound AI systems is one of the most important new challenges. Neural networks faced a similar challenge in its early days until backpropagation and automatic different… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 41 pages, 6 figures

  2. arXiv:2402.13926  [pdf, other

    cs.CL cs.AI

    Large Language Models are Vulnerable to Bait-and-Switch Attacks for Generating Harmful Content

    Authors: Federico Bianchi, James Zou

    Abstract: The risks derived from large language models (LLMs) generating deceptive and damaging content have been the subject of considerable research, but even safe generations can lead to problematic downstream impacts. In our study, we shift the focus to how even safe text coming from LLMs can be easily turned into potentially dangerous content through Bait-and-Switch attacks. In such attacks, the user f… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  3. arXiv:2402.10634  [pdf, other

    cs.LG cs.AI

    Graph-based Forecasting with Missing Data through Spatiotemporal Downsampling

    Authors: Ivan Marisca, Cesare Alippi, Filippo Maria Bianchi

    Abstract: Given a set of synchronous time series, each associated with a sensor-point in space and characterized by inter-series relationships, the problem of spatiotemporal forecasting consists of predicting future observations for each point. Spatiotemporal graph neural networks achieve striking results by representing the relationships across time series as a graph. Nonetheless, most existing methods rel… ▽ More

    Submitted 8 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024

  4. arXiv:2402.05863  [pdf, other

    cs.AI cs.CL cs.GT

    How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis

    Authors: Federico Bianchi, Patrick John Chia, Mert Yuksekgonul, Jacopo Tagliabue, Dan Jurafsky, James Zou

    Abstract: Negotiation is the basis of social interactions; humans negotiate everything from the price of cars to how to share common resources. With rapidly growing interest in using large language models (LLMs) to act as agents on behalf of human users, such LLM agents would also need to be able to negotiate. In this paper, we study how well LLMs can negotiate with each other. We develop NegotiationArena:… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  5. Scalable and automated Evaluation of Blue Team cyber posture in Cyber Ranges

    Authors: Federica Bianchi, Enrico Bassetti, Angelo Spognardi

    Abstract: Cyber ranges are virtual training ranges that have emerged as indispensable environments for conducting secure exercises and simulating real or hypothetical scenarios. These complex computational infrastructures enable the simulation of attacks, facilitating the evaluation of defense tools and methodologies and develo** novel countermeasures against threats. One of the main challenges of cyber r… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  6. arXiv:2309.11118  [pdf, other

    cs.RO eess.SY

    Vehicle-to-Grid and ancillary services:a profitability analysis under uncertainty

    Authors: Federico Bianchi, Alessandro Falsone, Riccardo Vignali

    Abstract: The rapid and massive diffusion of electric vehicles poses new challenges to the electric system, which must be able to supply these new loads, but at the same time opens up new opportunities thanks to the possible provision of ancillary services. Indeed, in the so-called Vehicle-to-Grid (V2G) set-up, the charging power can be modulated throughout the day so that a fleet of vehicles can absorb an… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Accepted by IFAC for publication under a Creative Commons Licence CC-BY-NC-ND

  7. arXiv:2309.07875  [pdf, other

    cs.CL

    Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

    Authors: Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Röttger, Dan Jurafsky, Tatsunori Hashimoto, James Zou

    Abstract: Training large language models to follow instructions makes them perform better on a wide range of tasks and generally become more helpful. However, a perfectly helpful model will follow even the most malicious instructions and readily generate harmful content. In this paper, we raise concerns over the safety of models that only emphasize helpfulness, not harmlessness, in their instruction-tuning.… ▽ More

    Submitted 19 March, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  8. arXiv:2308.12844  [pdf, other

    cs.LG

    Probabilistic load forecasting with Reservoir Computing

    Authors: Michele Guerra, Simone Scardapane, Filippo Maria Bianchi

    Abstract: Some applications of deep learning require not only to provide accurate results but also to quantify the amount of confidence in their prediction. The management of an electric power grid is one of these cases: to avoid risky scenarios, decision-makers need both precise and reliable forecasts of, for example, power loads. For this reason, point forecasts are not enough hence it is necessary to ado… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  9. arXiv:2308.01263  [pdf, other

    cs.CL cs.AI

    XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models

    Authors: Paul Röttger, Hannah Rose Kirk, Bertie Vidgen, Giuseppe Attanasio, Federico Bianchi, Dirk Hovy

    Abstract: Without proper safeguards, large language models will readily follow malicious instructions and generate toxic content. This risk motivates safety efforts such as red-teaming and large-scale feedback learning, which aim to make models both helpful and harmless. However, there is a tension between these two objectives, since harmlessness requires models to refuse to comply with unsafe prompts, and… ▽ More

    Submitted 1 April, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: Accepted at NAACL 2024 (Main Conference)

  10. arXiv:2304.10621  [pdf, other

    cs.IR

    E Pluribus Unum: Guidelines on Multi-Objective Evaluation of Recommender Systems

    Authors: Patrick John Chia, Giuseppe Attanasio, Jacopo Tagliabue, Federico Bianchi, Ciro Greco, Gabriel de Souza P. Moreira, Davide Eynard, Fahd Husain

    Abstract: Recommender Systems today are still mostly evaluated in terms of accuracy, with other aspects beyond the immediate relevance of recommendations, such as diversity, long-term user retention and fairness, often taking a back seat. Moreover, reconciling multiple performance perspectives is by definition indeterminate, presenting a stumbling block to those in the pursuit of rounded evaluation of Recom… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 15 pages, under submission

  11. arXiv:2304.07152  [pdf, other

    cs.LG

    Combining Stochastic Explainers and Subgraph Neural Networks can Increase Expressivity and Interpretability

    Authors: Indro Spinelli, Michele Guerra, Filippo Maria Bianchi, Simone Scardapane

    Abstract: Subgraph-enhanced graph neural networks (SGNN) can increase the expressive power of the standard message-passing framework. This model family represents each graph as a collection of subgraphs, generally extracted by random sampling or with hand-crafted heuristics. Our key observation is that by selecting "meaningful" subgraphs, besides improving the expressivity of a GNN, it is also possible to o… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

  12. arXiv:2304.07145  [pdf, ps, other

    cs.IR cs.CY

    EvalRS 2023. Well-Rounded Recommender Systems For Real-World Deployments

    Authors: Federico Bianchi, Patrick John Chia, Ciro Greco, Claudio Pomo, Gabriel Moreira, Davide Eynard, Fahd Husain, Jacopo Tagliabue

    Abstract: EvalRS aims to bring together practitioners from industry and academia to foster a debate on rounded evaluation of recommender systems, with a focus on real-world impact across a multitude of deployment scenarios. Recommender systems are often evaluated only through accuracy metrics, which fall short of fully characterizing their generalization capabilities and miss important aspects, such as fair… ▽ More

    Submitted 22 July, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: EvalRS 2023 is a workshop at KDD23. Code and hackathon materials: https://github.com/RecList/evalRS-KDD-2023

  13. arXiv:2304.01575  [pdf, other

    cs.LG

    The expressive power of pooling in Graph Neural Networks

    Authors: Filippo Maria Bianchi, Veronica Lachi

    Abstract: In Graph Neural Networks (GNNs), hierarchical pooling operators generate local summaries of the data by coarsening the graph structure and the vertex features. While considerable attention has been devoted to analyzing the expressive power of message-passing (MP) layers in GNNs, a study on how graph pooling affects the expressiveness of a GNN is still lacking. Additionally, despite the recent adva… ▽ More

    Submitted 12 October, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  14. arXiv:2212.09056  [pdf, other

    cs.CL

    Beyond Digital "Echo Chambers": The Role of Viewpoint Diversity in Political Discussion

    Authors: Rishav Hada, Amir Ebrahimi Fard, Sarah Shugars, Federico Bianchi, Patricia Rossini, Dirk Hovy, Rebekah Tromble, Nava Tintarev

    Abstract: Increasingly taking place in online spaces, modern political conversations are typically perceived to be unproductively affirming -- siloed in so called ``echo chambers'' of exclusively like-minded discussants. Yet, to date we lack sufficient means to measure viewpoint diversity in conversations. To this end, in this paper, we operationalize two viewpoint metrics proposed for recommender systems a… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: Camera-ready version in WSDM 2023

  15. arXiv:2211.06218  [pdf, other

    cs.LG

    Total Variation Graph Neural Networks

    Authors: Jonas Berg Hansen, Filippo Maria Bianchi

    Abstract: Recently proposed Graph Neural Networks (GNNs) for vertex clustering are trained with an unsupervised minimum cut objective, approximated by a Spectral Clustering (SC) relaxation. However, the SC relaxation is loose and, while it offers a closed-form solution, it also yields overly smooth cluster assignments that poorly separate the vertices. In this paper, we propose a GNN model that computes clu… ▽ More

    Submitted 27 April, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

  16. arXiv:2211.04281  [pdf, other

    cs.CL

    SocioProbe: What, When, and Where Language Models Learn about Sociodemographics

    Authors: Anne Lauscher, Federico Bianchi, Samuel Bowman, Dirk Hovy

    Abstract: Pre-trained language models (PLMs) have outperformed other NLP models on a wide range of tasks. Opting for a more thorough understanding of their capabilities and inner workings, researchers have established the extend to which they capture lower-level knowledge like grammaticality, and mid-level semantic knowledge like factual understanding. However, there is still little understanding of their k… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted for publication at EMNLP 2022

  17. Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale

    Authors: Federico Bianchi, Pratyusha Kalluri, Esin Durmus, Faisal Ladhak, Myra Cheng, Debora Nozza, Tatsunori Hashimoto, Dan Jurafsky, James Zou, Aylin Caliskan

    Abstract: Machine learning models that convert user-written text descriptions into images are now widely available online and used by millions of users to generate millions of images a day. We investigate the potential for these models to amplify dangerous and complex stereotypes. We find a broad range of ordinary prompts produce stereotypes, including prompts simply mentioning traits, descriptors, occupati… ▽ More

    Submitted 7 June, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: FAccT 2023 paper. The published version is available at 10.1145/3593013.3594095

  18. arXiv:2210.15870  [pdf, other

    cs.CL

    "It's Not Just Hate'': A Multi-Dimensional Perspective on Detecting Harmful Speech Online

    Authors: Federico Bianchi, Stefanie Anja Hills, Patricia Rossini, Dirk Hovy, Rebekah Tromble, Nava Tintarev

    Abstract: Well-annotated data is a prerequisite for good Natural Language Processing models. Too often, though, annotation decisions are governed by optimizing time or annotator agreement. We make a case for nuanced efforts in an interdisciplinary setting for annotating offensive online speech. Detecting offensive content is rapidly becoming one of the most important real-world NLP tasks. However, most data… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  19. arXiv:2210.14763  [pdf, other

    cs.CL

    ProSiT! Latent Variable Discovery with PROgressive SImilarity Thresholds

    Authors: Tommaso Fornaciari, Dirk Hovy, Federico Bianchi

    Abstract: The most common ways to explore latent document dimensions are topic models and clustering methods. However, topic models have several drawbacks: e.g., they require us to choose the number of latent dimensions a priori, and the results are stochastic. Most clustering methods have the same issues and lack flexibility in various ways, such as not accounting for the influence of different topics on s… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  20. arXiv:2210.11359  [pdf, other

    cs.CL

    Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages

    Authors: Paul Röttger, Debora Nozza, Federico Bianchi, Dirk Hovy

    Abstract: Hate speech is a global phenomenon, but most hate speech datasets so far focus on English-language content. This hinders the development of more effective hate speech detection models in hundreds of languages spoken by billions across the world. More data is needed, but annotating hateful content is expensive, time-consuming and potentially harmful to annotators. To mitigate these issues, we explo… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022 (Main Conference)

  21. arXiv:2210.07365  [pdf, other

    cs.CL

    Is It Worth the (Environmental) Cost? Limited Evidence for Temporal Adaptation via Continuous Training

    Authors: Giuseppe Attanasio, Debora Nozza, Federico Bianchi, Dirk Hovy

    Abstract: Language is constantly changing and evolving, leaving language models to become quickly outdated. Consequently, we should continuously update our models with new data to expose them to new events and facts. However, that requires additional computing, which means new carbon emissions. Do any measurable benefits justify this cost? This paper looks for empirical evidence to support continuous traini… ▽ More

    Submitted 4 May, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 8 pages

  22. arXiv:2210.01936  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    When and why vision-language models behave like bags-of-words, and what to do about it?

    Authors: Mert Yuksekgonul, Federico Bianchi, Pratyusha Kalluri, Dan Jurafsky, James Zou

    Abstract: Despite the success of large vision and language models (VLMs) in many downstream applications, it is unclear how well they encode compositional information. Here, we create the Attribution, Relation, and Order (ARO) benchmark to systematically evaluate the ability of VLMs to understand different types of relationships, attributes, and order. ARO consists of Visual Genome Attribution, to test the… ▽ More

    Submitted 23 March, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 Oral (notable-top-5%)

  23. Explainability in subgraphs-enhanced Graph Neural Networks

    Authors: Michele Guerra, Indro Spinelli, Simone Scardapane, Filippo Maria Bianchi

    Abstract: Recently, subgraphs-enhanced Graph Neural Networks (SGNNs) have been introduced to enhance the expressive power of Graph Neural Networks (GNNs), which was proved to be not higher than the 1-dimensional Weisfeiler-Leman isomorphism test. The new paradigm suggests using subgraphs extracted from the input graph to improve the model's expressiveness, but the additional complexity exacerbates an alread… ▽ More

    Submitted 19 January, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: The source code implementing our workflow is publicly available online at https://github.com/MicheleUIT/Explaining_SGNN

  24. arXiv:2209.06520  [pdf, other

    cs.LG cs.AI

    Scalable Spatiotemporal Graph Neural Networks

    Authors: Andrea Cini, Ivan Marisca, Filippo Maria Bianchi, Cesare Alippi

    Abstract: Neural forecasting of spatiotemporal time series drives both research and industrial innovation in several relevant application domains. Graph neural networks (GNNs) are often the core component of the forecasting architecture. However, in most spatiotemporal GNNs, the computational complexity scales up to a quadratic factor with the length of the sequence times the number of links in the graph, h… ▽ More

    Submitted 20 February, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Published as conference paper at AAAI 23

  25. Real-Time Oil Leakage Detection on Aftermarket Motorcycle Dam** System with Convolutional Neural Networks

    Authors: Federico Bianchi, Stefano Speziali, Andrea Marini, Massimiliano Proietti, Lorenzo Menculini, Alberto Garinei, Gabriele Bellani, Marcello Marconi

    Abstract: In this work, we describe in detail how Deep Learning and Computer Vision can help to detect fault events of the AirTender system, an aftermarket motorcycle dam** system component. One of the most effective ways to monitor the AirTender functioning is to look for oil stains on its surface. Starting from real-time images, AirTender is first detected in the motorbike suspension system, simulated i… ▽ More

    Submitted 23 November, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: analysis of literature reviewed, n.2 figures added, minor corrections

  26. Simplifying Clustering with Graph Neural Networks

    Authors: Filippo Maria Bianchi

    Abstract: The objective functions used in spectral clustering are usually composed of two terms: i) a term that minimizes the local quadratic variation of the cluster assignments on the graph and; ii) a term that balances the clustering partition and helps avoiding degenerate solutions. This paper shows that a graph neural network, equipped with suitable message passing layers, can generate good cluster ass… ▽ More

    Submitted 27 November, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

  27. arXiv:2207.05772  [pdf, ps, other

    cs.IR

    EvalRS: a Rounded Evaluation of Recommender Systems

    Authors: Jacopo Tagliabue, Federico Bianchi, Tobias Schnabel, Giuseppe Attanasio, Ciro Greco, Gabriel de Souza P. Moreira, Patrick John Chia

    Abstract: Much of the complexity of Recommender Systems (RSs) comes from the fact that they are used as part of more complex applications and affect user experience through a varied range of user interfaces. However, research focused almost exclusively on the ability of RSs to produce accurate item rankings while giving little attention to the evaluation of RS behavior in real-world scenarios. Such narrow f… ▽ More

    Submitted 12 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: CIKM 2022 Data Challenge Paper

  28. arXiv:2204.03972  [pdf, other

    cs.IR cs.CL

    Contrastive language and vision learning of general fashion concepts

    Authors: Patrick John Chia, Giuseppe Attanasio, Federico Bianchi, Silvia Terragni, Ana Rita Magalhães, Diogo Goncalves, Ciro Greco, Jacopo Tagliabue

    Abstract: The steady rise of online shop** goes hand in hand with the development of increasingly complex ML and NLP models. While most use cases are cast as specialized supervised learning problems, we argue that practitioners would greatly benefit from more transferable representations of products. In this work, we build on recent developments in contrastive learning to train FashionCLIP, a CLIP-like mo… ▽ More

    Submitted 18 April, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Latest version available at https://www.nature.com/articles/s41598-022-23052-9; model available at https://huggingface.co/patrickjohncyh/fashion-clip

  29. arXiv:2204.02473  [pdf, other

    cs.IR cs.AI

    "Does it come in black?" CLIP-like models are zero-shot recommenders

    Authors: Patrick John Chia, Jacopo Tagliabue, Federico Bianchi, Ciro Greco, Diogo Goncalves

    Abstract: Product discovery is a crucial component for online shop**. However, item-to-item recommendations today do not allow users to explore changes along selected dimensions: given a query item, can a model suggest something similar but in a different color? We consider item recommendations of the comparative nature (e.g. "something darker") and show how CLIP-based models can support this use case in… ▽ More

    Submitted 11 April, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted at ACL 2022 (ECNLP)

  30. Recognition of polar lows in Sentinel-1 SAR images with deep learning

    Authors: Jakob Grahn, Filippo Maria Bianchi

    Abstract: In this paper, we explore the possibility of detecting polar lows in C-band SAR images by means of deep learning. Specifically, we introduce a novel dataset consisting of Sentinel-1 images divided into two classes, representing the presence and absence of a maritime mesocyclone, respectively. The dataset is constructed using the ERA5 dataset as baseline and it consists of 2004 annotated images. To… ▽ More

    Submitted 5 September, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: 11 pages (+4 supplementary), 11 figures (+2 supplementary)

  31. arXiv:2203.07080  [pdf, other

    cs.LG physics.data-an

    Probabilistic forecasts of wind power generation in regions with complex topography using deep learning methods: An Arctic case

    Authors: Odin Foldvik Eikeland, Finn Dag Hovem, Tom Eirik Olsen, Matteo Chiesa, Filippo Maria Bianchi

    Abstract: The energy market relies on forecasting capabilities of both demand and power generation that need to be kept in dynamic balance. Today, when it comes to renewable energy generation, such decisions are increasingly made in a liberalized electricity market environment, where future power generation must be offered through contracts and auction mechanisms, hence based on forecasts. The increased sha… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: 16 pages, 8 Figures, 4 Tables

  32. Ensemble Conformalized Quantile Regression for Probabilistic Time Series Forecasting

    Authors: Vilde Jensen, Filippo Maria Bianchi, Stian Norman Anfinsen

    Abstract: This paper presents a novel probabilistic forecasting method called ensemble conformalized quantile regression (EnCQR). EnCQR constructs distribution-free and approximately marginally valid prediction intervals (PIs), which are suitable for nonstationary and heteroscedastic time series data. EnCQR can be applied on top of a generic forecasting model, including deep learning architectures. EnCQR ex… ▽ More

    Submitted 6 November, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2022

  33. arXiv:2201.10986  [pdf, other

    cs.CL

    Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data

    Authors: Federico Bianchi, Vincenzo Cutrona, Dirk Hovy

    Abstract: Twitter data have become essential to Natural Language Processing (NLP) and social science research, driving various scientific discoveries in recent years. However, the textual data alone are often not enough to conduct studies: especially social scientists need more variables to perform their analysis and control for various factors. How we augment this information, such as users' location, age,… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

  34. arXiv:2111.09963  [pdf, other

    cs.IR cs.AI cs.LG

    Beyond NDCG: behavioral testing of recommender systems with RecList

    Authors: Patrick John Chia, Jacopo Tagliabue, Federico Bianchi, Chloe He, Brian Ko

    Abstract: As with most Machine Learning systems, recommender systems are typically evaluated through performance metrics computed over held-out data points. However, real-world behavior is undoubtedly nuanced: ad hoc error analysis and deployment-specific tests must be employed to ensure the desired quality in actual deployments. In this paper, we propose RecList, a behavioral-based testing methodology. Rec… ▽ More

    Submitted 27 March, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: Paper accepted to the WebConf 2022

  35. Power Flow Balancing with Decentralized Graph Neural Networks

    Authors: Jonas Berg Hansen, Stian Normann Anfinsen, Filippo Maria Bianchi

    Abstract: We propose an end-to-end framework based on a Graph Neural Network (GNN) to balance the power flows in energy grids. The balancing is framed as a supervised vertex regression task, where the GNN is trained to predict the current and power injections at each grid branch that yield a power flow balance. By representing the power grid as a line graph with branches as vertices, we can train a GNN that… ▽ More

    Submitted 11 August, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

  36. Understanding Pooling in Graph Neural Networks

    Authors: Daniele Grattarola, Daniele Zambon, Filippo Maria Bianchi, Cesare Alippi

    Abstract: Inspired by the conventional pooling layers in convolutional neural networks, many recent works in the field of graph machine learning have introduced pooling operators to reduce the size of graphs. The great variety in the literature stems from the many possible strategies for coarsening a graph, which may depend on different assumptions on the graph structure or the specific downstream task. In… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: 10 pages, 6 figures

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems (Volume: 35, Issue: 2, February 2024)

  37. arXiv:2109.13037  [pdf, other

    cs.CL

    Language Invariant Properties in Natural Language Processing

    Authors: Federico Bianchi, Debora Nozza, Dirk Hovy

    Abstract: Meaning is context-dependent, but many properties of language (should) remain the same even if we transform the context. For example, sentiment, entailment, or speaker properties should be the same in a translation and original of a text. We introduce language invariant properties: i.e., properties that should not change when we transform text, and how they can be used to quantitatively evaluate t… ▽ More

    Submitted 1 October, 2021; v1 submitted 27 September, 2021; originally announced September 2021.

  38. arXiv:2109.07231  [pdf, other

    cs.CL

    SWEAT: Scoring Polarization of Topics across Different Corpora

    Authors: Federico Bianchi, Marco Marelli, Paolo Nicoli, Matteo Palmonari

    Abstract: Understanding differences of viewpoints across corpora is a fundamental task for computational social sciences. In this paper, we propose the Sliced Word Embedding Association Test (SWEAT), a novel statistical measure to compute the relative polarization of a topical wordset across two distributional representations. To this end, SWEAT uses two additional wordsets, deemed to have opposite valence,… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: Published as a conference paper at EMNLP2021

  39. arXiv:2108.08688  [pdf, other

    cs.CL cs.CV

    Contrastive Language-Image Pre-training for the Italian Language

    Authors: Federico Bianchi, Giuseppe Attanasio, Raphael Pisoni, Silvia Terragni, Gabriele Sarti, Sri Lakshmi

    Abstract: CLIP (Contrastive Language-Image Pre-training) is a very recent multi-modal model that jointly learns representations of images and texts. The model is trained on a massive amount of English data and shows impressive performance on zero-shot classification tasks. Training the same model on a different language is not trivial, since data in other languages might be not enough and the model needs hi… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

  40. arXiv:2108.07060  [pdf, other

    cs.LG

    Detecting and interpreting faults in vulnerable power grids with machine learning

    Authors: Odin Foldvik Eikeland, Inga Setså Holmstrand, Sigurd Bakkejord, Matteo Chiesa, Filippo Maria Bianchi

    Abstract: Unscheduled power disturbances cause severe consequences both for customers and grid operators. To defend against such events, it is necessary to identify the causes of interruptions in the power distribution network. In this work, we focus on the power grid of a Norwegian community in the Arctic that experiences several faults whose sources are unknown. First, we construct a data set consisting o… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

  41. arXiv:2104.09423  [pdf, ps, other

    cs.IR

    SIGIR 2021 E-Commerce Workshop Data Challenge

    Authors: Jacopo Tagliabue, Ciro Greco, Jean-Francis Roy, Bingqing Yu, Patrick John Chia, Federico Bianchi, Giovanni Cassani

    Abstract: The 2021 SIGIR workshop on eCommerce is hosting the Coveo Data Challenge for "In-session prediction for purchase intent and recommendations". The challenge addresses the growing need for reliable predictions within the boundaries of a shop** session, as customer intentions can be different depending on the occasion. The need for efficient procedures for personalization is even clearer if we cons… ▽ More

    Submitted 16 July, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: SIGIR eCOM 2021 Data Challenge

  42. arXiv:2104.08874  [pdf, other

    cs.CL

    Language in a (Search) Box: Grounding Language Learning in Real-World Human-Machine Interaction

    Authors: Federico Bianchi, Ciro Greco, Jacopo Tagliabue

    Abstract: We investigate grounded language learning through real-world data, by modelling a teacher-learner dynamics through the natural interactions occurring between users and search engines; in particular, we explore the emergence of semantic generalization from unsupervised dense representations outside of synthetic environments. A grounding domain, a denotation function and a composition function are l… ▽ More

    Submitted 18 April, 2021; originally announced April 2021.

    Comments: Published as a conference paper at NAACL2021

  43. arXiv:2104.04710  [pdf, other

    cs.LG cs.NE stat.ML

    Pyramidal Reservoir Graph Neural Network

    Authors: Filippo Maria Bianchi, Claudio Gallicchio, Alessio Micheli

    Abstract: We propose a deep Graph Neural Network (GNN) model that alternates two types of layers. The first type is inspired by Reservoir Computing (RC) and generates new vertex features by iterating a non-linear map until it converges to a fixed point. The second type of layer implements graph pooling operations, that gradually reduce the support graph and the vertex features, and further improve the compu… ▽ More

    Submitted 10 April, 2021; originally announced April 2021.

    Comments: this is a pre-print version of a paper submitted for journal publication

  44. arXiv:2104.02061  [pdf, other

    cs.IR cs.LG

    Query2Prod2Vec Grounded Word Embeddings for eCommerce

    Authors: Federico Bianchi, Jacopo Tagliabue, Bingqing Yu

    Abstract: We present Query2Prod2Vec, a model that grounds lexical representations for product search in product embeddings: in our model, meaning is a map** between words and a latent space of products in a digital shop. We leverage shop** sessions to learn the underlying space and use merchandising annotations to build lexical analogies for evaluation: our experiments show that our model is more accura… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: Published as a conference paper at NAACL2021 - Industry Track

  45. arXiv:2012.09807  [pdf, other

    cs.CL cs.IR

    BERT Goes Shop**: Comparing Distributional Models for Product Representations

    Authors: Federico Bianchi, Bingqing Yu, Jacopo Tagliabue

    Abstract: Word embeddings (e.g., word2vec) have been applied successfully to eCommerce products through~\textit{prod2vec}. Inspired by the recent performance improvements on several NLP tasks brought by contextualized embeddings, we propose to transfer BERT-like architectures to eCommerce: our model -- ~\textit{Prod2BERT} -- is trained to generate representations of products through masked session modeling.… ▽ More

    Submitted 23 June, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: Updated version. Published as a workshop paper at ECNLP 4 at ACL-IJCNLP 2021

  46. arXiv:2007.14906  [pdf, other

    cs.IR cs.LG

    Fantastic Embeddings and How to Align Them: Zero-Shot Inference in a Multi-Shop Scenario

    Authors: Federico Bianchi, Jacopo Tagliabue, Bingqing Yu, Luca Bigon, Ciro Greco

    Abstract: This paper addresses the challenge of leveraging multiple embedding spaces for multi-shop personalization, proving that zero-shot inference is possible by transferring shop** intent from one website to another without manual intervention. We detail a machine learning pipeline to train and optimize embeddings within shops first, and support the quantitative findings with additional qualitative in… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: accepted at 2020 SIGIR Workshop On eCommerce

  47. arXiv:2006.13575  [pdf, other

    cs.CV cs.LG

    Large-scale detection and categorization of oil spills from SAR images with deep learning

    Authors: Filippo Maria Bianchi, Martine M. Espeseth, Njål Borch

    Abstract: We propose a deep learning framework to detect and categorize oil spills in synthetic aperture radar (SAR) images at a large scale. By means of a carefully designed neural network model for image segmentation trained on an extensive dataset, we are able to obtain state-of-the-art performance in oil spill detection, achieving results that are comparable to results produced by human operators. We al… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

  48. Knowledge Graph Embeddings and Explainable AI

    Authors: Federico Bianchi, Gaetano Rossiello, Luca Costabello, Matteo Palmonari, Pasquale Minervini

    Abstract: Knowledge graph embeddings are now a widely adopted approach to knowledge representation in which entities and relationships are embedded in vector spaces. In this chapter, we introduce the reader to the concept of knowledge graph embeddings by explaining what they are, how they can be generated and how they can be evaluated. We summarize the state-of-the-art in this field by describing the approa… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: Federico Bianchi, Gaetano Rossiello, Luca Costabello, Matteo Plamonari, Pasquale Minervini, Knowledge Graph Embeddings and Explainable AI. In: Ilaria Tiddi, Freddy Lecue, Pascal Hitzler (eds.), Knowledge Graphs for eXplainable AI -- Foundations, Applications and Challenges. Studies on the Semantic Web, IOS Press, Amsterdam, 2020

  49. arXiv:2004.07737  [pdf, other

    cs.CL

    Cross-lingual Contextualized Topic Models with Zero-shot Learning

    Authors: Federico Bianchi, Silvia Terragni, Dirk Hovy, Debora Nozza, Elisabetta Fersini

    Abstract: Many data sets (e.g., reviews, forums, news, etc.) exist parallelly in multiple languages. They all cover the same content, but the linguistic differences make it impossible to use traditional, bag-of-word-based topic models. Models have to be either single-language or suffer from a huge, but extremely sparse vocabulary. Both issues can be addressed by transfer learning. In this paper, we introduc… ▽ More

    Submitted 4 February, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Updated version. Published as a conference paper at EACL2021

  50. arXiv:2004.07011  [pdf, other

    cs.CV

    Code-Aligned Autoencoders for Unsupervised Change Detection in Multimodal Remote Sensing Images

    Authors: Luigi T. Luppino, Mads A. Hansen, Michael Kampffmeyer, Filippo M. Bianchi, Gabriele Moser, Robert Jenssen, Stian N. Anfinsen

    Abstract: Image translation with convolutional autoencoders has recently been used as an approach to multimodal change detection in bitemporal satellite images. A main challenge is the alignment of the code spaces by reducing the contribution of change pixels to the learning of the translation function. Many existing approaches train the networks by exploiting supervised information of the change areas, whi… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.