-
Let Guidelines Guide You: A Prescriptive Guideline-Centered Data Annotation Methodology
Authors:
Federico Ruggeri,
Eleonora Misino,
Arianna Muti,
Katerina Korre,
Paolo Torroni,
Alberto Barrón-Cedeño
Abstract:
We introduce the Guideline-Centered annotation process, a novel data annotation methodology focused on reporting the annotation guidelines associated with each data sample. We identify three main limitations of the standard prescriptive annotation process and describe how the Guideline-Centered methodology overcomes them by reducing the loss of information in the annotation process and ensuring adherence to guidelines. Additionally, we discuss how the Guideline-Centered enables the reuse of annotated data across multiple tasks at the cost of a single human-annotation process.
Submitted 2 July, 2024; v1 submitted 20 June, 2024;
originally announced June 2024.
-
Promoting Fairness and Diversity in Speech Datasets for Mental Health and Neurological Disorders Research
Authors:
Eleonora Mancini,
Ana Tanevska,
Andrea Galassi,
Alessio Galatolo,
Federico Ruggeri,
Paolo Torroni
Abstract:
Current research in machine learning and artificial intelligence is largely centered on modeling and performance evaluation, less so on data collection. However, recent research demonstrated that limitations and biases in data may negatively impact trustworthiness and reliability. These aspects are particularly impactful on sensitive domains such as mental health and neurological disorders, where speech data are used to develop AI applications aimed at improving the health of patients and supporting healthcare providers. In this paper, we chart the landscape of available speech datasets for this domain, to highlight possible pitfalls and opportunities for improvement and promote fairness and diversity. We present a comprehensive list of desiderata for building speech datasets for mental health and neurological disorders and distill it into a checklist focused on ethical concerns to foster more responsible research.
Submitted 6 June, 2024;
originally announced June 2024.
-
Likelihood distortion and Bayesian local robustness
Authors:
Antonio Di Noia,
Fabrizio Ruggeri,
Antonietta Mira
Abstract:
Robust Bayesian analysis has been mainly devoted to detecting and measuring robustness to the prior distribution. Indeed, many contributions in the literature aim to define suitable classes of priors which allow the computation of variations of quantities of interest while the prior changes within those classes. The literature has devoted much less attention to the robustness of Bayesian methods to the likelihood function due to mathematical and computational complexity, and because it is often arguably considered a more objective choice compared to the prior. In this contribution, a new approach to Bayesian local robustness to the likelihood function is proposed and extended to robustness to the prior and to both. This approach is based on the notion of distortion function introduced in the literature on risk theory, and then successfully adopted to build suitable classes of priors for Bayesian global robustness to the prior. The novel robustness measure is a local sensitivity measure that turns out to be very tractable and easy to compute for certain classes of distortion functions. Asymptotic properties are derived and numerical experiments illustrate the theory and its applicability for modelling purposes.
Submitted 23 May, 2024;
originally announced May 2024.
-
PejorativITy: Disambiguating Pejorative Epithets to Improve Misogyny Detection in Italian Tweets
Authors:
Arianna Muti,
Federico Ruggeri,
Cagri Toraman,
Lorenzo Musetti,
Samuel Algherini,
Silvia Ronchi,
Gianmarco Saretto,
Caterina Zapparoli,
Alberto Barrón-Cedeño
Abstract:
Misogyny is often expressed through figurative language. Some neutral words can assume a negative connotation when functioning as pejorative epithets. Disambiguating the meaning of such terms might help the detection of misogyny. In order to address such task, we present PejorativITy, a novel corpus of 1,200 manually annotated Italian tweets for pejorative language at the word level and misogyny at the sentence level. We evaluate the impact of injecting information about disambiguated words into a model targeting misogyny detection. In particular, we explore two different approaches for injection: concatenation of pejorative information and substitution of ambiguous words with univocal terms. Our experimental results, both on our corpus and on two popular benchmarks on Italian tweets, show that both approaches lead to a major classification improvement, indicating that word sense disambiguation is a promising preliminary step for misogyny detection. Furthermore, we investigate LLMs' understanding of pejorative epithets by means of contextual word embeddings analysis and prompting.
Submitted 3 April, 2024;
originally announced April 2024.
-
A bivariate two-state Markov modulated Poisson process for failure modelling
Authors:
Yoel G. Yera,
Rosa E. Lillo,
Bo F. Nielsen,
Pepa Ramírez-Cobo,
Fabrizio Ruggeri
Abstract:
Motivated by a real failure dataset in a two-dimensional context, this paper presents an extension of the Markov modulated Poisson process (MMPP) to two dimensions. The one-dimensional MMPP has been proposed for the modeling of dependent and non-exponential inter-failure times (in contexts as queuing, risk or reliability, among others). The novel two-dimensional MMPP allows for dependence between the two sequences of inter-failure times, while at the same time preserves the MMPP properties, marginally. The generalization is based on the Marshall-Olkin exponential distribution. Inference is undertaken for the new model through a method combining a matching moments approach with an Approximate Bayesian Computation (ABC) algorithm. The performance of the method is shown on simulated and real datasets representing times and distances covered between consecutive failures in a public transport company. For the real dataset, some quantities of importance associated with the reliability of the system are estimated as the probabilities and expected number of failures at different times and distances covered by trains until the occurrence of a failure.
Submitted 26 January, 2024;
originally announced January 2024.
-
Placement of Biological Membrane Patches in a Nanofluidic Gap with Control over Position and Orientation
Authors:
Francesca Ruggeri,
Christian Schwemmer,
Mirko Stauffer,
Philippe M. Nicollier,
Jacqueline Figueiredo da Silva,
Patrick D. Bosshart,
Kirstin Kochems,
Dimitrios Fotiadis,
Armin Knoll,
Heiko Wolf
Abstract:
Purple membranes from the archaeon Halobacterium salinarum consist of two-dimensional crystals of the light-driven proton pump bacteriorhodopsin, which convert photons into a proton gradient across the cell membrane. This functional feature and the structural rigidity make them appealing candidates for integration into biomimetic devices. To this end, and in order to carry out their function, purple membranes must be positioned in the correct orientation at the position of interest. Precise placement and control over the orientation of nanoscale objects still constitutes a formidable challenge. Here we show that isolated purple membrane patches can be transported and positioned at predefined locations in nanofluidic confinement, with control over their orientation at the target sites. The transport is achieved through a rocking Brownian motor scheme, while the controlled deposition of the membranes is realized by engineering the surface potential of a fluid-filled nanofluidic slit. This controlled manipulation of purple membrane patches outlines a new pathway towards the integration of biological or other delicate supramolecular structures into top-down-fabricated patterns, for the assembly of nanoscale hybrid devices that serve as a light-driven source of (chemical) energy.
Submitted 24 November, 2023;
originally announced November 2023.
-
Gate Electrodes Enable Tunable Nanofluidic Particle Traps
Authors:
Philippe M. Nicollier,
Aaron D. Ratschow,
Francesca Ruggeri,
Ute Drechsler,
Steffen Hardt,
Federico Paratore,
Armin W. Knoll
Abstract:
The ability to control the location of nanoscale objects in liquids is essential for fundamental and applied research from nanofluidics to molecular biology. To overcome their random Brownian motion, the electrostatic fluidic trap creates local minima in potential energy by shaping electrostatic interactions with a tailored wall topography. However, this strategy is inherently static -- once fabricated the potential wells cannot be modulated. Here, we propose and experimentally demonstrate that such a trap can be controlled through a buried gate electrode. We measure changes in the average escape times of nanoparticles from the traps to quantify the induced modulations of $0.7\,k_{\mathrm{B}}T$ in potential energy and 50 mV in surface potential. Finally, we summarize the mechanism in a parameter-free predictive model, including surface chemistry and electrostatic fringing, that reproduces the experimental results. Our findings open a route towards real-time controllable nanoparticle traps.
Submitted 22 September, 2023;
originally announced September 2023.
-
Dependent Cluster Mapping (DCMAP): Optimal clustering of directed acyclic graphs for statistical inference
Authors:
Paul Pao-Yen Wu,
Fabrizio Ruggeri,
Kerrie Mengersen
Abstract:
A Directed Acyclic Graph (DAG) can be partitioned or mapped into clusters to support and make inference more computationally efficient in Bayesian Network (BN), Markov process and other models. However, optimal partitioning with an arbitrary cost function is challenging, especially in statistical inference as the local cluster cost is dependent on both nodes within a cluster, and the mapping of clusters connected via parent and/or child nodes, which we call dependent clusters. We propose a novel algorithm called DCMAP for optimal cluster mapping with dependent clusters. Given an arbitrarily defined, positive cost function based on the DAG, we show that DCMAP converges to find all optimal clusters, and returns near-optimal solutions along the way. Empirically, we find that the algorithm is time-efficient for a Dynamic BN (DBN) model of a seagrass complex system using a computation cost function. For a 25 and 50-node DBN, the search space size was $9.91\times 10^9$ and $1.51\times10^{21}$ possible cluster mappings, and the first optimal solution was found at iteration 934 $(\text{95\% CI } 926,971)$, and 2256 $(2150,2271)$ with a cost that was 4\% and 0.2\% of the naive heuristic cost, respectively.
Submitted 7 February, 2024; v1 submitted 7 August, 2023;
originally announced August 2023.
-
A Corpus for Sentence-level Subjectivity Detection on English News Articles
Authors:
Francesco Antici,
Andrea Galassi,
Federico Ruggeri,
Katerina Korre,
Arianna Muti,
Alessandra Bardi,
Alice Fedotova,
Alberto Barrón-Cedeño
Abstract:
We develop novel annotation guidelines for sentence-level subjectivity detection, which are not limited to language-specific cues. We use our guidelines to collect NewsSD-ENG, a corpus of 638 objective and 411 subjective sentences extracted from English news articles on controversial topics. Our corpus paves the way for subjectivity detection in English and across other languages without relying on language-specific tools, such as lexicons or machine translation. We evaluate state-of-the-art multilingual transformer-based models on the task in mono-, multi-, and cross-language settings. For this purpose, we re-annotate an existing Italian corpus. We observe that models trained in the multilingual setting achieve the best performance on the task.
Submitted 24 May, 2024; v1 submitted 29 May, 2023;
originally announced May 2023.
-
A predictive model for planning emergency events rescue during COVID-19 in Lombardy, Italy
Authors:
Angela Andreella,
Antonietta Mira,
Spyros Balafas,
Ernst C. Wit,
Fabrizio Ruggeri,
Giovanni Nattino,
Giulia Ghilardi,
Guido Bertolini
Abstract:
Italy, particularly the Lombardy region, was among the first countries outside of Asia to report cases of COVID-19. The emergency medical service called Regional Emergency Agency (AREU) coordinates the intra- and inter-regional non-hospital emergency network and the European emergency number service in Lombardy. AREU must deal with daily and seasonal variations of call volume. The number and type of emergency calls changed dramatically during the COVID-19 pandemic. A model to predict incoming calls and how many of these turn into events, i.e., dispatch of transport and equipment until the rescue is completed, was developed to address the emergency period. We used the generalized additive model with a negative binomial family to predict the number of events one, two, five, and seven days ahead. The over-dispersion of the data was tackled by using the negative binomial family and the nonlinear relationship between the number of events and covariates (e.g., seasonal effects) by smoothing splines. The model coefficients show the effect of variables, e.g., the day of the week, on the number of events and how these effects change during the pre-COVID-19 period. The proposed model returns reasonable mean absolute errors for most of the 2020-2021 period.
Submitted 27 March, 2022;
originally announced March 2022.
-
Competitors-Aware Stochastic Lap Strategy Optimisation for Race Hybrid Vehicles
Authors:
Francesco Braghin,
Luca Paparusso,
Manuel Riani,
Fabio Ruggeri
Abstract:
World Endurance Championship (WEC) racing events are characterised by a relevant performance gap among competitors. The fastest vehicles category, consisting in hybrid vehicles, has to respect energy usage constraints set by the technical regulation. Considering absence of competitors, i.e. traffic conditions, the optimal energy usage strategy for lap time minimisation is typically computed through a constrained optimisation problem. To the best of our knowledge, the majority of state-of-the-art works neglects competitors. This leads to a mismatch with the real world, where traffic generates considerable time losses. To bridge this gap, we propose a new framework to offline compute optimal strategies for the powertrain energy management considering competitors. Through analysis of the available data from previous events, statistics on the sector times and overtaking probabilities are extracted to encode the competitors' behaviour. Adopting a multi-agent model, the statistics are then used to generate realistic Monte Carlo (MC) simulation of their position along the track. The simulator is then adopted to identify the optimal strategy as follows. We develop a longitudinal vehicle model for the ego-vehicle and implement an optimisation problem for lap time minimisation in absence of traffic, based on Genetic Algorithms. Solving the optimisation problem for a variety of constraints generates a set of candidate optimal strategies. Stochastic Dynamic Programming is finally implemented to choose the best strategy considering competitors, whose motion is generated by the MC simulator. Our approach, validated on data from a real stint of race, allows to significantly reduce the lap time.
Submitted 28 February, 2022;
originally announced March 2022.
-
ArgSciChat: A Dataset for Argumentative Dialogues on Scientific Papers
Authors:
Federico Ruggeri,
Mohsen Mesgar,
Iryna Gurevych
Abstract:
The applications of conversational agents for scientific disciplines (as expert domains) are understudied due to the lack of dialogue data to train such agents. While most data collection frameworks, such as Amazon Mechanical Turk, foster data collection for generic domains by connecting crowd workers and task designers, these frameworks are not much optimized for data collection in expert domains. Scientists are rarely present in these frameworks due to their limited time budget. Therefore, we introduce a novel framework to collect dialogues between scientists as domain experts on scientific papers. Our framework lets scientists present their scientific papers as groundings for dialogues and participate in dialogue they like its paper title. We use our framework to collect a novel argumentative dialogue dataset, ArgSciChat. It consists of 498 messages collected from 41 dialogues on 20 scientific papers. Alongside extensive analysis on ArgSciChat, we evaluate a recent conversational agent on our dataset. Experimental results show that this agent poorly performs on ArgSciChat, motivating further research on argumentative scientific agents. We release our framework and the dataset.
Submitted 12 October, 2022; v1 submitted 14 February, 2022;
originally announced February 2022.
-
Combining Transformers with Natural Language Explanations
Authors:
Federico Ruggeri,
Marco Lippi,
Paolo Torroni
Abstract:
Many NLP applications require models to be interpretable. However, many successful neural architectures, including transformers, still lack effective interpretation methods. A possible solution could rely on building explanations from domain knowledge, which is often available as plain, natural language text. We thus propose an extension to transformer models that makes use of external memories to store natural language explanations and use them to explain classification outputs. We conduct an experimental evaluation on two domains, legal text analysis and argument mining, to show that our approach can produce relevant explanations while retaining or even improving classification performance.
Submitted 3 April, 2024; v1 submitted 2 September, 2021;
originally announced October 2021.
-
Tree-Constrained Graph Neural Networks For Argument Mining
Authors:
Federico Ruggeri,
Marco Lippi,
Paolo Torroni
Abstract:
We propose a novel architecture for Graph Neural Networks that is inspired by the idea behind Tree Kernels of measuring similarity between trees by taking into account their common substructures, named fragments. By imposing a series of regularization constraints to the learning problem, we exploit a pooling mechanism that incorporates such notion of fragments within the node soft assignment function that produces the embeddings. We present an extensive experimental evaluation on a collection of sentence classification tasks conducted on several argument mining corpora, showing that the proposed approach performs well with respect to state-of-the-art techniques.
Submitted 2 September, 2021;
originally announced October 2021.
-
Current Overview of Statistical Fiber Bundles Model and Its Application to Physics-based Reliability Analysis of Thin-film Dielectrics
Authors:
James U. Gleaton,
David Han,
James D. Lynch,
Hon Keung Tony Ng,
Fabrizio Ruggeri
Abstract:
In this paper, we present a critical overview of statistical fiber bundles models. We discuss relevant aspects, like assumptions and consequences stemming from models in the literature and propose new ones. This is accomplished by concentrating on both the physical and statistical aspects of a specific load-sharing example, the breakdown (BD) for circuits of capacitors and related dielectrics. For series and parallel/series circuits (series/parallel reliability systems) of ordinary capacitors, the load-sharing rules are derived from the electrical laws. This with the BD formalism is then used to obtain the BD distribution of the circuit. The BD distribution and Gibbs measure are given for a series circuit and the size effects are illustrated for simulations of series and parallel/series circuits. This is related to the finite weakest link adjustments for the BD distribution that arise in large series/parallel reliability load-sharing systems, such as dielectric BD, from their extreme value approximations.
An elementary but in-depth discussion of the physical aspects of SiO$_2$ and HfO$_2$ dielectrics and cell models is given. This is used to study a load-sharing cell model for the BD of HfO$_2$ dielectrics and the BD formalism. The latter study is based on an analysis of Kim and Lee (2004)'s data for such dielectrics. Here, several BD distributions are compared in the analysis and proportional hazard regression models are used to study the BD formalism. In addition, some areas of open research are discussed.
Submitted 25 January, 2023; v1 submitted 9 April, 2021;
originally announced April 2021.
-
A stochastic SIR model for the analysis of the COVID-19 Italian epidemic
Authors:
Sara Pasquali,
Antonio Pievatolo,
Antonella Bodini,
Fabrizio Ruggeri
Abstract:
We propose a stochastic SIR model, specified as a system of stochastic differential equations, to analyse the data of the Italian COVID-19 epidemic, taking also into account the under-detection of infected and recovered individuals in the population. We find that a correct assessment of the amount of under-detection is important to obtain reliable estimates of the critical model parameters. Moreover, a single SIR model over the whole epidemic period is unable to correctly describe the behaviour of the pandemic. Then, the adaptation of the model in every time-interval between relevant government decrees that implement contagion mitigation measures, provides short-term predictions and a continuously updated assessment of the basic reproduction number.
Submitted 19 February, 2021; v1 submitted 15 February, 2021;
originally announced February 2021.
-
Memory networks for consumer protection:unfairness exposed
Authors:
Federico Ruggeri,
Francesca Lagioia,
Marco Lippi,
Paolo Torroni
Abstract:
Recent work has demonstrated how data-driven AI methods can leverage consumer protection by supporting the automated analysis of legal documents. However, a shortcoming of data-driven approaches is poor explainability. We posit that in this domain useful explanations of classifier outcomes can be provided by resorting to legal rationales. We thus consider several configurations of memory-augmented neural networks where rationales are given a special role in the modeling of context knowledge. Our results show that rationales not only contribute to improve the classification accuracy, but are also able to offer meaningful, natural language explanations of otherwise opaque classifier outcomes.
Submitted 24 July, 2020;
originally announced August 2020.
-
Nanometer scale resolution, multi-channel separation of spherical particles in a rocking ratchet with increasing barrier heights
Authors:
Philippe Nicollier,
Christian Schwemmer,
Francesca Ruggeri,
Daniel Widmer,
Xiaoyu Ma,
Armin W. Knoll
Abstract:
We present a nanoparticle size-separation device based on a nanofluidic rocking Brownian motor. It features a ratchet-shaped electrostatic particle potential with increasing barrier heights along the particle transport direction. The sharp drop of the particle current with barrier height is exploited to separate a particle suspension into multiple sub-populations. By solving the Fokker--Planck equation, we show that the physics of the separation mechanism is governed by the energy landscape under forward tilt of the ratchet. For a given device geometry and sorting duration, the applied force is thus the only tunable parameter to increase the separation resolution. For the experimental conditions of 3.5 V applied voltage and 20 s sorting, we predict a separation resolution of $\sim 2$ nm, supported by experimental data for separating spherical gold particles of nominal 80 and 100 nm diameters.
Submitted 8 July, 2020;
originally announced July 2020.
-
Protecting Classifiers From Attacks. A Bayesian Approach
Authors:
Victor Gallego,
Roi Naveiro,
Alberto Redondo,
David Rios Insua,
Fabrizio Ruggeri
Abstract:
Classification problems in security settings are usually modeled as confrontations in which an adversary tries to fool a classifier manipulating the covariates of instances to obtain a benefit. Most approaches to such problems have focused on game-theoretic ideas with strong underlying common knowledge assumptions, which are not realistic in the security realm. We provide an alternative Bayesian framework that accounts for the lack of precise knowledge about the attacker's behavior using adversarial risk analysis. A key ingredient required by our framework is the ability to sample from the distribution of originating instances given the possibly attacked observed one. We propose a sampling procedure based on approximate Bayesian computation, in which we simulate the attacker's problem taking into account our uncertainty about his elements. For large scale problems, we propose an alternative, scalable approach that could be used when dealing with differentiable classifiers. Within it, we move the computational load to the training phase, simulating attacks from an adversary, adapting the framework to obtain a classifier robustified against attacks.
Submitted 18 April, 2020;
originally announced April 2020.
-
Duality between Approximate Bayesian Methods and Prior Robustness
Authors:
Chaitanya Joshi,
Fabrizio Ruggeri
Abstract:
In this paper we show that there is a link between approximate Bayesian methods and prior robustness. We show that what is typically recognized as an approximation to the likelihood, either due to the simulated data as in the Approximate Bayesian Computation (ABC) methods or due to the functional approximation to the likelihood, can instead also be viewed as an implicit exercise in prior robustness. We first define two new classes of priors for the cases where sufficient statistics are available, establish their mathematical properties and show, for a simple illustrative example, that these classes of priors can also be used to obtain the posterior distribution that would be obtained by implementing ABC. We then generalize and define two further classes of priors that are applicable in very general scenarios; one where sufficient statistics are not available and another where the likelihood is approximated using a functional approximation. We then discuss the interpretation and elicitation aspects of the classes proposed here as well as their potential applications and possible computational benefits. These classes establish the duality between approximate Bayesian inference and prior robustness for a wide category of Bayesian inference methods.
Submitted 1 April, 2020;
originally announced April 2020.
-
Validation of a computer code for the energy consumption of a building, with application to optimal electric bill pricing
Authors:
M. Keller,
G. Damblin,
A. Pasanisi,
M. Schuman,
P. Barbillon,
F. Ruggeri,
E. Parent
Abstract:
In this paper, we propose a practical Bayesian framework for the calibration and validation of a computer code, and apply it to a case study concerning the energy consumption forecasting of a building. Validation makes it possible to quantify forecasting uncertainties in view of the code's final use. Here we explore the situation where an energy provider promotes new energy contracts for residential buildings, tailored to each customer's needs, and including a guarantee of energy performance.
Based on power field measurements, collected from an experimental building cell over a certain time period, the code is calibrated, effectively reducing the epistemic uncertainty affecting some code parameters (here albedo, thermal bridge factor and convective coefficient). Validation is conducted by testing the goodness of fit of the code with respect to field measures, and then by propagating the a posteriori parametric uncertainty through the code, yielding probabilistic forecasts of the average electric power delivered inside the cell over a given time period.
To illustrate the benefits of the proposed Bayesian validation framework, we address the decision problem for an energy supplier offering a new type of contract, wherein the customer pays a fixed fee chosen in advance, based on an overall energy consumption forecast. According to Bayesian decision theory, we show how to choose such a fee optimally from the point of view of the supplier, in order to balance short-term benefits with customer loyalty.
Submitted 21 September, 2018;
originally announced October 2018.
-
Likelihood-free parameter estimation for dynamic queueing networks: case study of passenger flow in an international airport terminal
Authors:
Anthony Ebert,
Ritabrata Dutta,
Kerrie Mengersen,
Antonietta Mira,
Fabrizio Ruggeri,
Paul Wu
Abstract:
Dynamic queueing networks (DQN) model queueing systems where demand varies strongly with time, such as airport terminals. With rapidly rising global air passenger traffic placing increasing pressure on airport terminals, efficient allocation of resources is more important than ever. Parameter inference and quantification of uncertainty are key challenges for developing decision support tools. The DQN likelihood function is, in general, intractable and current approaches to simulation make likelihood-free parameter inference methods, such as approximate Bayesian computation (ABC), infeasible since simulating from these models is computationally expensive. By leveraging a recent advance in computationally efficient queueing simulation, we develop the first parameter inference approach for DQNs. We demonstrate our approach with data of passenger flows in a real airport terminal, and we show that our model accurately recreates the behaviour of the system and is useful for decision support. Special care must be taken in developing the distance for ABC since any useful output must vary with time. We use maximum mean discrepancy, a metric on probability measures, as the distance function for ABC. Prediction intervals of performance measures for decision support tools are easily constructed using draws from posterior samples, which we demonstrate with a scenario of a delayed flight.
Submitted 22 March, 2019; v1 submitted 7 April, 2018;
originally announced April 2018.
-
Adversarial classification: An adversarial risk analysis approach
Authors:
Roi Naveiro,
Alberto Redondo,
David Ríos Insua,
Fabrizio Ruggeri
Abstract:
Classification problems in security settings are usually contemplated as confrontations in which one or more adversaries try to fool a classifier to obtain a benefit. Most approaches to such adversarial classification problems have focused on game theoretical ideas with strong underlying common knowledge assumptions, which are actually not realistic in security domains. We provide an alternative framework for such problems based on adversarial risk analysis, which we illustrate with several examples. Computational and implementation issues are discussed.
Submitted 24 September, 2019; v1 submitted 21 February, 2018;
originally announced February 2018.
-
Computationally Efficient Simulation of Queues: The R Package queuecomputer
Authors:
Anthony Ebert,
Paul Wu,
Kerrie Mengersen,
Fabrizio Ruggeri
Abstract:
Large networks of queueing systems model important real-world systems such as MapReduce clusters, web-servers, hospitals, call centers and airport passenger terminals. To model such systems accurately, we must infer queueing parameters from data. Unfortunately, for many queueing networks there is no clear way to proceed with parameter inference from data. Approximate Bayesian computation could offer a straightforward way to infer parameters for such networks if we could simulate data quickly enough.
We present a computationally efficient method for simulating from a very general set of queueing networks with the R package queuecomputer. Remarkable speedups of more than 2 orders of magnitude are observed relative to the popular DES packages simmer and simpy. We replicate output from these packages to validate the package.
The package is modular and integrates well with the popular R package dplyr. Complex queueing networks with tandem, parallel and fork/join topologies can easily be built with these two packages together. We show how to use this package with two examples: a call center and an airport terminal.
Submitted 6 March, 2019; v1 submitted 6 March, 2017;
originally announced March 2017.
-
Modelling the Proliferation of Terrorism via Diffusion and Contagion
Authors:
Gentry White,
Fabrizio Ruggeri,
Michael D. Porter
Abstract:
The proliferation of terrorism is a serious concern in national and international security, as its spread is seen as an existential threat to Western liberal democracies. Understanding and effectively modelling the spread of terrorism provides useful insight into formulating effective responses. A mathematical model capturing the theoretical constructs of contagion and diffusion is constructed for explaining the spread of terrorist activity and used to analyse data from the Global Terrorism Database from 2000--2016 for Afghanistan, Iraq, and Israel.
Submitted 11 February, 2019; v1 submitted 7 December, 2016;
originally announced December 2016.
-
A hierarchical Bayesian setting for an inverse problem in linear parabolic PDEs with noisy boundary conditions
Authors:
Fabrizio Ruggeri,
Zaid Sawlan,
Marco Scavino,
Raul Tempone
Abstract:
In this work we develop a Bayesian setting to infer unknown parameters in initial-boundary value problems related to linear parabolic partial differential equations. We realistically assume that the boundary data are noisy, for a given prescribed initial condition. We show how to derive the joint likelihood function for the forward problem, given some measurements of the solution field subject to Gaussian noise. Given Gaussian priors for the time-dependent Dirichlet boundary values, we analytically marginalize the joint likelihood using the linearity of the equation. Our hierarchical Bayesian approach is fully implemented in an example that involves the heat equation. In this example, the thermal diffusivity is the unknown parameter. We assume that the thermal diffusivity parameter can be modeled a priori through a lognormal random variable or by means of a space-dependent stationary lognormal random field. Synthetic data are used to test the inference. We exploit the behavior of the non-normalized log posterior distribution of the thermal diffusivity. Then, we use the Laplace method to obtain an approximated Gaussian posterior and therefore avoid costly Markov Chain Monte Carlo computations. Expected information gains and predictive posterior densities for observable quantities are numerically estimated using Laplace approximation for different experimental setups.
Submitted 28 January, 2015; v1 submitted 20 January, 2015;
originally announced January 2015.
-
Imprecise Dirichlet Process with application to the hypothesis test on the probability that X< Y
Authors:
Alessio Benavoli,
Francesca Mangili,
Fabrizio Ruggeri,
Marco Zaffalon
Abstract:
The Dirichlet process (DP) is one of the most popular Bayesian nonparametric models. An open problem with the DP is how to choose its infinite dimensional parameter (base measure) in case of lack of prior information. In this work we present the Imprecise DP (IDP) -- a prior near-ignorance DP-based model that does not require any choice of this probability measure. It consists of a class of DPs obtained by letting the normalized base measure of the DP vary in the set of all probability measures. We discuss the tight connections of this approach with Bayesian robustness and in particular prior near-ignorance modeling via sets of probabilities. We use this model to perform a Bayesian hypothesis test on the probability P(X<Y). We study the theoretical properties of the IDP test (e.g., asymptotic consistency), and compare it with the frequentist Mann-Whitney-Wilcoxon rank test that is commonly employed as a test on P(X< Y). In particular we will show that our method is more robust, in the sense that it is able to isolate instances in which the aforementioned test is virtually guessing at random.
Submitted 20 February, 2014; v1 submitted 12 February, 2014;
originally announced February 2014.
-
Nanoscale spatially resolved infrared spectra from single microdroplets
Authors:
Thomas Müller,
Francesco Simone Ruggeri,
Andrzej J. Kulik,
Ulyana Shimanovich,
Thomas O. Mason,
Tuomas P. J. Knowles,
Giovanni Dietler
Abstract:
Droplet microfluidics has emerged as a powerful platform allowing a large number of individual reactions to be carried out in spatially distinct microcompartments. Due to their small size, however, the spectroscopic characterisation of species encapsulated in such systems remains challenging. In this paper, we demonstrate the acquisition of infrared spectra from single microdroplets containing aggregation-prone proteins. To this effect, droplets are generated in a microfluidic flow-focussing device and subsequently deposited in a square array onto a ZnSe prism using a micro stamp. After drying, the solutes present in the droplets are illuminated locally by an infrared laser through the prism, and their thermal expansion upon absorption of infrared radiation is measured with an atomic force microscopy tip, granting nanoscale resolution. Using this approach, we resolve structural differences in the amide bands of the spectra of monomeric and aggregated lysozyme from single microdroplets with picolitre volume.
Submitted 31 January, 2014;
originally announced January 2014.
-
Nucleation Rate of Hadron Bubbles in Baryon-Free Quark-Gluon Plasma
Authors:
Franco Ruggeri,
William Friedman
Abstract:
We evaluate the factor $κ$ appearing in Langer's expression for the nucleation rate extended to the case of hadron bubbles forming in zero baryon number cooled quark-gluon plasma. We consider both the absence and presence of viscosity and show that viscous effects introduce only small changes in the value of $κ$
Submitted 30 November, 1995;
originally announced November 1995.