Search | arXiv e-print repository

Differential Privacy for Anomaly Detection: Analyzing the Trade-off Between Privacy and Explainability

Authors: Fatima Ezzeddine, Mirna Saad, Omran Ayoub, Davide Andreoletti, Martin Gjoreski, Ihab Sbeity, Marc Langheinrich, Silvia Giordano

Abstract: Anomaly detection (AD), also referred to as outlier detection, is a statistical process aimed at identifying observations within a dataset that significantly deviate from the expected pattern of the majority of the data. Such a process finds wide application in various fields, such as finance and healthcare. While the primary objective of AD is to yield high detection accuracy, the requirements of… ▽ More Anomaly detection (AD), also referred to as outlier detection, is a statistical process aimed at identifying observations within a dataset that significantly deviate from the expected pattern of the majority of the data. Such a process finds wide application in various fields, such as finance and healthcare. While the primary objective of AD is to yield high detection accuracy, the requirements of explainability and privacy are also paramount. The first ensures the transparency of the AD process, while the second guarantees that no sensitive information is leaked to untrusted parties. In this work, we exploit the trade-off of applying Explainable AI (XAI) through SHapley Additive exPlanations (SHAP) and differential privacy (DP). We perform AD with different models and on various datasets, and we thoroughly evaluate the cost of privacy in terms of decreased accuracy and explainability. Our results show that the enforcement of privacy through DP has a significant impact on detection accuracy and explainability, which depends on both the dataset and the considered AD model. We further show that the visual interpretation of explanations is also influenced by the choice of the AD algorithm. △ Less

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2306.05717 [pdf, other]

A Novel Satellite Selection Algorithm Using LSTM Neural Networks For Single-epoch Localization

Authors: Ibrahim Sbeity, Christophe Villien, Christophe Combettes, Benoît Denis, E Veronica Belmega, Marwa Chafii

Abstract: This work presents a new approach for detection and exclusion (or de-weighting) of pseudo-range measurements from the Global Navigation Satellite System (GNSS) in order to improve the accuracy of single-epoch positioning, which is an essential prerequisite for maintaining good navigation performance in challenging operating contexts (e.g., under Non-Line of Sight and/or multipath propagation). Bey… ▽ More This work presents a new approach for detection and exclusion (or de-weighting) of pseudo-range measurements from the Global Navigation Satellite System (GNSS) in order to improve the accuracy of single-epoch positioning, which is an essential prerequisite for maintaining good navigation performance in challenging operating contexts (e.g., under Non-Line of Sight and/or multipath propagation). Beyond the usual preliminary hard decision stage, which can mainly reject obvious outliers, our approach exploits machine learning to optimize the relative contributions from all available satellites feeding the positioning solver. For this, we construct a customized matrix of pseudorange residuals that is used as an input to the proposed longshort term memory neural network (LSTM NN) architecture. The latter is trained to predict several quality indicators that roughly approximate the standard deviations of pseudo-range errors, which are further integrated in the calculation of weights. Our numerical evaluations on both synthetic and real data show that the proposed solution is able to outperform conventional weighting and signal selection strategies from the state-of-theart, while fairly approaching optimal positioning accuracy. △ Less

Submitted 9 June, 2023; originally announced June 2023.

Comments: arXiv admin note: text overlap with arXiv:2306.05319

Journal ref: IEEE/ION PLANS 2023, Apr 2023, California, United States

arXiv:2306.05319 [pdf, other]

RNN-Based GNSS Positioning using Satellite Measurement Features and Pseudorange Residuals

Authors: Ibrahim Sbeity, Christophe Villien, Benoît Denis, E. Veronica Belmega

Abstract: In the Global Navigation Satellite System (GNSS) context, the growing number of available satellites has lead to many challenges when it comes to choosing the most accurate pseudorange contributions, given the strong impact of biased measurements on positioning accuracy, particularly in single-epoch scenarios. This work leverages the potential of machine learning in predicting link-wise measuremen… ▽ More In the Global Navigation Satellite System (GNSS) context, the growing number of available satellites has lead to many challenges when it comes to choosing the most accurate pseudorange contributions, given the strong impact of biased measurements on positioning accuracy, particularly in single-epoch scenarios. This work leverages the potential of machine learning in predicting link-wise measurement quality factors and, hence, optimize measurement weighting. For this purpose, we use a customized matrix composed of heterogeneous features such as conditional pseudorange residuals and per-link satellite metrics (e.g., carrier-to-noise power density ratio and its empirical statistics, satellite elevation, carrier phase lock time). This matrix is then fed as an input to a recurrent neural network (RNN) (i.e., a long-short term memory (LSTM) network). Our experimental results on real data, obtained from extensive field measurements, demonstrate the high potential of our proposed solution being able to outperform traditional measurements weighting and selection strategies from state-of-the-art. △ Less

Submitted 8 June, 2023; originally announced June 2023.

arXiv:2210.08786 [pdf, other]

doi 10.1140/epjds/s13688-023-00423-4

Exposing Influence Campaigns in the Age of LLMs: A Behavioral-Based AI Approach to Detecting State-Sponsored Trolls

Authors: Fatima Ezzeddine, Luca Luceri, Omran Ayoub, Ihab Sbeity, Gianluca Nogara, Emilio Ferrara, Silvia Giordano

Abstract: The detection of state-sponsored trolls operating in influence campaigns on social media is a critical and unsolved challenge for the research community, which has significant implications beyond the online realm. To address this challenge, we propose a new AI-based solution that identifies troll accounts solely through behavioral cues associated with their sequences of sharing activity, encompass… ▽ More The detection of state-sponsored trolls operating in influence campaigns on social media is a critical and unsolved challenge for the research community, which has significant implications beyond the online realm. To address this challenge, we propose a new AI-based solution that identifies troll accounts solely through behavioral cues associated with their sequences of sharing activity, encompassing both their actions and the feedback they receive from others. Our approach does not incorporate any textual content shared and consists of two steps: First, we leverage an LSTM-based classifier to determine whether account sequences belong to a state-sponsored troll or an organic, legitimate user. Second, we employ the classified sequences to calculate a metric named the "Troll Score", quantifying the degree to which an account exhibits troll-like behavior. To assess the effectiveness of our method, we examine its performance in the context of the 2016 Russian interference campaign during the U.S. Presidential election. Our experiments yield compelling results, demonstrating that our approach can identify account sequences with an AUC close to 99% and accurately differentiate between Russian trolls and organic users with an AUC of 91%. Notably, our behavioral-based approach holds a significant advantage in the ever-evolving landscape, where textual and linguistic properties can be easily mimicked by Large Language Models (LLMs): In contrast to existing language-based techniques, it relies on more challenging-to-replicate behavioral cues, ensuring greater resilience in identifying influence campaigns, especially given the potential increase in the usage of LLMs for generating inauthentic content. Finally, we assessed the generalizability of our solution to various entities driving different information operations and found promising results that will guide future research. △ Less

Submitted 11 October, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

Comments: 22

Journal ref: EPJ Data Sci. 12, 46 (2023)

arXiv:1409.2650 [pdf]

Combining the analytical hierarchy process and the genetic algorithm to solve the timetable problem

Authors: Ihab Sbeity, Mohamed Dbouk, Habib Kobeissi

Abstract: The main problems of school course timetabling are time, curriculum, and classrooms. In addition there are other problems that vary from one institution to another. This paper is intended to solve the problem of satisfying the teachers preferred schedule in a way that regards the importance of the teacher to the supervising institute, i.e. his score according to some criteria. Genetic algorithm (G… ▽ More The main problems of school course timetabling are time, curriculum, and classrooms. In addition there are other problems that vary from one institution to another. This paper is intended to solve the problem of satisfying the teachers preferred schedule in a way that regards the importance of the teacher to the supervising institute, i.e. his score according to some criteria. Genetic algorithm (GA) has been presented as an elegant method in solving timetable problem (TTP) in order to produce solutions with no conflict. In this paper, we consider the analytic hierarchy process (AHP) to efficiently obtain a score for each teacher, and consequently produce a GA-based TTP solution that satisfies most of the teachers preferences. △ Less

Submitted 9 September, 2014; originally announced September 2014.

Comments: International Journal of Software Engineering & Applications (IJSEA), Vol.5, No.4, July 2014

arXiv:1202.0414 [pdf]

Generating a Performance Stochastic Model from UML Specifications

Authors: Ihab Sbeity, Leonardo Brenner, Mohamed Dbouk

Abstract: Since its initiation by Connie Smith, the process of Software Performance Engineering (SPE) is becoming a growing concern. The idea is to bring performance evaluation into the software design process. This suitable methodology allows software designers to determine the performance of software during design. Several approaches have been proposed to provide such techniques. Some of them propose to d… ▽ More Since its initiation by Connie Smith, the process of Software Performance Engineering (SPE) is becoming a growing concern. The idea is to bring performance evaluation into the software design process. This suitable methodology allows software designers to determine the performance of software during design. Several approaches have been proposed to provide such techniques. Some of them propose to derive from a UML (Unified Modeling Language) model a performance model such as Stochastic Petri Net (SPN) or Stochastic process Algebra (SPA) models. Our work belongs to the same category. We propose to derive from a UML model a Stochastic Automata Network (SAN) in order to obtain performance predictions. Our approach is more flexible due to the SAN modularity and its high resemblance to UML' state-chart diagram. △ Less

Submitted 2 February, 2012; originally announced February 2012.

Showing 1–6 of 6 results for author: Sbeity, I