Search | arXiv e-print repository

arXiv:2406.20080 [pdf, other]

AI for Extreme Event Modeling and Understanding: Methodologies and Challenges

Authors: Gustau Camps-Valls, Miguel-Ángel Fernández-Torres, Kai-Hendrik Cohrs, Adrian Höhl, Andrea Castelletti, Aytac Pacal, Claire Robin, Francesco Martinuzzi, Ioannis Papoutsis, Ioannis Prapas, Jorge Pérez-Aracil, Katja Weigel, Maria Gonzalez-Calabuig, Markus Reichstein, Martin Rabel, Matteo Giuliani, Miguel Mahecha, Oana-Iuliana Popescu, Oscar J. Pellicer-Valero, Said Ouala, Sancho Salcedo-Sanz, Sebastian Sippel, Spyros Kondylatos, Tamara Happé, Tristan Williams

Abstract: In recent years, artificial intelligence (AI) has deeply impacted various fields, including Earth system sciences. Here, AI improved weather forecasting, model emulation, parameter estimation, and the prediction of extreme events. However, the latter comes with specific challenges, such as developing accurate predictors from noisy, heterogeneous and limited annotated data. This paper reviews how AI is being used to analyze extreme events (like floods, droughts, wildfires and heatwaves), highlighting the importance of creating accurate, transparent, and reliable AI models. We discuss the hurdles of dealing with limited data, integrating information in real-time, deploying models, and making them understandable, all crucial for gaining the trust of stakeholders and meeting regulatory needs. We provide an overview of how AI can help identify and explain extreme events more effectively, improving disaster response and communication. We emphasize the need for collaboration across different fields to create AI solutions that are practical, understandable, and trustworthy for analyzing and predicting extreme events. Such collaborative efforts aim to enhance disaster readiness and disaster risk reduction.

Submitted 28 June, 2024; originally announced June 2024.

arXiv:2404.15682 [pdf, other]

Fixed points for three point generalized orbital triangular contractions

Authors: Cristina Maria Pacurar, Ovidiu Popescu

Abstract: In this paper we introduce and study new classes of mappings in metric spaces. The main class of mappings is called generalized orbital triangular contractions and it generalizes some existing results (such as Banach contractions, mappings contracting perimeters of triangles). We prove that these contractions are not necessarily continuous and have a unique fixed point under certain conditions. Moreover, we extend our class to generalized orbital triangular Kannan contractions and generalized orbital triangular Chatterjea contractions.

Submitted 24 April, 2024; originally announced April 2024.

Comments: Submitted to Journal on 24 April 2024

arXiv:2404.00782 [pdf, other]

Fixed point theorem for generalized Chatterjea type mappings

Authors: Ovidiu Popescu, Cristina Maria Păcurar

Abstract: We introduce a new type of mappings in metric space which are three-point analogue of the well-known Chatterjea type mappings, and call them generalized Chatterjea type mappings. It is shown that such mappings can be discontinuous as is the case of Chatterjea type mappings and this new class includes the class of Chatterjea type mappings. The fixed point theorem for generalized Chatterjea type mappings is proven.

Submitted 31 March, 2024; originally announced April 2024.

arXiv:2403.19488 [pdf, other]

Mappings contracting triangles

Authors: Ovidiu Popescu, Cristina Maria Pacurar

Abstract: The aim of the current paper is to introduce a new class of contractive mappings, which are contracting (a feature of) triangles. We prove that maps contracting triangles are continuous and give the fixed point result for such mappings. We emphasize that our main theorem encompasses many functions, with significant applicability, for which the result holds, thereby representing a notable advancement in this research domain.

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2312.05448 [pdf, other]

Domain Adaptation of a State of the Art Text-to-SQL Model: Lessons Learned and Challenges Found

Authors: Irene Manotas, Octavian Popescu, Ngoc Phuoc An Vo, Vadim Sheinin

Abstract: There are many recent advanced developments for the Text-to-SQL task, where the Picard model is one of the top performing models as measured by the Spider dataset competition. However, bringing Text-to-SQL systems to realistic use-cases through domain adaptation remains a tough challenge. We analyze how well the base T5 Language Model and Picard perform on query structures different from the Spider dataset, we fine-tuned the base model on the Spider data and on independent databases (DB). To avoid accessing the DB content online during inference, we also present an alternative way to disambiguate the values in an input question using a rule-based approach that relies on an intermediate representation of the semantic concepts of an input question. In our results we show in what cases T5 and Picard can deliver good performance, we share the lessons learned, and discuss current domain adaptation challenges.

Submitted 8 December, 2023; originally announced December 2023.

ACM Class: I.2.7

arXiv:2310.11132 [pdf, other]

Non-parametric Conditional Independence Testing for Mixed Continuous-Categorical Variables: A Novel Method and Numerical Evaluation

Authors: Oana-Iuliana Popescu, Andreas Gerhardus, Jakob Runge

Abstract: Conditional independence testing (CIT) is a common task in machine learning, e.g., for variable selection, and a main component of constraint-based causal discovery. While most current CIT approaches assume that all variables are numerical or all variables are categorical, many real-world applications involve mixed-type datasets that include numerical and categorical variables. Non-parametric CIT can be conducted using conditional mutual information (CMI) estimators combined with a local permutation scheme. Recently, two novel CMI estimators for mixed-type datasets based on k-nearest-neighbors (k-NN) have been proposed. As with any k-NN method, these estimators rely on the definition of a distance metric. One approach computes distances by a one-hot encoding of the categorical variables, essentially treating categorical variables as discrete-numerical, while the other expresses CMI by entropy terms where the categorical variables appear as conditions only. In this work, we study these estimators and propose a variation of the former approach that does not treat categorical variables as numeric. Our numerical experiments show that our variant detects dependencies more robustly across different data distributions and preprocessing types.

Submitted 5 November, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

arXiv:2307.04524 [pdf, ps, other]

doi 10.37193/CJM.2024.03.11

Some remarks on expansive mappings in metric spaces

Authors: Ovidiu Popescu, Cristina Maria Pacurar

Abstract: The aim of this paper is to generalize the results on expansive mappings of Yesilkaya and Aydin from \cite{Yesilkaya}. We give some fixed point results for q-expansive mappings in metric spaces and prove some fixed point theorems for this class of mappings. Finally, we present some examples to support the new results.

Submitted 10 July, 2023; originally announced July 2023.

Journal ref: Carpathian Journal of Mathematics 2024

arXiv:2301.10604 [pdf, other]

Automated multilingual detection of Pro-Kremlin propaganda in newspapers and Telegram posts

Authors: Veronika Solopova, Oana-Iuliana Popescu, Christoph Benzmüller, Tim Landgraf

Abstract: The full-scale conflict between the Russian Federation and Ukraine generated an unprecedented amount of news articles and social media data reflecting opposing ideologies and narratives. These polarized campaigns have led to mutual accusations of misinformation and fake news, shaping an atmosphere of confusion and mistrust for readers worldwide. This study analyses how the media affected and mirrored public opinion during the first month of the war using news articles and Telegram news channels in Ukrainian, Russian, Romanian and English. We propose and compare two methods of multilingual automated pro-Kremlin propaganda identification, based on Transformers and linguistic features. We analyse the advantages and disadvantages of both methods, their adaptability to new genres and languages, and ethical considerations of their usage for content moderation. With this work, we aim to lay the foundation for further development of moderation tools tailored to the current conflict.

Submitted 25 January, 2023; originally announced January 2023.

Comments: 9 pages, 3 figures

arXiv:2204.11642 [pdf, other]

Do Users Benefit From Interpretable Vision? A User Study, Baseline, And Dataset

Authors: Leon Sixt, Martin Schuessler, Oana-Iuliana Popescu, Philipp Weiß, Tim Landgraf

Abstract: A variety of methods exist to explain image classification models. However, whether they provide any benefit to users over simply comparing various inputs and the model's respective predictions remains unclear. We conducted a user study (N=240) to test how such a baseline explanation technique performs against concept-based and counterfactual explanations. To this end, we contribute a synthetic dataset generator capable of biasing individual attributes and quantifying their relevance to the model. In a study, we assess if participants can identify the relevant set of attributes compared to the ground-truth. Our results show that the baseline outperformed concept-based explanations. Counterfactual explanations from an invertible neural network performed similarly as the baseline. Still, they allowed users to identify some attributes more accurately. Our results highlight the importance of measuring how well users can reason about biases of a model, rather than solely relying on technical evaluations or proxy tasks. We open-source our study and dataset so it can serve as a blue-print for future studies. For code see, https://github.com/berleon/do_users_benefit_from_interpretable_vision

Submitted 25 April, 2022; originally announced April 2022.

Comments: Published at ICLR 2022

arXiv:2104.00660 [pdf, other]

Recognizing and Splitting Conditional Sentences for Automation of Business Processes Management

Authors: Ngoc Phuoc An Vo, Irene Manotas, Octavian Popescu, Algimantas Cerniauskas, Vadim Sheinin

Abstract: Business Process Management (BPM) is the discipline which is responsible for management of discovering, analyzing, redesigning, monitoring, and controlling business processes. One of the most crucial tasks of BPM is discovering and modelling business processes from text documents. In this paper, we present our system that resolves an end-to-end problem consisting of 1) recognizing conditional sentences from technical documents, 2) finding boundaries to extract conditional and resultant clauses from each conditional sentence, and 3) categorizing resultant clause as Action or Consequence which later helps to generate new steps in our business process model automatically. We created a new dataset and three models to solve this problem. Our best model achieved very promising results of 83.82, 87.84, and 85.75 for Precision, Recall, and F1, respectively, for extracting Condition, Action, and Consequence clauses using Exact Match metric.

Submitted 1 April, 2021; originally announced April 2021.

Comments: Preprint

arXiv:2102.00951 [pdf, other]

Counterfactual Generation with Knockoffs

Authors: Oana-Iuliana Popescu, Maha Shadaydeh, Joachim Denzler

Abstract: Human interpretability of deep neural networks' decisions is crucial, especially in domains where these directly affect human lives. Counterfactual explanations of already trained neural networks can be generated by perturbing input features and attributing importance according to the change in the classifier's outcome after perturbation. Perturbation can be done by replacing features using heuristic or generative in-filling methods. The choice of in-filling function significantly impacts the number of artifacts, i.e., false-positive attributions. Heuristic methods result in false-positive artifacts because the image after the perturbation is far from the original data distribution. Generative in-filling methods reduce artifacts by producing in-filling values that respect the original data distribution. However, current generative in-filling methods may also increase false-negatives due to the high correlation of in-filling values with the original data. In this paper, we propose to alleviate this by generating in-fillings with the statistically-grounded Knockoffs framework, which was developed by Barber and Candès in 2015 as a tool for variable selection with controllable false discovery rate. Knockoffs are statistically null-variables as decorrelated as possible from the original data, which can be swapped with the originals without changing the underlying data distribution. A comparison of different in-filling methods indicates that in-filling with knockoffs can reveal explanations in a more causal sense while still maintaining the compactness of the explanations.

Submitted 1 February, 2021; originally announced February 2021.

Comments: 12 pages, 10 figures

Showing 1–11 of 11 results for author: Popescu, O