-
A Named Entity Recognition and Topic Modeling-based Solution for Locating and Better Assessment of Natural Disasters in Social Media
Authors:
Ayaz Mehmood,
Muhammad Tayyab Zamir,
Muhammad Asif Ayub,
Nasir Ahmad,
Kashif Ahmad
Abstract:
Over the last decade, similar to other application domains, social media content has been proven very effective in disaster informatics. However, due to the unstructured nature of the data, several challenges are associated with disaster analysis in social media content. To fully explore the potential of social media content in disaster informatics, access to relevant content and the correct geo-l…
▽ More
Over the last decade, similar to other application domains, social media content has been proven very effective in disaster informatics. However, due to the unstructured nature of the data, several challenges are associated with disaster analysis in social media content. To fully explore the potential of social media content in disaster informatics, access to relevant content and the correct geo-location information is very critical. In this paper, we propose a three-step solution to tackling these challenges. Firstly, the proposed solution aims to classify social media posts into relevant and irrelevant posts followed by the automatic extraction of location information from the posts' text through Named Entity Recognition (NER) analysis. Finally, to quickly analyze the topics covered in large volumes of social media posts, we perform topic modeling resulting in a list of top keywords, that highlight the issues discussed in the tweet. For the Relevant Classification of Twitter Posts (RCTP), we proposed a merit-based fusion framework combining the capabilities of four different models namely BERT, RoBERTa, Distil BERT, and ALBERT obtaining the highest F1-score of 0.933 on a benchmark dataset. For the Location Extraction from Twitter Text (LETT), we evaluated four models namely BERT, RoBERTa, Distil BERTA, and Electra in an NER framework obtaining the highest F1-score of 0.960. For topic modeling, we used the BERTopic library to discover the hidden topic patterns in the relevant tweets. The experimental results of all the components of the proposed end-to-end solution are very encouraging and hint at the potential of social media content and NLP in disaster management.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case
Authors:
Muhammad Asif Auyb,
Muhammad Tayyab Zamir,
Imran Khan,
Hannia Naseem,
Nasir Ahmad,
Kashif Ahmad
Abstract:
This paper focuses on a very important societal challenge of water quality analysis. Being one of the key factors in the economic and social development of society, the provision of water and ensuring its quality has always remained one of the top priorities of public authorities. To ensure the quality of water, different methods for monitoring and assessing the water networks, such as offline and…
▽ More
This paper focuses on a very important societal challenge of water quality analysis. Being one of the key factors in the economic and social development of society, the provision of water and ensuring its quality has always remained one of the top priorities of public authorities. To ensure the quality of water, different methods for monitoring and assessing the water networks, such as offline and online surveys, are used. However, these surveys have several limitations, such as the limited number of participants and low frequency due to the labor involved in conducting such surveys. In this paper, we propose a Natural Language Processing (NLP) framework to automatically collect and analyze water-related posts from social media for data-driven decisions. The proposed framework is composed of two components, namely (i) text classification, and (ii) topic modeling. For text classification, we propose a merit-fusion-based framework incorporating several Large Language Models (LLMs) where different weight selection and optimization methods are employed to assign weights to the LLMs. In topic modeling, we employed the BERTopic library to discover the hidden topic patterns in the water-related tweets. We also analyzed relevant tweets originating from different regions and countries to explore global, regional, and country-specific issues and water-related concerns. We also collected and manually annotated a large-scale dataset, which is expected to facilitate future research on the topic.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection
Authors:
Muhammad Tayyab Zamir,
Muhammad Asif Ayub,
Asma Gul,
Nasir Ahmad,
Kashif Ahmad
Abstract:
In recent years, the increasing use of Artificial Intelligence based text generation tools has posed new challenges in document provenance, authentication, and authorship detection. However, advancements in stylometry have provided opportunities for automatic authorship and author change detection in multi-authored documents using style analysis techniques. Style analysis can serve as a primary st…
▽ More
In recent years, the increasing use of Artificial Intelligence based text generation tools has posed new challenges in document provenance, authentication, and authorship detection. However, advancements in stylometry have provided opportunities for automatic authorship and author change detection in multi-authored documents using style analysis techniques. Style analysis can serve as a primary step toward document provenance and authentication through authorship detection. This paper investigates three key tasks of style analysis: (i) classification of single and multi-authored documents, (ii) single change detection, which involves identifying the point where the author switches, and (iii) multiple author-switching detection in multi-authored documents. We formulate all three tasks as classification problems and propose a merit-based fusion framework that integrates several state-of-the-art natural language processing (NLP) algorithms and weight optimization techniques. We also explore the potential of special characters, which are typically removed during pre-processing in NLP applications, on the performance of the proposed methods for these tasks by conducting extensive experiments on both cleaned and raw datasets. Experimental results demonstrate significant improvements over existing solutions for all three tasks on a benchmark dataset.
△ Less
Submitted 12 January, 2024;
originally announced January 2024.
-
Elliptic cross sections in blood flow regulation
Authors:
Chris Brimacombe,
Robert M. Corless,
Mair Zamir
Abstract:
Arterial deformations arise in blood flow when surrounding tissue invades the space available for a blood vessel to maintain its circular cross section, the most immediate effects being a reduction in blood flow and redistribution of shear stress. Here we consider deformations from circular to elliptic cross sections. Solution of this problem in steady flow is fairly straightforward. The focus in…
▽ More
Arterial deformations arise in blood flow when surrounding tissue invades the space available for a blood vessel to maintain its circular cross section, the most immediate effects being a reduction in blood flow and redistribution of shear stress. Here we consider deformations from circular to elliptic cross sections. Solution of this problem in steady flow is fairly straightforward. The focus in the present paper is on pulsatile flow where the change from circular to elliptic cross sections is associated with a transition in the character of the equations governing the flow from Bessel to Mathieu equations. The study of this problem has been hampered in the past because of difficulties involved in the solution of the governing equations. In the present study we describe methods we have used to overcome some of these difficulties and present a comprehensive set of results based on these methods. In particular, vessel deformation is examined under two different conditions relevant to blood flow regulation: (i) kee** cross sectional area constant and (ii) kee** cross sectional circumference constant. The results provide an important context for the mechanism of neurovascular control of blood flow under the pathological conditions of vessel deformation.
△ Less
Submitted 19 January, 2023;
originally announced April 2023.
-
Document Provenance and Authentication through Authorship Classification
Authors:
Muhammad Tayyab Zamir,
Muhammad Asif Ayub,
Jebran Khan,
Muhammad Jawad Ikram,
Nasir Ahmad,
Kashif Ahmad
Abstract:
Style analysis, which is relatively a less explored topic, enables several interesting applications. For instance, it allows authors to adjust their writing style to produce a more coherent document in collaboration. Similarly, style analysis can also be used for document provenance and authentication as a primary step. In this paper, we propose an ensemble-based text-processing framework for the…
▽ More
Style analysis, which is relatively a less explored topic, enables several interesting applications. For instance, it allows authors to adjust their writing style to produce a more coherent document in collaboration. Similarly, style analysis can also be used for document provenance and authentication as a primary step. In this paper, we propose an ensemble-based text-processing framework for the classification of single and multi-authored documents, which is one of the key tasks in style analysis. The proposed framework incorporates several state-of-the-art text classification algorithms including classical Machine Learning (ML) algorithms, transformers, and deep learning algorithms both individually and in merit-based late fusion. For the merit-based late fusion, we employed several weight optimization and selection methods to assign merit-based weights to the individual text classification algorithms. We also analyze the impact of the characters on the task that are usually excluded in NLP applications during pre-processing by conducting experiments on both clean and un-clean data. The proposed framework is evaluated on a large-scale benchmark dataset, significantly improving performance over the existing solutions.
△ Less
Submitted 2 March, 2023;
originally announced March 2023.
-
Computation and applications of Mathieu functions: A historical perspective
Authors:
Chris Brimacombe,
Robert M. Corless,
Mair Zamir
Abstract:
Mathieu functions of period $π$ or $2π$, also called elliptic cylinder functions, were introduced in 1868 by Émile Mathieu together with so-called modified Mathieu functions, in order to help understand the vibrations of an elastic membrane set in a fixed elliptical hoop. These functions still occur frequently in applications today: our interest, for instance, was stimulated by a problem of pulsat…
▽ More
Mathieu functions of period $π$ or $2π$, also called elliptic cylinder functions, were introduced in 1868 by Émile Mathieu together with so-called modified Mathieu functions, in order to help understand the vibrations of an elastic membrane set in a fixed elliptical hoop. These functions still occur frequently in applications today: our interest, for instance, was stimulated by a problem of pulsatile blood flow in a blood vessel compressed into an elliptical cross-section. This paper surveys and recapitulates the historical development of the theory and methods of computation for Mathieu functions and modified Mathieu functions and identifies some gaps in current software capability, particularly to do with double eigenvalues of the Mathieu equation. We demonstrate how to compute Puiseux expansions of the Mathieu eigenvalues about such double eigenvalues, and give methods to compute the generalized eigenfunctions that arise there. In examining Mathieu's original contribution, we bring out that his use of anti-secularity predates that of Lindstedt. For interest, we also provide short biographies of some of the major mathematical researchers involved in the history of the Mathieu functions: Émile Mathieu, Sir Edmund Whittaker, Edward Ince, and Gertrude Blanch.
△ Less
Submitted 30 June, 2021; v1 submitted 4 August, 2020;
originally announced August 2020.
-
Reproduction Number And Asymptotic Stability For The Dynamics of a Honey Bee Colony with Continuous Age Structure
Authors:
Matthew Betti,
Lindi Wahl,
Mair Zamir
Abstract:
A system of partial differential equations is derived as a model for the dynamics of a honey bee colony with a continuous age distribution, and the system is then extended to include the effects of a simplified infectious disease. In the disease-free case we analytically derive the equilibrium age distribution within the colony and propose a novel approach for determining the global asymptotic sta…
▽ More
A system of partial differential equations is derived as a model for the dynamics of a honey bee colony with a continuous age distribution, and the system is then extended to include the effects of a simplified infectious disease. In the disease-free case we analytically derive the equilibrium age distribution within the colony and propose a novel approach for determining the global asymptotic stability of a reduced model. Furthermore, we present a method for determining the basic reproduction number $R_0$ of the infection; the method can be applied to other age-structured disease models with interacting susceptible classes. The results of asymptotic stability indicate that a honey bee colony suffering losses will recover naturally so long as the cause of the losses is removed before the colony collapses. Our expression for $R_0$ has potential uses in the tracking and control of an infectious disease within a bee colony.
△ Less
Submitted 2 November, 2016;
originally announced November 2016.