-
YOLO based Ocean Eddy Localization with AWS SageMaker
Authors:
Seraj Al Mahmud Mostafa,
Jinbo Wang,
Benjamin Holt,
Jianwu Wang
Abstract:
Ocean eddies play a significant role both on the sea surface and beneath it, contributing to the sustainability of marine life dependent on oceanic behaviors. Therefore, it is crucial to investigate ocean eddies to monitor changes in the Earth, particularly in the oceans, and their impact on climate. This study aims to pinpoint ocean eddies using AWS cloud services, specifically SageMaker. The primary objective is to detect small-scale (<20 km) ocean eddies from satellite remote images and assess the feasibility of utilizing SageMaker, which offers tools for deploying AI applications. Moreover, this research not only explores the deployment of cloud-based services for remote sensing of Earth data but also evaluates several YOLO (You Only Look Once) models using single and multi-GPU-based services in the cloud. Furthermore, this study underscores the potential of these services, their limitations, challenges related to deployment and resource management, and their user-friendliness for Earth science projects.
Submitted 10 April, 2024;
originally announced April 2024.
-
Analyzing Misinformation Claims During the 2022 Brazilian General Election on WhatsApp, Twitter, and Kwai
Authors:
Scott A. Hale,
Adriano Belisario,
Ahmed Mostafa,
Chico Camargo
Abstract:
This study analyzes misinformation from WhatsApp, Twitter, and Kwai during the 2022 Brazilian general election. Given the democratic importance of accurate information during elections, multiple fact-checking organizations collaborated to identify and respond to misinformation via WhatsApp tiplines and power a fact-checking feature within a chatbot operated by Brazil's election authority, the TSE. WhatsApp is installed on over 99% of smartphones in Brazil, and the TSE chatbot was used by millions of citizens in the run-up to the elections. During the same period, we collected social media data from Twitter (now X) and Kwai (a popular video-sharing app similar to TikTok). Using the WhatsApp, Kwai, and Twitter data along with fact-checks from three Brazilian fact-checking organizations, we find unique claims on each platform. Even when the same claims are present on different platforms, they often differ in format, detail, length, or other characteristics. Our research highlights the limitations of current claim matching algorithms to match claims across platforms with such differences and identifies areas for further algorithmic development. Finally, we perform a descriptive analysis examining the formats (image, video, audio, text) and content themes of popular misinformation claims.
Submitted 4 January, 2024;
originally announced January 2024.
-
Critical Role of Artificially Intelligent Conversational Chatbot
Authors:
Seraj A. M. Mostafa,
Md Z. Islam,
Mohammad Z. Islam,
Fairose Jeehan,
Saujanna Jafreen,
Raihan U. Islam
Abstract:
Artificially intelligent chatbots, such as ChatGPT, represent a recent and powerful advancement in the AI domain. Users prefer them for obtaining quick and precise answers, avoiding the usual hassle of clicking through multiple links in traditional searches. ChatGPT's conversational approach makes it comfortable and accessible for finding answers quickly and in an organized manner. However, it is important to note that these chatbots have limitations, especially in the accuracy of their answers, and that they raise ethical concerns. In this study, we explore various scenarios involving ChatGPT's ethical implications within academic contexts, its limitations, and the potential misuse by specific user groups. To address these challenges, we propose architectural solutions aimed at preventing inappropriate use and promoting responsible AI interactions.
Submitted 31 October, 2023;
originally announced October 2023.
-
An End-to-End OCR Framework for Robust Arabic-Handwriting Recognition using a Novel Transformers-based Model and an Innovative 270 Million-Words Multi-Font Corpus of Classical Arabic with Diacritics
Authors:
Aly Mostafa,
Omar Mohamed,
Ali Ashraf,
Ahmed Elbehery,
Salma Jamal,
Anas Salah,
Amr S. Ghoneim
Abstract:
This research is the second phase in a series of investigations on developing an Optical Character Recognition (OCR) system for Arabic historical documents and examining how different modeling procedures interact with the problem. The first research studied the effect of Transformers on our custom-built Arabic dataset. One of the downsides of the first research was the size of the training data, a mere 15,000 images from our 30 million images, due to lack of resources. Also, we add an image enhancement layer, time and space optimization, and a post-correction layer to aid the model in predicting the correct word for the correct context. Notably, we propose an end-to-end text recognition approach using a Vision Transformer as an encoder, namely BEIT, and a vanilla Transformer as a decoder, eliminating CNNs for feature extraction and reducing the model's complexity. The experiments show that our end-to-end model outperforms convolutional backbones. The model attained a CER of 4.46%.
Submitted 26 August, 2022; v1 submitted 20 August, 2022;
originally announced August 2022.
-
DeepTrust: A Deep Learning Approach for Measuring Social Media Users Trustworthiness
Authors:
Majed Alrubaian,
Muhammad Al-Qurishi,
Sherif Omar,
Mohamed A. Mostafa
Abstract:
The veracity of data posted on microblog platforms has in recent years been a subject of intensive study by professionals specializing in various fields of informatics as well as sociology, particularly in light of the increasing importance of online tools for news spreading. On Twitter and similar sites, it is possible to report on ongoing situations globally with minimal delay, while the cost of such reporting remains negligible. One of the most important features of this social network is that content delivery can be customized to allow users to focus only on news items covering subject matters they find interesting. With this in mind, it becomes necessary to create verification mechanisms that can ascertain whether the claims made on Twitter can be taken seriously and prevent false content from spreading too far. This study demonstrates an innovative system for verification of information that can fulfill the role described above. The system comprises four mutually connected modules: a legacy module, a trustworthiness classifier, a module managing user authority, and a ranking procedure. All of the modules function within an integrated framework and jointly contribute to an accurate classification of messages and authors. The effectiveness of the solution was evaluated empirically on a sample of Twitter users, with a strict 10-fold evaluation procedure applied for each module. The findings indicate that the solution successfully meets the primary objectives of the study and performs its function as expected.
Submitted 19 January, 2021;
originally announced January 2021.
-
Hyperbolic Deep Neural Networks: A Survey
Authors:
Wei Peng,
Tuomas Varanka,
Abdelrahman Mostafa,
Henglin Shi,
Guoying Zhao
Abstract:
Recently, there has been a rising surge of momentum for deep representation learning in hyperbolic spaces due to their high capacity of modeling data like knowledge graphs or synonym hierarchies, possessing hierarchical structure. We refer to the model as hyperbolic deep neural network in this paper. Such a hyperbolic neural architecture potentially leads to drastically compact model with much more physical interpretability than its counterpart in Euclidean space. To stimulate future research, this paper presents a coherent and comprehensive review of the literature around the neural components in the construction of hyperbolic deep neural networks, as well as the generalization of the leading deep approaches to the Hyperbolic space. It also presents current applications around various machine learning tasks on several publicly available datasets, together with insightful observations and identifying open questions and promising future directions.
Submitted 17 February, 2021; v1 submitted 12 January, 2021;
originally announced January 2021.
-
Parkinson's Disease Detection with Ensemble Architectures based on ILSVRC Models
Authors:
Tahjid Ashfaque Mostafa,
Irene Cheng
Abstract:
In this work, we explore various neural network architectures using Magnetic Resonance (MR) T1 images of the brain to identify Parkinson's Disease (PD), which is one of the most common neurodegenerative and movement disorders. We propose three ensemble architectures combining some winning Convolutional Neural Network models of the ImageNet Large Scale Visual Recognition Challenge (ILSVRC). All of our proposed architectures outperform existing approaches to detect PD from MR images, achieving up to 95% detection accuracy. We also find that when we construct our ensemble architecture using models pretrained on the ImageNet dataset unrelated to PD, the detection performance is significantly better compared to models without any prior training. Our finding suggests a promising direction when no or insufficient training data is available.
Submitted 23 July, 2020;
originally announced July 2020.
-
Parkinson's Disease Detection Using Ensemble Architecture from MR Images
Authors:
Tahjid Ashfaque Mostafa,
Irene Cheng
Abstract:
Parkinson's Disease (PD) is one of the major nervous system disorders that affect people over 60. PD can cause cognitive impairments. In this work, we explore various approaches to identify Parkinson's using Magnetic Resonance (MR) T1 images of the brain. We experiment with ensemble architectures combining some winning Convolutional Neural Network models of the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) and propose two architectures. We find that detection accuracy increases drastically when we focus on the Gray Matter (GM) and White Matter (WM) regions from the MR images instead of using whole MR images. We achieved an average accuracy of 94.7% using smoothed GM and WM extracts and one of our proposed architectures. We also perform occlusion analysis and determine which brain areas are relevant in the architecture's decision-making process.
Submitted 1 July, 2020;
originally announced July 2020.
-
Learning non-rigid surface reconstruction from spatio-temporal image patches
Authors:
Matteo Pedone,
Abdelrahman Mostafa,
Janne Heikkilä
Abstract:
We present a method to reconstruct a dense spatio-temporal depth map of a non-rigidly deformable object directly from a video sequence. The estimation of depth is performed locally on spatio-temporal patches of the video, and then the full depth video of the entire shape is recovered by combining them together. Since the geometric complexity of a local spatio-temporal patch of a deforming non-rigid object is often simple enough to be faithfully represented with a parametric model, we artificially generate a database of small deforming rectangular meshes rendered with different material properties and light conditions, along with their corresponding depth videos, and use such data to train a convolutional neural network. We tested our method on both synthetic and Kinect data and experimentally observed that the reconstruction error is significantly lower than the one obtained using other approaches like conventional non-rigid structure from motion.
Submitted 18 June, 2020;
originally announced June 2020.
-
Modeling an Augmented Reality Game Environment to Enhance Behavior of ADHD Patients
Authors:
Saad Alqithami,
Musaad Alzahrani,
Abdulkareem Alzahrani,
Ahmed Mostafa
Abstract:
The paper generically models an augmented reality game-based environment to project the gamification of an online cognitive behavioral therapist that performs instant measurements for patients with a predefined Attention Deficit Hyperactivity Disorder (ADHD). ADHD is one of the most common neurodevelopmental disorders in which patients have difficulties related to inattention, hyperactivity, and impulsivity. Those patients are in need of psychological therapy; cognitive behavioral therapy, a firmly established treatment, helps enhance the way they think and behave. A major limitation in traditional cognitive behavioral therapies is that therapists may have difficulty optimizing patients' neuropsychological stimulus following a specified treatment plan, i.e., therapists struggle to draw clear images when stimulating patients' mindset to a point where they should be. Other limitations recognized here include availability, accessibility, and level of experience of the therapists. Therefore, the paper presents a gamification model, which we term "AR-Therapist," in order to take advantage of augmented reality developments to engage patients in both real and virtual game-based environments. The model provides on-time measurements of patients' progress throughout the treatment sessions, which, as a result, overcomes limitations observed in traditional cognitive behavioral therapies.
Submitted 3 November, 2019;
originally announced November 2019.
-
A Framework for a Smart Social Blood Donation System Based on Mobile Cloud Computing
Authors:
Almetwally M. Mostafa,
Ahmed E. Youssef,
Gamal Alshorbagy
Abstract:
Blood Donation and Blood Transfusion Services (BTS) are crucial for saving people's lives. Recently, worldwide efforts have been undertaken to utilize social media and smartphone applications to make the blood donation process more convenient, offer additional services, and create communities around blood donation centers. Blood banks suffer frequent shortages of blood; hence, advertisements are frequently seen on social networks urging healthy individuals to donate blood for patients who urgently require blood transfusion. The blood donation process usually consumes a lot of time and effort from both donors and medical staff, since there is no concrete information system that allows donors and blood donation centers to communicate efficiently and coordinate with each other to minimize the time and effort required for the blood donation process. Moreover, most blood banks work in isolation and are not integrated with other blood donation centers and health organizations, which affects the quality of blood donation and blood transfusion services. This work aims at developing a Blood Donation System (BDS) based on the cutting-edge information technologies of cloud computing and mobile computing.
Submitted 8 April, 2019; v1 submitted 23 December, 2014;
originally announced December 2014.