-
Trustable Mobile Crowd Sourcing for Acquiring Information from a Flooded Smart Area
Authors:
Sajedeh Abbasi,
Hamed Vahdat-Nejad,
Hamideh Hajiabadi
Abstract:
Flood is a natural phenomenon that causes severe environmental damage and destruction in smart cities. After a flood, topographic, geological, and living conditions change. As a result, the previous information regarding the environment is no more valid. Rescue and relief organizations that intend to help the affected people need to obtain new and accurate information about the conditions of the f…
▽ More
Flood is a natural phenomenon that causes severe environmental damage and destruction in smart cities. After a flood, topographic, geological, and living conditions change. As a result, the previous information regarding the environment is no more valid. Rescue and relief organizations that intend to help the affected people need to obtain new and accurate information about the conditions of the flooded environment. Acquiring this required information in the shortest time is a challenge for realizing smart cities. Due to the advances in the Internet of Things technology and the prevalence of smartphones with several sensors and functionalities, it is possible to obtain the required information by leveraging the Crowdsourcing model. In this paper, the information required from a flooded area is classified into four categories: victim, Facility and Livelihood, medical, and transfer. Next, a crowdsourcing scheme for acquiring information is proposed, including malicious user detection to ensure the accuracy of information received. Finally, simulation results indicate that the proposed scheme correctly detects malicious users and ensures the quality of obtained information.
△ Less
Submitted 9 March, 2022;
originally announced March 2022.
-
Investigating the Impact of COVID-19 on Education by Social Network Mining
Authors:
Mohadese Jamalian,
Hamed Vahdat-Nejad,
Hamideh Hajiabadi
Abstract:
The Covid-19 virus has been one of the most discussed topics on social networks in 2020 and 2021 and has affected the classic educational paradigm, worldwide. In this research, many tweets related to the Covid-19 virus and education are considered and geo-tagged with the help of the GeoNames geographic database, which contains a large number of place names. To detect the feeling of users, sentimen…
▽ More
The Covid-19 virus has been one of the most discussed topics on social networks in 2020 and 2021 and has affected the classic educational paradigm, worldwide. In this research, many tweets related to the Covid-19 virus and education are considered and geo-tagged with the help of the GeoNames geographic database, which contains a large number of place names. To detect the feeling of users, sentiment analysis is performed using the RoBERTa language-based model. Finally, we obtain the trends of frequency of total, positive, and negative tweets for countries with a high number of Covid-19 confirmed cases. Investigating the results reveals a correlation between the trends of tweet frequency and the official statistic of confirmed cases for several countries.
△ Less
Submitted 13 March, 2022;
originally announced March 2022.
-
Extracting Feelings of People Regarding COVID-19 by Social Network Mining
Authors:
Hamed Vahdat-Nejad,
Fatemeh Salmani,
Mahdi Hajiabadi,
Faezeh Azizi,
Sajedeh Abbasi,
Mohadese Jamalian,
Reyhane Mosafer,
Hamideh Hajiabadi
Abstract:
In 2020, COVID-19 became the chief concern of the world and is still reflected widely in all social networks. Each day, users post millions of tweets and comments on this subject, which contain significant implicit information about the public opinion. In this regard, a dataset of COVID-related tweets in English language is collected, which consists of more than two million tweets from March 23 to…
▽ More
In 2020, COVID-19 became the chief concern of the world and is still reflected widely in all social networks. Each day, users post millions of tweets and comments on this subject, which contain significant implicit information about the public opinion. In this regard, a dataset of COVID-related tweets in English language is collected, which consists of more than two million tweets from March 23 to June 23 of 2020 to extract the feelings of the people in various countries in the early stages of this outbreak. To this end, first, we use a lexicon-based approach in conjunction with the GeoNames geographic database to label the tweets with their locations. Next, a method based on the recently introduced and widely cited RoBERTa model is proposed to analyze their sentimental content. After that, the trend graphs of the frequency of tweets as well as sentiments are produced for the world and the nations that were more engaged with COVID-19. Graph analysis shows that the frequency graphs of the tweets for the majority of nations are significantly correlated with the official statistics of the daily afflicted in them. Moreover, several implicit knowledge is extracted and discussed.
△ Less
Submitted 12 October, 2021;
originally announced October 2021.
-
Analyzing the Impact of COVID-19 on Economy from the Perspective of Users Reviews
Authors:
Fatemeh Salmani,
Hamed Vahdat-Nejad,
Hamideh Hajiabadi
Abstract:
One of the most important incidents in the world in 2020 is the outbreak of the Coronavirus. Users on social networks publish a large number of comments about this event. These comments contain important hidden information of public opinion regarding this pandemic. In this research, a large number of Coronavirus-related tweets are considered and analyzed using natural language processing and infor…
▽ More
One of the most important incidents in the world in 2020 is the outbreak of the Coronavirus. Users on social networks publish a large number of comments about this event. These comments contain important hidden information of public opinion regarding this pandemic. In this research, a large number of Coronavirus-related tweets are considered and analyzed using natural language processing and information retrieval science. Initially, the location of the tweets is determined using a dictionary prepared through the Geo-Names geographic database, which contains detailed and complete information of places such as city names, streets, and postal codes. Then, using a large dictionary prepared from the terms of economics, related tweets are extracted and sentiments corresponded to tweets are analyzed with the help of the RoBERTa language-based model, which has high accuracy and good performance. Finally, the frequency chart of tweets related to the economy and their sentiment scores (positive and negative tweets) is plotted over time for the entire world and the top 10 economies. From the analysis of the charts, we learn that the reason for publishing economic tweets is not only the increase in the number of people infected with the Coronavirus but also imposed restrictions and lockdowns in countries. The consequences of these restrictions include the loss of millions of jobs and the economic downturn.
△ Less
Submitted 5 October, 2021;
originally announced October 2021.
-
Extracting Major Topics of COVID-19 Related Tweets
Authors:
Faezeh Azizi,
Hamed Vahdat-Nejad,
Hamideh Hajiabadi,
Mohammad Hossein Khosravi
Abstract:
With the outbreak of the Covid-19 virus, the activity of users on Twitter has significantly increased. Some studies have investigated the hot topics of tweets in this period; however, little attention has been paid to presenting and analyzing the spatial and temporal trends of Covid-19 topics. In this study, we use the topic modeling method to extract global topics during the nationwide quarantine…
▽ More
With the outbreak of the Covid-19 virus, the activity of users on Twitter has significantly increased. Some studies have investigated the hot topics of tweets in this period; however, little attention has been paid to presenting and analyzing the spatial and temporal trends of Covid-19 topics. In this study, we use the topic modeling method to extract global topics during the nationwide quarantine periods (March 23 to June 23, 2020) on Covid-19 tweets. We implement the Latent Dirichlet Allocation (LDA) algorithm to extract the topics and then name them with the "reopening", "death cases", "telecommuting", "protests", "anger expression", "masking", "medication", "social distance", "second wave", and "peak of the disease" titles. We additionally analyze temporal trends of the topics for the whole world and four countries. By analyzing the graphs, fascinating results are obtained from altering users' focus on topics over time.
△ Less
Submitted 5 October, 2021;
originally announced October 2021.
-
Robust Metric Learning based on the Rescaled Hinge Loss
Authors:
Sumia Abdulhussien Razooqi Al-Obaidi,
Davood Zabihzadeh,
Hamideh Hajiabadi
Abstract:
Distance/Similarity learning is a fundamental problem in machine learning. For example, kNN classifier or clustering methods are based on a distance/similarity measure. Metric learning algorithms enhance the efficiency of these methods by learning an optimal distance function from data. Most metric learning methods need training information in the form of pair or triplet sets. Nowadays, this train…
▽ More
Distance/Similarity learning is a fundamental problem in machine learning. For example, kNN classifier or clustering methods are based on a distance/similarity measure. Metric learning algorithms enhance the efficiency of these methods by learning an optimal distance function from data. Most metric learning methods need training information in the form of pair or triplet sets. Nowadays, this training information often is obtained from the Internet via crowdsourcing methods. Therefore, this information may contain label noise or outliers leading to the poor performance of the learned metric. It is even possible that the learned metric functions perform worse than the general metrics such as Euclidean distance. To address this challenge, this paper presents a new robust metric learning method based on the Rescaled Hinge loss. This loss function is a general case of the popular Hinge loss and initially introduced in (Xu et al. 2017) to develop a new robust SVM algorithm. In this paper, we formulate the metric learning problem using the Rescaled Hinge loss function and then develop an efficient algorithm based on HQ (Half-Quadratic) to solve the problem. Experimental results on a variety of both real and synthetic datasets confirm that our new robust algorithm considerably outperforms state-of-the-art metric learning methods in the presence of label noise and outliers.
△ Less
Submitted 22 January, 2020; v1 submitted 26 April, 2019;
originally announced April 2019.
-
RELF: Robust Regression Extended with Ensemble Loss Function
Authors:
Hamideh Hajiabadi,
Reza Monsefi,
Hadi Sadoghi Yazdi
Abstract:
Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta-learning framework, ensemble techniques can easily be applied to many machine learning methods. Inspired by ensemble techniques, in this paper we propose an ensemble loss functions applied to a simple regressor. We then propose a half-quadratic learning algorithm in order to find the p…
▽ More
Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta-learning framework, ensemble techniques can easily be applied to many machine learning methods. Inspired by ensemble techniques, in this paper we propose an ensemble loss functions applied to a simple regressor. We then propose a half-quadratic learning algorithm in order to find the parameter of the regressor and the optimal weights associated with each loss function. Moreover, we show that our proposed loss function is robust in noisy environments. For a particular class of loss functions, we show that our proposed ensemble loss function is Bayes consistent and robust. Experimental evaluations on several datasets demonstrate that our proposed ensemble loss function significantly improves the performance of a simple regressor in comparison with state-of-the-art methods.
△ Less
Submitted 25 October, 2018;
originally announced October 2018.
-
On Extending Neural Networks with Loss Ensembles for Text Classification
Authors:
Hamideh Hajiabadi,
Diego Molla-Aliod,
Reza Monsefi
Abstract:
Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta learning framework, ensemble techniques can easily be applied to many machine learning techniques. In this paper we propose a neural network extended with an ensemble loss function for text classification. The weight of each weak loss function is tuned within the training phase through…
▽ More
Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta learning framework, ensemble techniques can easily be applied to many machine learning techniques. In this paper we propose a neural network extended with an ensemble loss function for text classification. The weight of each weak loss function is tuned within the training phase through the gradient propagation optimization method of the neural network. The approach is evaluated on several text classification datasets. We also evaluate its performance in various environments with several degrees of label noise. Experimental results indicate an improvement of the results and strong resilience against label noise in comparison with other methods.
△ Less
Submitted 14 November, 2017;
originally announced November 2017.