Search | arXiv e-print repository

Trustable Mobile Crowd Sourcing for Acquiring Information from a Flooded Smart Area

Authors: Sajedeh Abbasi, Hamed Vahdat-Nejad, Hamideh Hajiabadi

Abstract: Flood is a natural phenomenon that causes severe environmental damage and destruction in smart cities. After a flood, topographic, geological, and living conditions change. As a result, the previous information regarding the environment is no more valid. Rescue and relief organizations that intend to help the affected people need to obtain new and accurate information about the conditions of the f… ▽ More Flood is a natural phenomenon that causes severe environmental damage and destruction in smart cities. After a flood, topographic, geological, and living conditions change. As a result, the previous information regarding the environment is no more valid. Rescue and relief organizations that intend to help the affected people need to obtain new and accurate information about the conditions of the flooded environment. Acquiring this required information in the shortest time is a challenge for realizing smart cities. Due to the advances in the Internet of Things technology and the prevalence of smartphones with several sensors and functionalities, it is possible to obtain the required information by leveraging the Crowdsourcing model. In this paper, the information required from a flooded area is classified into four categories: victim, Facility and Livelihood, medical, and transfer. Next, a crowdsourcing scheme for acquiring information is proposed, including malicious user detection to ensure the accuracy of information received. Finally, simulation results indicate that the proposed scheme correctly detects malicious users and ensures the quality of obtained information. △ Less

Submitted 9 March, 2022; originally announced March 2022.

Comments: The article consists of 3 pages

ACM Class: H.4; I.5

arXiv:2203.06584 [pdf]

Investigating the Impact of COVID-19 on Education by Social Network Mining

Authors: Mohadese Jamalian, Hamed Vahdat-Nejad, Hamideh Hajiabadi

Abstract: The Covid-19 virus has been one of the most discussed topics on social networks in 2020 and 2021 and has affected the classic educational paradigm, worldwide. In this research, many tweets related to the Covid-19 virus and education are considered and geo-tagged with the help of the GeoNames geographic database, which contains a large number of place names. To detect the feeling of users, sentimen… ▽ More The Covid-19 virus has been one of the most discussed topics on social networks in 2020 and 2021 and has affected the classic educational paradigm, worldwide. In this research, many tweets related to the Covid-19 virus and education are considered and geo-tagged with the help of the GeoNames geographic database, which contains a large number of place names. To detect the feeling of users, sentiment analysis is performed using the RoBERTa language-based model. Finally, we obtain the trends of frequency of total, positive, and negative tweets for countries with a high number of Covid-19 confirmed cases. Investigating the results reveals a correlation between the trends of tweet frequency and the official statistic of confirmed cases for several countries. △ Less

Submitted 13 March, 2022; originally announced March 2022.

Comments: 6 pages, 1 figures

arXiv:2110.06151 [pdf]

Extracting Feelings of People Regarding COVID-19 by Social Network Mining

Authors: Hamed Vahdat-Nejad, Fatemeh Salmani, Mahdi Hajiabadi, Faezeh Azizi, Sajedeh Abbasi, Mohadese Jamalian, Reyhane Mosafer, Hamideh Hajiabadi

Abstract: In 2020, COVID-19 became the chief concern of the world and is still reflected widely in all social networks. Each day, users post millions of tweets and comments on this subject, which contain significant implicit information about the public opinion. In this regard, a dataset of COVID-related tweets in English language is collected, which consists of more than two million tweets from March 23 to… ▽ More In 2020, COVID-19 became the chief concern of the world and is still reflected widely in all social networks. Each day, users post millions of tweets and comments on this subject, which contain significant implicit information about the public opinion. In this regard, a dataset of COVID-related tweets in English language is collected, which consists of more than two million tweets from March 23 to June 23 of 2020 to extract the feelings of the people in various countries in the early stages of this outbreak. To this end, first, we use a lexicon-based approach in conjunction with the GeoNames geographic database to label the tweets with their locations. Next, a method based on the recently introduced and widely cited RoBERTa model is proposed to analyze their sentimental content. After that, the trend graphs of the frequency of tweets as well as sentiments are produced for the world and the nations that were more engaged with COVID-19. Graph analysis shows that the frequency graphs of the tweets for the majority of nations are significantly correlated with the official statistics of the daily afflicted in them. Moreover, several implicit knowledge is extracted and discussed. △ Less

Submitted 12 October, 2021; originally announced October 2021.

arXiv:2110.02198 [pdf]

Analyzing the Impact of COVID-19 on Economy from the Perspective of Users Reviews

Authors: Fatemeh Salmani, Hamed Vahdat-Nejad, Hamideh Hajiabadi

Abstract: One of the most important incidents in the world in 2020 is the outbreak of the Coronavirus. Users on social networks publish a large number of comments about this event. These comments contain important hidden information of public opinion regarding this pandemic. In this research, a large number of Coronavirus-related tweets are considered and analyzed using natural language processing and infor… ▽ More One of the most important incidents in the world in 2020 is the outbreak of the Coronavirus. Users on social networks publish a large number of comments about this event. These comments contain important hidden information of public opinion regarding this pandemic. In this research, a large number of Coronavirus-related tweets are considered and analyzed using natural language processing and information retrieval science. Initially, the location of the tweets is determined using a dictionary prepared through the Geo-Names geographic database, which contains detailed and complete information of places such as city names, streets, and postal codes. Then, using a large dictionary prepared from the terms of economics, related tweets are extracted and sentiments corresponded to tweets are analyzed with the help of the RoBERTa language-based model, which has high accuracy and good performance. Finally, the frequency chart of tweets related to the economy and their sentiment scores (positive and negative tweets) is plotted over time for the entire world and the top 10 economies. From the analysis of the charts, we learn that the reason for publishing economic tweets is not only the increase in the number of people infected with the Coronavirus but also imposed restrictions and lockdowns in countries. The consequences of these restrictions include the loss of millions of jobs and the economic downturn. △ Less

Submitted 5 October, 2021; originally announced October 2021.

arXiv:2110.01876 [pdf]

Extracting Major Topics of COVID-19 Related Tweets

Authors: Faezeh Azizi, Hamed Vahdat-Nejad, Hamideh Hajiabadi, Mohammad Hossein Khosravi

Abstract: With the outbreak of the Covid-19 virus, the activity of users on Twitter has significantly increased. Some studies have investigated the hot topics of tweets in this period; however, little attention has been paid to presenting and analyzing the spatial and temporal trends of Covid-19 topics. In this study, we use the topic modeling method to extract global topics during the nationwide quarantine… ▽ More With the outbreak of the Covid-19 virus, the activity of users on Twitter has significantly increased. Some studies have investigated the hot topics of tweets in this period; however, little attention has been paid to presenting and analyzing the spatial and temporal trends of Covid-19 topics. In this study, we use the topic modeling method to extract global topics during the nationwide quarantine periods (March 23 to June 23, 2020) on Covid-19 tweets. We implement the Latent Dirichlet Allocation (LDA) algorithm to extract the topics and then name them with the "reopening", "death cases", "telecommuting", "protests", "anger expression", "masking", "medication", "social distance", "second wave", and "peak of the disease" titles. We additionally analyze temporal trends of the topics for the whole world and four countries. By analyzing the graphs, fascinating results are obtained from altering users' focus on topics over time. △ Less

Submitted 5 October, 2021; originally announced October 2021.

arXiv:1904.11711 [pdf]

doi 10.1007/s13042-020-01137-z

Robust Metric Learning based on the Rescaled Hinge Loss

Authors: Sumia Abdulhussien Razooqi Al-Obaidi, Davood Zabihzadeh, Hamideh Hajiabadi

Abstract: Distance/Similarity learning is a fundamental problem in machine learning. For example, kNN classifier or clustering methods are based on a distance/similarity measure. Metric learning algorithms enhance the efficiency of these methods by learning an optimal distance function from data. Most metric learning methods need training information in the form of pair or triplet sets. Nowadays, this train… ▽ More Distance/Similarity learning is a fundamental problem in machine learning. For example, kNN classifier or clustering methods are based on a distance/similarity measure. Metric learning algorithms enhance the efficiency of these methods by learning an optimal distance function from data. Most metric learning methods need training information in the form of pair or triplet sets. Nowadays, this training information often is obtained from the Internet via crowdsourcing methods. Therefore, this information may contain label noise or outliers leading to the poor performance of the learned metric. It is even possible that the learned metric functions perform worse than the general metrics such as Euclidean distance. To address this challenge, this paper presents a new robust metric learning method based on the Rescaled Hinge loss. This loss function is a general case of the popular Hinge loss and initially introduced in (Xu et al. 2017) to develop a new robust SVM algorithm. In this paper, we formulate the metric learning problem using the Rescaled Hinge loss function and then develop an efficient algorithm based on HQ (Half-Quadratic) to solve the problem. Experimental results on a variety of both real and synthetic datasets confirm that our new robust algorithm considerably outperforms state-of-the-art metric learning methods in the presence of label noise and outliers. △ Less

Submitted 22 January, 2020; v1 submitted 26 April, 2019; originally announced April 2019.

Report number: https://link.springer.com/article/10.1007/s13042-020-01137-z

Journal ref: Robust metric learning based on the rescaled hinge loss. International Journal of Machine Learning and Cybernetics 11 (2020): 2515-2528

arXiv:1810.11071 [pdf, other]

doi 10.1007/s10489-018-1341-9

RELF: Robust Regression Extended with Ensemble Loss Function

Authors: Hamideh Hajiabadi, Reza Monsefi, Hadi Sadoghi Yazdi

Abstract: Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta-learning framework, ensemble techniques can easily be applied to many machine learning methods. Inspired by ensemble techniques, in this paper we propose an ensemble loss functions applied to a simple regressor. We then propose a half-quadratic learning algorithm in order to find the p… ▽ More Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta-learning framework, ensemble techniques can easily be applied to many machine learning methods. Inspired by ensemble techniques, in this paper we propose an ensemble loss functions applied to a simple regressor. We then propose a half-quadratic learning algorithm in order to find the parameter of the regressor and the optimal weights associated with each loss function. Moreover, we show that our proposed loss function is robust in noisy environments. For a particular class of loss functions, we show that our proposed ensemble loss function is Bayes consistent and robust. Experimental evaluations on several datasets demonstrate that our proposed ensemble loss function significantly improves the performance of a simple regressor in comparison with state-of-the-art methods. △ Less

Submitted 25 October, 2018; originally announced October 2018.

Comments: 18 Pages, 7 figures, Accepted in Applied Intelligence- Springer The International Journal of Research on Intelligent Systems for Real Life Complex Problems

arXiv:1711.05170 [pdf, other]

On Extending Neural Networks with Loss Ensembles for Text Classification

Authors: Hamideh Hajiabadi, Diego Molla-Aliod, Reza Monsefi

Abstract: Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta learning framework, ensemble techniques can easily be applied to many machine learning techniques. In this paper we propose a neural network extended with an ensemble loss function for text classification. The weight of each weak loss function is tuned within the training phase through… ▽ More Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta learning framework, ensemble techniques can easily be applied to many machine learning techniques. In this paper we propose a neural network extended with an ensemble loss function for text classification. The weight of each weak loss function is tuned within the training phase through the gradient propagation optimization method of the neural network. The approach is evaluated on several text classification datasets. We also evaluate its performance in various environments with several degrees of label noise. Experimental results indicate an improvement of the results and strong resilience against label noise in comparison with other methods. △ Less

Submitted 14 November, 2017; originally announced November 2017.

Comments: 5 pages, 5 tables, 1 figure. Camera-ready submitted to The 2017 Australasian Language Technology Association Workshop (ALTA 2017)

Showing 1–8 of 8 results for author: Hajiabadi, H