-
Computer Vision in the Food Industry: Accurate, Real-time, and Automatic Food Recognition with Pretrained MobileNetV2
Authors:
Shayan Rokhva,
Babak Teimourpour,
Amir Hossein Soltani
Abstract:
In contemporary society, the application of artificial intelligence for automatic food recognition offers substantial potential for nutrition tracking, reducing food waste, and enhancing productivity in food production and consumption scenarios. Modern technologies such as Computer Vision and Deep Learning are highly beneficial, enabling machines to learn automatically, thereby facilitating automa…
▽ More
In contemporary society, the application of artificial intelligence for automatic food recognition offers substantial potential for nutrition tracking, reducing food waste, and enhancing productivity in food production and consumption scenarios. Modern technologies such as Computer Vision and Deep Learning are highly beneficial, enabling machines to learn automatically, thereby facilitating automatic visual recognition. Despite some research in this field, the challenge of achieving accurate automatic food recognition quickly remains a significant research gap. Some models have been developed and implemented, but maintaining high performance swiftly, with low computational cost and low access to expensive hardware accelerators, still needs further exploration and research. This study employs the pretrained MobileNetV2 model, which is efficient and fast, for food recognition on the public Food11 dataset, comprising 16643 images. It also utilizes various techniques such as dataset understanding, transfer learning, data augmentation, regularization, dynamic learning rate, hyperparameter tuning, and consideration of images in different sizes to enhance performance and robustness. These techniques aid in choosing appropriate metrics, achieving better performance, avoiding overfitting and accuracy fluctuations, speeding up the model, and increasing the generalization of findings, making the study and its results applicable to practical applications. Despite employing a light model with a simpler structure and fewer trainable parameters compared to some deep and dense models in the deep learning area, it achieved commendable accuracy in a short time. This underscores the potential for practical implementation, which is the main intention of this study.
△ Less
Submitted 19 May, 2024;
originally announced May 2024.
-
Analysis of Persian News Agencies on Instagram, A Words Co-occurrence Graph-based Approach
Authors:
Mohammad Heydari,
Babak Teimourpour
Abstract:
The rise of the Internet and the exponential increase in data have made manual data summarization and analysis a challenging task. Instagram social network is a prominent social network widely utilized in Iran for information sharing and communication across various age groups. The inherent structure of Instagram, characterized by its text-rich content and graph-like data representation, enables t…
▽ More
The rise of the Internet and the exponential increase in data have made manual data summarization and analysis a challenging task. Instagram social network is a prominent social network widely utilized in Iran for information sharing and communication across various age groups. The inherent structure of Instagram, characterized by its text-rich content and graph-like data representation, enables the utilization of text and graph processing techniques for data analysis purposes. The degree distributions of these networks exhibit scale-free characteristics, indicating non-random growth patterns. Recently, word co-occurrence has gained attention from researchers across multiple disciplines due to its simplicity and practicality. Keyword extraction is a crucial task in natural language processing. In this study, we demonstrated that high-precision extraction of keywords from Instagram posts in the Persian language can be achieved using unsupervised word co-occurrence methods without resorting to conventional techniques such as clustering or pre-trained models. After graph visualization and community detection, it was observed that the top topics covered by news agencies are represented by these graphs. This approach is generalizable to new and diverse datasets and can provide acceptable outputs for new data. To the author's knowledge, this method has not been employed in the Persian language before on Instagram social network. The new crawled data has been publicly released on GitHub for exploration by other researchers. By employing this method, it is possible to use other graph-based algorithms, such as community detections. The results help us to identify the key role of different news agencies in information diffusion among the public, identify hidden communities, and discover latent patterns among a massive amount of data.
△ Less
Submitted 19 February, 2024;
originally announced February 2024.
-
Graph Representation Learning Towards Patents Network Analysis
Authors:
Mohammad Heydari,
Babak Teimourpour
Abstract:
Patent analysis has recently been recognized as a powerful technique for large companies worldwide to lend them insight into the age of competition among various industries. This technique is considered a shortcut for develo** countries since it can significantly accelerate their technology development. Therefore, as an inevitable process, patent analysis can be utilized to monitor rival compani…
▽ More
Patent analysis has recently been recognized as a powerful technique for large companies worldwide to lend them insight into the age of competition among various industries. This technique is considered a shortcut for develo** countries since it can significantly accelerate their technology development. Therefore, as an inevitable process, patent analysis can be utilized to monitor rival companies and diverse industries. This research employed a graph representation learning approach to create, analyze, and find similarities in the patent data registered in the Iranian Official Gazette. The patent records were scrapped and wrangled through the Iranian Official Gazette portal. Afterward, the key entities were extracted from the scrapped patents dataset to create the Iranian patents graph from scratch based on novel natural language processing and entity resolution techniques. Finally, thanks to the utilization of novel graph algorithms and text mining methods, we identified new areas of industry and research from Iranian patent data, which can be used extensively to prevent duplicate patents, familiarity with similar and connected inventions, Awareness of legal entities supporting patents and knowledge of researchers and linked stakeholders in a particular research field.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
A Food Recommender System in Academic Environments Based on Machine Learning Models
Authors:
Abolfazl Ajami,
Babak Teimourpour
Abstract:
Background: People's health depends on the use of proper diet as an important factor. Today, with the increasing mechanization of people's lives, proper eating habits and behaviors are neglected. On the other hand, food recommendations in the field of health have also tried to deal with this issue. But with the introduction of the Western nutrition style and the advancement of Western chemical med…
▽ More
Background: People's health depends on the use of proper diet as an important factor. Today, with the increasing mechanization of people's lives, proper eating habits and behaviors are neglected. On the other hand, food recommendations in the field of health have also tried to deal with this issue. But with the introduction of the Western nutrition style and the advancement of Western chemical medicine, many issues have emerged in the field of disease treatment and nutrition. Recent advances in technology and the use of artificial intelligence methods in information systems have led to the creation of recommender systems in order to improve people's health. Methods: A hybrid recommender system including, collaborative filtering, content-based, and knowledge-based models was used. Machine learning models such as Decision Tree, k-Nearest Neighbors (kNN), AdaBoost, and Bagging were investigated in the field of food recommender systems on 2519 students in the nutrition management system of a university. Student information including profile information for basal metabolic rate, student reservation records, and selected diet type is received online. Among the 15 features collected and after consulting nutrition experts, the most effective features are selected through feature engineering. Using machine learning models based on energy indicators and food selection history by students, food from the university menu is recommended to students. Results: The AdaBoost model has the highest performance in terms of accuracy with a rate of 73.70 percent. Conclusion: Considering the importance of diet in people's health, recommender systems are effective in obtaining useful information from a huge amount of data. Keywords: Recommender system, Food behavior and habits, Machine learning, Classification
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
Cancer driver gene detection in transcriptional regulatory networks using the structure analysis of weighted regulatory interactions
Authors:
Mostafa Akhavan Safar,
Babak Teimourpour,
Abbas Nozari-Dalini
Abstract:
Identification of genes that initiate cell anomalies and cause cancer in humans is among the important fields in the oncology researches. The mutation and development of anomalies in these genes are then transferred to other genes in the cell and therefore disrupt the normal functionality of the cell. These genes are known as cancer driver genes (CDGs). Various methods have been proposed for predi…
▽ More
Identification of genes that initiate cell anomalies and cause cancer in humans is among the important fields in the oncology researches. The mutation and development of anomalies in these genes are then transferred to other genes in the cell and therefore disrupt the normal functionality of the cell. These genes are known as cancer driver genes (CDGs). Various methods have been proposed for predicting CDGs, most of which based on genomic data and based on computational methods. Therefore, some researchers have developed novel bioinformatics approaches. In this study, we propose an algorithm, which is able to calculate the effectiveness and strength of each gene and rank them by using the gene regulatory networks and the stochastic analysis of regulatory linking structures between genes. To do so, firstly we constructed the regulatory network using gene expression data and the list of regulatory interactions. Then, using biological and topological features of the network, we weighted the regulatory interactions. After that, the obtained regulatory interactions weight was used in interaction structure analysis process. Interaction analysis was achieved using two separate Markov chains on the bipartite graph obtained from the main graph of the gene network. To do so, the stochastic approach for link-structure analysis has been implemented. The proposed algorithm categorizes higher-ranked genes as driver genes. The efficiency of the proposed algorithm, regarding the F-measure value and number of identified driver genes, was compared with 23 other computational and network-based methods.
△ Less
Submitted 19 January, 2021;
originally announced January 2021.
-
KatzDriver: A network Based method to predict cancer causal genes in GR Network
Authors:
Mostafa Akhavansafar,
Babak Teimourpour
Abstract:
One of the important issues in oncology is finding the genes that perturbation the cell functionality, and result in cancer propagation. The genes, namely driver genes, when they mutate in expression, result in cancer through activation of the mutated proteins. So, many methods have been introduced to predict this group of genes. These are mostly computational methods based on the number of mutati…
▽ More
One of the important issues in oncology is finding the genes that perturbation the cell functionality, and result in cancer propagation. The genes, namely driver genes, when they mutate in expression, result in cancer through activation of the mutated proteins. So, many methods have been introduced to predict this group of genes. These are mostly computational methods based on the number of mutations of each gene. Recently, some network-based methods have been proposed to predict Cancer Driver Genes (CDGs). In this study, we use a network-based approach and relative importance of each gene in the propagation and absorption of genes anomalies in the network to recognize CDGs. The experimental results are compared with 19 previous methods that show our proposed algorithm is better than the others in terms of accuracy, precision, and the number of recognized CDGs.
△ Less
Submitted 13 March, 2020;
originally announced March 2020.
-
Analysis of ResearchGate, A Community Detection Approach
Authors:
Mohammad Heydari,
Babak Teimourpour
Abstract:
We are living in the data age. Communications over scientific networks creates new opportunities for researchers who aim to discover the hidden pattern in these huge repositories. This study utilizes network science to create collaboration network of Iranian Scientific Institutions. A modularity-based approach applied to find network communities. To reach a big picture of science production flow,…
▽ More
We are living in the data age. Communications over scientific networks creates new opportunities for researchers who aim to discover the hidden pattern in these huge repositories. This study utilizes network science to create collaboration network of Iranian Scientific Institutions. A modularity-based approach applied to find network communities. To reach a big picture of science production flow, analysis of the collaboration network is crucial. Our results demonstrated that geographic location closeness and ethnic attributes has important roles in academic collaboration network establishment. Besides, it shows that famous scientific centers in the capital city of Iran, Tehran has strong influence on the production flow of scientific activities. These academic papers are mostly viewed and downloaded from the United State of America, China, India, and Iran. The motivation of this research is that by discovering hidden communities in the network and finding the structure of intuitions communications, we can identify each scientific center research potential separately and clear mutual scientific fields. Therefore, an efficient strategic program can be designed, developed and tested to keep scientific centers in progress way and navigate their research goals into a straight useful roadmap to identify and fill the unknown gaps.
△ Less
Submitted 18 September, 2023; v1 submitted 11 March, 2020;
originally announced March 2020.
-
A Network Science Approach to Driver Gene Detection In Human Regulatory Network Using Genes Influence Evaluation
Authors:
Mostafa Akhavan Safar,
Babak Teimourpour,
Mehrdad Kargari
Abstract:
Cancer disease occurs because of a disorder in the cellular regulatory mechanism, Which causes cellular malformation. The genes that start the malformation are called Cancer driver genes (CDGs) . Numerous computational methods have been introduced to identify cancer driver genes that use the concept of mutation.Regarding abnormalities spread in human cell and tumor development, CDGs are likely to…
▽ More
Cancer disease occurs because of a disorder in the cellular regulatory mechanism, Which causes cellular malformation. The genes that start the malformation are called Cancer driver genes (CDGs) . Numerous computational methods have been introduced to identify cancer driver genes that use the concept of mutation.Regarding abnormalities spread in human cell and tumor development, CDGs are likely to be the potential types of gene with high influence in the network. This increases the importance of influence diffusion concept for the identification of CDGs.recently developed a method based on influence maximization for identifying cancer driver genes. One of the challenges in these types of networks is to find the power of regulatory interaction between edges.The current study developed a technique to identify cancer driver gene and predict the impact of regulatory interactions in a transcriptional regulatory network. This technique utilizes the concept of influence diffusion and optimizes the Hyperlink-Induced Topic Search algorithm based on the influence diffusion. The results suggest the better performance of our proposed technique than the other computational and network-based approaches.
△ Less
Submitted 26 January, 2020;
originally announced January 2020.
-
The detection of professional fraud in automobile insurance using social network analysis
Authors:
Arezo Bodaghi,
Babak Teimourpour
Abstract:
The Automobile Insurance Fraud is one of the main challenges for insurance companies. This form of fraud is performed either opportunistic or professional occurring through group cooperation that leads to greater financial losses, while most presented methods thus far are unsuited for flagging these groups. The article has put forward a new approach for identification, representation, and analysis…
▽ More
The Automobile Insurance Fraud is one of the main challenges for insurance companies. This form of fraud is performed either opportunistic or professional occurring through group cooperation that leads to greater financial losses, while most presented methods thus far are unsuited for flagging these groups. The article has put forward a new approach for identification, representation, and analysis of organized fraudulent groups in automobile insurance through focusing on structural aspects of networks, and cycles in particular, that demonstrate the occurrence of potential fraud. Suspicious groups have been detected by applying cycle detection algorithms (using both DFS, BFS trees), afterward, the probability of being fraudulent for suspicious components were investigated to reveal fraudulent groups with the maximum likelihood, and their reviews were prioritized. The actual data of Iran Insurance Company is used for evaluating the provided approach. As a result, the detection of cycles is not only more efficient, accurate, but also less time-consuming in comparison with previous methods for finding such groups.
△ Less
Submitted 19 June, 2018; v1 submitted 24 May, 2018;
originally announced May 2018.