Search | arXiv e-print repository

Machine Learning Advances aiding Recognition and Classification of Indian Monuments and Landmarks

Authors: Aditya Jyoti Paul, Smaranjit Ghose, Kanishka Aggarwal, Niketha Nethaji, Shivam Pal, Arnab Dutta Purkayastha

Abstract: Tourism in India plays a quintessential role in the country's economy with an estimated 9.2% GDP share for the year 2018. With a yearly growth rate of 6.2%, the industry holds a huge potential for being the primary driver of the economy as observed in the nations of the Middle East like the United Arab Emirates. The historical and cultural diversity exhibited throughout the geography of the nation… ▽ More Tourism in India plays a quintessential role in the country's economy with an estimated 9.2% GDP share for the year 2018. With a yearly growth rate of 6.2%, the industry holds a huge potential for being the primary driver of the economy as observed in the nations of the Middle East like the United Arab Emirates. The historical and cultural diversity exhibited throughout the geography of the nation is a unique spectacle for people around the world and therefore serves to attract tourists in tens of millions in number every year. Traditionally, tour guides or academic professionals who study these heritage monuments were responsible for providing information to the visitors regarding their architectural and historical significance. However, unfortunately this system has several caveats when considered on a large scale such as unavailability of sufficient trained people, lack of accurate information, failure to convey the richness of details in an attractive format etc. Recently, machine learning approaches revolving around the usage of monument pictures have been shown to be useful for rudimentary analysis of heritage sights. This paper serves as a survey of the research endeavors undertaken in this direction which would eventually provide insights for building an automated decision system that could be utilized to make the experience of tourism in India more modernized for visitors. △ Less

Submitted 29 July, 2021; originally announced July 2021.

Comments: Currently under review

arXiv:2107.14061 [pdf]

The Need and Status of Sea Turtle Conservation and Survey of Associated Computer Vision Advances

Authors: Aditya Jyoti Paul

Abstract: For over hundreds of millions of years, sea turtles and their ancestors have swum in the vast expanses of the ocean. They have undergone a number of evolutionary changes, leading to speciation and sub-speciation. However, in the past few decades, some of the most notable forces driving the genetic variance and population decline have been global warming and anthropogenic impact ranging from large-… ▽ More For over hundreds of millions of years, sea turtles and their ancestors have swum in the vast expanses of the ocean. They have undergone a number of evolutionary changes, leading to speciation and sub-speciation. However, in the past few decades, some of the most notable forces driving the genetic variance and population decline have been global warming and anthropogenic impact ranging from large-scale poaching, collecting turtle eggs for food, besides dum** trash including plastic waste into the ocean. This leads to severe detrimental effects in the sea turtle population, driving them to extinction. This research focusses on the forces causing the decline in sea turtle population, the necessity for the global conservation efforts along with its successes and failures, followed by an in-depth analysis of the modern advances in detection and recognition of sea turtles, involving Machine Learning and Computer Vision systems, aiding the conservation efforts. △ Less

Submitted 29 July, 2021; originally announced July 2021.

Comments: Currently under review

arXiv:2106.01739 [pdf]

Advances in Classifying the Stages of Diabetic Retinopathy Using Convolutional Neural Networks in Low Memory Edge Devices

Authors: Aditya Jyoti Paul

Abstract: Diabetic Retinopathy (DR) is a severe complication that may lead to retinal vascular damage and is one of the leading causes of vision impairment and blindness. DR broadly is classified into two stages - non-proliferative (NPDR), where there are almost no symptoms, except a few microaneurysms, and proliferative (PDR) involving a huge number of microaneurysms and hemorrhages, soft and hard exudates… ▽ More Diabetic Retinopathy (DR) is a severe complication that may lead to retinal vascular damage and is one of the leading causes of vision impairment and blindness. DR broadly is classified into two stages - non-proliferative (NPDR), where there are almost no symptoms, except a few microaneurysms, and proliferative (PDR) involving a huge number of microaneurysms and hemorrhages, soft and hard exudates, neo-vascularization, macular ischemia or a combination of these, making it easier to detect. More specifically, DR is usually classified into five levels, labeled 0-4, from 0 indicating no DR to 4 which is most severe. This paper firstly presents a discussion on the risk factors of the disease, then surveys the recent literature on the topic followed by examining certain techniques which were found to be highly effective in improving the prognosis accuracy. Finally, a convolutional neural network model is proposed to detect all the stages of DR on a low-memory edge microcontroller. The model has a size of just 5.9 MB, accuracy and F1 score both of 94% and an inference speed of about 20 frames per second. △ Less

Submitted 3 June, 2021; originally announced June 2021.

Comments: This paper is currently under review at IEEE MASCON 2021. http://ieeemascon.in

MSC Class: 68T45; 68T10; 68T07; 68U10 ACM Class: I.2.10; I.4.8; I.5.1; J.3; I.4.1; K.4.2

arXiv:2012.04156 [pdf]

doi 10.1109/RAICS51191.2020.9332470

An Efficient Analyses of the Behavior of One Dimensional Chaotic Maps using 0-1 Test and Three State Test

Authors: Joan S. Muthu, Aditya Jyoti Paul, P. Murali

Abstract: In this paper, a rigorous analysis of the behavior of the standard logistic map, Logistic Tent system (LTS), Logistic-Sine system (LSS) and Tent-Sine system (TSS) is performed using 0-1 test and three state test (3ST). In this work, it has been proved that the strength of the chaotic behavior is not uniform. Through extensive experiment and analysis, the strong and weak chaotic regions of LTS, LSS… ▽ More In this paper, a rigorous analysis of the behavior of the standard logistic map, Logistic Tent system (LTS), Logistic-Sine system (LSS) and Tent-Sine system (TSS) is performed using 0-1 test and three state test (3ST). In this work, it has been proved that the strength of the chaotic behavior is not uniform. Through extensive experiment and analysis, the strong and weak chaotic regions of LTS, LSS and TSS have been identified. This would enable researchers using these maps, to have better choices of control parameters as key values, for stronger encryption. In addition, this paper serves as a precursor to stronger testing practices in cryptosystem research, as Lyapunov exponent alone has been shown to fail as a true representation of the chaotic nature of a map. △ Less

Submitted 13 February, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

Comments: 6 pages, Published in IEEE RAICS 2020, see https://www.raics.in

MSC Class: 37H20; 34F10; 34H10; 49J15; 49K15; 47J15 ACM Class: G.1.0; G.1.2; G.1.3; G.2.3; G.4; C.3; E.3; I.6.4

Journal ref: 2020 IEEE Recent Advances in Intelligent Computational Systems (RAICS), 2020, pp. 125-130

arXiv:2011.14858 [pdf, other]

doi 10.1007/978-981-16-0749-3_52

A Tiny CNN Architecture for Medical Face Mask Detection for Resource-Constrained Endpoints

Authors: Puranjay Mohan, Aditya Jyoti Paul, Abhay Chirania

Abstract: The world is going through one of the most dangerous pandemics of all time with the rapid spread of the novel coronavirus (COVID-19). According to the World Health Organisation, the most effective way to thwart the transmission of coronavirus is to wear medical face masks. Monitoring the use of face masks in public places has been a challenge because manual monitoring could be unsafe. This paper p… ▽ More The world is going through one of the most dangerous pandemics of all time with the rapid spread of the novel coronavirus (COVID-19). According to the World Health Organisation, the most effective way to thwart the transmission of coronavirus is to wear medical face masks. Monitoring the use of face masks in public places has been a challenge because manual monitoring could be unsafe. This paper proposes an architecture for detecting medical face masks for deployment on resource-constrained endpoints having extremely low memory footprints. A small development board with an ARM Cortex-M7 microcontroller clocked at 480 Mhz and having just 496 KB of framebuffer RAM, has been used for the deployment of the model. Using the TensorFlow Lite framework, the model is quantized to further reduce its size. The proposed model is 138 KB post quantization and runs at the inference speed of 30 FPS. △ Less

Submitted 3 June, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

Comments: 11 pages, Published in Springer LNEE at http://link.springer.com/chapter/10.1007%2F978-981-16-0749-3_52

MSC Class: 68T45; 68T10; 68T07; 68U10 ACM Class: C.3; I.2.6; I.2.10; I.4.9; I.5.1; I.5.2; I.5.4; I.5.5; K.4.1; K.4.3

Journal ref: Innovations in Electrical and Electronic Engineering. Lecture Notes in Electrical Engineering, vol 756, pp 657-670, Springer, Singapore, 2021

Showing 1–5 of 5 results for author: Paul, A J