Search | arXiv e-print repository

Image-based Agarwood Resinous Area Segmentation using Deep Learning

Authors: Irwandi Hipiny, Johari Abdullah, Noor Alamshah Bolhassan

Abstract: Manual extraction of the Agarwood resinous compound is laborious, requires skilled workers, and is subject to human error. Commercial Agarwood industries have been actively exploring the use of Computer Numerical Control (CNC) machines to replace human effort for this task. The CNC machine accepts a G-code script produced from a binary image in which the wood region that needs to be chiselled off is marked with (0, 0, 0) as its RGB value. Rather than requiring a human expert to perform the region marking, we propose using a deep learning image segmentation method instead. Our setup involves a camera that captures the cross-section image and then passes the image file to a computer. The computer performs the automated image segmentation and feeds the CNC machine with a G-code script. In this article, we report the initial segmentation results achieved using a state-of-the-art deep learning segmentation method and discuss potential improvements to refine the segmentation accuracy.

Submitted 7 April, 2024; originally announced April 2024.

Comments: 15 pages, 6 figures, 3 tables
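The abstract above describes a binary image in which the region to be chiselled off is marked with the RGB value (0, 0, 0). A minimal sketch of how a segmentation output might be turned into such an image is given below. The function name `to_cnc_mask`, the per-pixel score format, the 0.5 threshold and the use of white for the remaining pixels are all assumptions made for illustration; none of them come from the paper.

```python
def to_cnc_mask(removal_scores, threshold=0.5):
    """Convert per-pixel removal scores into an RGB mask.

    removal_scores: 2D list of floats in [0, 1], where a higher value means
    the pixel is more likely to belong to the wood region to be removed
    (hypothetical output format of a segmentation model).
    """
    black = (0, 0, 0)        # marks wood to be chiselled off, as in the abstract
    white = (255, 255, 255)  # assumption: any non-black value for kept pixels
    return [
        [black if score >= threshold else white for score in row]
        for row in removal_scores
    ]


if __name__ == "__main__":
    scores = [[0.9, 0.1], [0.5, 0.4]]
    for row in to_cnc_mask(scores):
        print(row)
```

Generating the G-code script from this mask is a separate step that the abstract does not detail, so it is left out of the sketch.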

arXiv:2010.12199 [pdf]

The Analysis of Facial Feature Deformation using Optical Flow Algorithm

Authors: Dayang Nur Zulhijah Awang Jesemi, Hamimah Ujir, Irwandi Hipiny, Sarah Flora Samson Juan

Abstract: Facial features deform according to the intended facial expression. Specific facial features are associated with specific facial expressions, e.g., a happy expression involves deformation of the mouth. This paper presents a study of facial feature deformation for each facial expression using an optical flow algorithm, with the face segmented into three regions of interest. The deformation of facial features shows the relation between facial features and facial expression. Based on the experiments, the deformations of the eyes and mouth are significant in all expressions except happy. For the happy expression, the cheeks and mouth are the significant regions. This work also suggests that the intensity of different facial features varies in how it contributes to the recognition of different facial expression intensities. The maximum magnitude across all expressions is shown by the mouth for the surprise expression, at 9 × 10⁻⁴, while the minimum magnitude is shown by the mouth for the angry expression, at 0.4 × 10⁻⁴.

Submitted 12 November, 2020; v1 submitted 23 October, 2020; originally announced October 2020.

Comments: 9 pages

Journal ref: IJEECS, Vol. 15, No. 2, pp. 769-777 (2019)

arXiv:1909.11277 [pdf]

doi 10.5614/itbj.ict.res.appl.2018.12.3.4.

Towards Automated Biometric Identification of Sea Turtles (Chelonia mydas)

Authors: Irwandi Hipiny, Hamimah Ujir, Aazani Mujahid, Nurhartini Kamalia Yahya

Abstract: Passive biometric identification enables wildlife monitoring with minimal disturbance. Using a motion-activated camera placed at an elevated position and facing downwards, we collected images of sea turtle carapaces, each belonging to one of sixteen Chelonia mydas juveniles. We then learned co-variant and robust image descriptors from these images, enabling indexing and retrieval. In this work, we present several classification results of sea turtle carapaces using the learned image descriptors. We found that a template-based descriptor, i.e., Histogram of Oriented Gradients (HOG), performed considerably better during classification than keypoint-based descriptors. For our dataset, a high-dimensional descriptor is a must due to the minimal gradient and color information inside the carapace images. Using HOG, we obtained an average classification accuracy of 65%.

Submitted 23 June, 2021; v1 submitted 25 September, 2019; originally announced September 2019.

Comments: Published in Journal of ICT Research and Applications, [S.l.], v. 12, n. 3, p. 256-266, dec. 2018

Journal ref: Journal of ICT Research and Applications, [S.l.], v. 12, n. 3, p. 256-266, dec. 2018. ISSN 2338-5499

arXiv:1810.01564 [pdf]

Assessing Performance of Aerobic Routines using Background Subtraction and Intersected Image Region

Authors: Faustine John, Irwandi Hipiny, Hamimah Ujir, Mohd Shahrizal Sunar

Abstract: It is recommended for a novice to engage a trained and experienced person, i.e., a coach, before starting an unfamiliar aerobic or weight routine. The coach's task is to provide real-time feedback to ensure that the routine is performed in a correct manner. This greatly reduces the risk of injury and maximises physical gains. We present a simple image similarity measure based on intersected image region to assess a subject's performance of an aerobic routine. The method is implemented inside an Augmented Reality (AR) desktop app that employs a single RGB camera to capture still images of the subject as he or she progresses through the routine. The background-subtracted body pose image is compared against the exemplar body pose image (i.e., AR template) at specific intervals. Based on a limited dataset, our pose matching function is reported to have an accuracy of 93.67%.

Submitted 2 October, 2018; originally announced October 2018.

Comments: Presented at The International UNIMAS STEM Engineering Conference 2018 (ENCON2018). Accepted for publication in MATEC Web of Conferences
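The abstract above compares a background-subtracted body pose image against an exemplar pose using an intersected image region. One way such a measure could be sketched is shown below. The exact formula used in the paper is not stated in the abstract, so the intersection-over-union ratio, the function names and the 0.7 threshold are assumptions chosen for illustration.

```python
def intersected_region_score(pose_mask, template_mask):
    """Return |A ∩ B| / |A ∪ B| for two equally sized binary masks.

    Each mask is a 2D list of 0/1 values, where 1 marks a foreground
    (body) pixel left after background subtraction.
    """
    if len(pose_mask) != len(template_mask):
        raise ValueError("masks must have the same height")
    intersection = 0
    union = 0
    for pose_row, template_row in zip(pose_mask, template_mask):
        if len(pose_row) != len(template_row):
            raise ValueError("masks must have the same width")
        for p, t in zip(pose_row, template_row):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Two empty masks share no foreground, so score them as 0.0.
    return intersection / union if union else 0.0


def pose_matches(pose_mask, template_mask, threshold=0.7):
    """Hypothetical decision rule: accept the pose if the overlap is large enough."""
    return intersected_region_score(pose_mask, template_mask) >= threshold


if __name__ == "__main__":
    subject = [[1, 1, 0], [0, 1, 0]]
    exemplar = [[1, 0, 0], [0, 1, 1]]
    print(intersected_region_score(subject, exemplar))
```

A ratio normalised by the template area alone would be an equally plausible reading of "intersected image region"; the union is used here only because it penalises both missing and extra foreground.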

arXiv:1810.01562 [pdf]

Performance Evaluation of SIFT Descriptor against Common Image Deformations on Iban Plaited Mat Motifs

Authors: Silvia Joseph, Irwandi Hipiny, Hamimah Ujir

Abstract: Borneo indigenous communities are blessed with rich craft heritage. One such example is the Iban's plaited mat craft. There have been many efforts by UNESCO and the Sarawak Government to preserve and promote the craft. One such method is by developing a mobile app capable of recognising the different mat motifs. As a first step towards this aim, we present a novel image dataset consisting of seven mat motif classes. Each class possesses a unique variation of chevrons, diagonal shapes, and symmetrical, repetitive, geometric and non-geometric patterns. In this study, the performance of the Scale-Invariant Feature Transform (SIFT) descriptor is evaluated against five common image deformations, i.e., zoom and rotation, viewpoint, image blur, JPEG compression and illumination. Using our dataset, SIFT performed favourably on test sequences involving illumination changes, viewpoint changes, JPEG compression, and zoom and rotation. However, it did not perform well on image blur test sequences, retaining on average 1.61 percent of pairwise matches after blurring with a Gaussian kernel of radius 8.0.

Submitted 2 October, 2018; originally announced October 2018.

Comments: 14th International Borneo Research Council Conference, 6 to 8 August 2018, UNIMAS, Sarawak

arXiv:1712.00195 [pdf]

3D Facial Action Units Recognition for Emotional Expression

Authors: N. Hussain, H. Ujir, I. Hipiny, J-L Minoi

Abstract: Muscular activity causes the activation of certain Action Units (AUs) for every facial expression over a certain duration of the expression. This paper presents methods to recognise facial AUs using the facial distances of the facial features that activate the muscles. The seven facial action units involved are AU1, AU4, AU6, AU12, AU15, AU17 and AU25, which characterise happy and sad expressions. Recognition is performed on each AU according to rules defined on the distance of each facial point. The facial distances chosen are extracted from twelve facial features. The facial distances are then used to train a Support Vector Machine (SVM) and a Neural Network (NN). Classification results using SVM are presented for several different SVM kernels, while results using NN are presented for each of the training, validation and testing phases.

Submitted 1 December, 2017; originally announced December 2017.

Comments: To be published in Advanced Science Letters Volume 24 (ICCSE2017)

arXiv:1710.00189 [pdf]

doi 10.1109/ICSIPA.2017.8120669

Unsupervised Classification of Intrusive Igneous Rock Thin Section Images using Edge Detection and Colour Analysis

Authors: S. Joseph, H. Ujir, I. Hipiny

Abstract: Classification of rocks is one of the fundamental tasks in a geological study. The process requires a human expert to examine sampled thin section images under a microscope. In this study, we propose a method that uses microscope automation, digital image acquisition, edge detection and colour analysis (histogram). We collected 60 digital images from 20 standard thin sections using a digital camera mounted on a conventional microscope. Each image is partitioned into a finite number of cells that form a grid structure. The edge and colour profiles of the pixels inside each cell determine its classification. The individual cells then determine the thin section image classification via a majority voting scheme. Our method yielded precision as high as 90% to 100%.

Submitted 23 June, 2021; v1 submitted 30 September, 2017; originally announced October 2017.

Comments: Published in 2017 IEEE International Conference On Signal and Image Processing Applications
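The abstract above classifies each grid cell and then labels the whole thin section image by majority vote. A minimal sketch of that final voting step follows. The per-cell labels are assumed to be already available, the rock names in the example are illustrative placeholders rather than classes taken from the paper, and the tie-breaking behaviour is an assumption.

```python
from collections import Counter


def classify_by_majority(cell_labels):
    """Return (winning_label, vote_share) for a grid of per-cell labels.

    cell_labels: 2D list in which each entry is the class assigned to one
    grid cell (hypothetical output of the per-cell edge/colour analysis).
    """
    flat = [label for row in cell_labels for label in row]
    if not flat:
        raise ValueError("no cells to vote on")
    counts = Counter(flat)
    # On a tie, most_common keeps the label that was encountered first.
    label, votes = counts.most_common(1)[0]
    return label, votes / len(flat)


if __name__ == "__main__":
    grid = [
        ["granite", "granite", "diorite"],
        ["granite", "gabbro", "granite"],
    ]
    print(classify_by_majority(grid))
```

Returning the vote share alongside the label makes it possible to flag images where the winning class has only a narrow lead over the others.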

arXiv:1710.00187 [pdf]

doi 10.1109/ICSIPA.2017.8120635

Unsupervised Segmentation of Action Segments in Egocentric Videos using Gaze

Authors: I. Hipiny, H. Ujir, J. L. Minoi, S. F. Samson Juan, M. A. Khairuddin, M. S. Sunar

Abstract: Unsupervised segmentation of action segments in egocentric videos is a desirable feature in tasks such as activity recognition and content-based video retrieval. Reducing the search space into a finite set of action segments facilitates a faster and less noisy matching. However, there exists a substantial gap in machine understanding of natural temporal cuts during a continuous human activity. This work reports on a novel gaze-based approach for segmenting action segments in videos captured using an egocentric camera. Gaze is used to locate the region-of-interest inside a frame. By tracking two simple motion-based parameters inside successive regions-of-interest, we discover a finite set of temporal cuts. We present several results using combinations (of the two parameters) on a dataset, i.e., BRISGAZE-ACTIONS. The dataset contains egocentric videos depicting several daily-living activities. The quality of the temporal cuts is further improved by implementing two entropy measures.

Submitted 23 June, 2021; v1 submitted 30 September, 2017; originally announced October 2017.

Comments: Published in 2017 IEEE International Conference On Signal and Image Processing Applications

Showing 1–8 of 8 results for author: Hipiny, I