Search | arXiv e-print repository

Robotic Pollination of Apples in Commercial Orchards

Authors: Ranjan Sapkota, Dawood Ahmed, Salik Ram Khanal, Uddhav Bhattarai, Changki Mo, Matthew D. Whiting, Manoj Karkee

Abstract: This research presents a novel robotic pollination system designed for targeted pollination of apple flowers in modern fruiting wall orchards. Developed in response to the challenges of global colony collapse disorder, climate change, and the need for sustainable alternatives to traditional pollinators, the system utilizes a commercial manipulator, a vision system, and a spray nozzle for pollen application. Initial tests in April 2022 pollinated 56% of the target flower clusters with at least one fruit with a cycle time of 6.5 s. Significant improvements were made in 2023, with the system accurately detecting 91% of available flowers and pollinating 84% of target flowers with a reduced cycle time of 4.8 s. This system showed potential for precision artificial pollination that can also minimize the need for labor-intensive field operations such as flower and fruitlet thinning.

Submitted 3 February, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

Comments: 2 pages, 1 figure

arXiv:2308.03998 [pdf, other]

Real-time Strawberry Detection Based on Improved YOLOv5s Architecture for Robotic Harvesting in open-field environment

Authors: Zixuan He, Salik Ram Khanal, Xin Zhang, Manoj Karkee, Qin Zhang

Abstract: This study proposed a YOLOv5-based custom object detection model to detect strawberries in an outdoor environment. The original architecture of YOLOv5s was modified by replacing the C3 module with the C2f module in the backbone network, which provided a better feature gradient flow. Secondly, the Spatial Pyramid Pooling Fast in the final layer of the backbone network of YOLOv5s was combined with Cross Stage Partial Net to improve the generalization ability over the strawberry dataset in this study. The proposed architecture was named YOLOv5s-Straw. The RGB image dataset of the strawberry canopy with three maturity classes (immature, nearly mature, and mature) was collected in an open-field environment and augmented through a series of operations including brightness reduction, brightness increase, and noise adding. To verify the superiority of the proposed method for strawberry detection in an open-field environment, four competitive detection models (YOLOv3-tiny, YOLOv5s, YOLOv5s-C2f, and YOLOv8s) were trained and tested under the same computational environment and compared with YOLOv5s-Straw. The results showed that the proposed architecture achieved the highest mean average precision of 80.3%, whereas YOLOv3-tiny, YOLOv5s, YOLOv5s-C2f, and YOLOv8s achieved 73.4%, 77.8%, 79.8%, and 79.3%, respectively. Specifically, the average precision of YOLOv5s-Straw was 82.1% in the immature class, 73.5% in the nearly mature class, and 86.6% in the mature class, which were 2.3% and 3.7%, respectively, higher than that of the latest YOLOv8s. The model included 8.6 × 10^6 network parameters with an inference time of 18 ms per image, whereas YOLOv8s was slower at 21.0 ms per image and heavier at 11.1 × 10^6 parameters, which indicates that the proposed model is fast enough for real-time strawberry detection and localization for robotic picking.

Submitted 12 October, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

Comments: 20 pages; 15 figures

arXiv:2307.00112 [pdf]

Performance of ChatGPT on USMLE: Unlocking the Potential of Large Language Models for AI-Assisted Medical Education

Authors: Prabin Sharma, Kisan Thapa, Dikshya Thapa, Prastab Dhakal, Mala Deep Upadhaya, Santosh Adhikari, Salik Ram Khanal

Abstract: Artificial intelligence is gaining traction in more ways than ever before. The popularity of language models and AI-based businesses has soared since ChatGPT was made available to the general public via OpenAI. It is becoming increasingly common for people to use ChatGPT both professionally and personally. Considering the widespread use of ChatGPT and the reliance people place on it, this study determined how reliable ChatGPT can be for answering complex medical and clinical questions. Harvard University gross anatomy along with the United States Medical Licensing Examination (USMLE) questionnaire were used to accomplish the objective. The paper evaluated the obtained results using a 2-way ANOVA and post hoc analysis. Both showed systematic covariation between format and prompt. Furthermore, the physician adjudicators independently rated the outcome's accuracy, concordance, and insight. As a result of the analysis, ChatGPT-generated answers were found to be more context-oriented and represented a better model for deductive reasoning than regular Google search results. Furthermore, ChatGPT obtained 58.8% on logical questions and 60% on ethical questions. This means that ChatGPT is approaching the passing range for logical questions and has crossed the threshold for ethical questions. The paper believes ChatGPT and other language learning models can be invaluable tools for e-learners; however, the study suggests that there is still room to improve their accuracy. In order to improve ChatGPT's performance in the future, further research is needed to better understand how it can answer different types of questions.

Submitted 27 July, 2023; v1 submitted 30 June, 2023; originally announced July 2023.

Comments: 12 pages, 4 Figures, 4 tables

arXiv:2306.14300 [pdf]

Screening Autism Spectrum Disorder in childrens using Deep Learning Approach : Evaluating the classification model of YOLOv8 by comparing with other models

Authors: Subash Gautam, Prabin Sharma, Kisan Thapa, Mala Deep Upadhaya, Dikshya Thapa, Salik Ram Khanal, Vítor Manuel de Jesus Filipe

Abstract: Autism spectrum disorder (ASD) is a developmental condition that presents significant challenges in social interaction, communication, and behavior. Early intervention plays a pivotal role in enhancing cognitive abilities and reducing autistic symptoms in children with ASD. Numerous clinical studies have highlighted distinctive facial characteristics that distinguish ASD children from typically developing (TD) children. In this study, we propose a practical solution for ASD screening from facial images using the YoloV8 model. By employing YoloV8, a deep learning technique, on a Kaggle dataset, we achieved exceptional results. Our model achieved a remarkable 89.64% accuracy in classification and an F1-score of 0.89. Our findings provide support for the clinical observations regarding facial feature discrepancies between children with ASD and TD children. The high F1-score obtained demonstrates the potential of deep learning models in screening children with ASD. We conclude that the newest version of YoloV8, which is usually used for object detection, can be used for the classification of autistic and non-autistic images.

Submitted 25 June, 2023; originally announced June 2023.

Comments: 17 pages, 12 figures

arXiv:2304.09351 [pdf]

Machine Vision System for Early-stage Apple Flowers and Flower Clusters Detection for Precision Thinning and Pollination

Authors: Salik Ram Khanal, Ranjan Sapkota, Dawood Ahmed, Uddhav Bhattarai, Manoj Karkee

Abstract: Early-stage identification of fruit flowers in both opened and unopened condition in an orchard environment provides significant information for performing crop load management operations such as flower thinning and pollination using automated and robotic platforms. These operations are important in tree-fruit agriculture to enhance fruit quality, manage crop load, and enhance the overall profit. Recent developments in agricultural automation suggest that this can be done using robotics that includes machine vision technology. In this article, we proposed a vision system that detects early-stage flowers in an unstructured orchard environment using the YOLOv5 object detection algorithm. For the robotics implementation, the position of a cluster of flower blossoms is important for navigating the robot and the end effector. The centroids of individual flowers (both open and unopen) were identified and associated with flower clusters via K-means clustering. Detection of opened and unopened flowers reached an mAP of up to 81.9% in commercial orchard images.
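
The association of flower centroids with clusters via K-means can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `kmeans`, the fixed number of clusters `k`, the random initialization, and the sample pixel coordinates are all assumptions made for illustration, and the abstract does not state how the number of clusters was chosen.

```python
import math
import random


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's algorithm on 2-D points; returns (centers, labels).

    points: list of (x, y) tuples, e.g. centres of detected flower boxes
            (hypothetical input, not the paper's data).
    """
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # initialise from k distinct input points
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each flower centroid goes to its nearest cluster centre.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points
        ]
        # Update step: move each centre to the mean of its assigned points.
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                mean_x = sum(x for x, _ in members) / len(members)
                mean_y = sum(y for _, y in members) / len(members)
                new_centers.append((mean_x, mean_y))
            else:
                new_centers.append(centers[c])  # keep the centre of an empty cluster
        if new_centers == centers:  # converged: assignments no longer change
            break
        centers = new_centers
    return centers, labels


# Two visually separate groups of flower centroids (made-up pixel coordinates).
flowers = [(10, 10), (12, 11), (11, 13), (200, 205), (202, 207), (198, 203)]
centers, labels = kmeans(flowers, k=2)
```

Each returned centre could then serve as a single target position for a flower cluster, which is the kind of information the abstract describes as useful for guiding the robot and end effector.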

Submitted 18 April, 2023; originally announced April 2023.

arXiv:2303.12974 [pdf]

Performance Analysis and Evaluation of Cloud Vision Emotion APIs

Authors: Salik Ram Khanal, Prabin Sharma, Hugo Fernandes, João Barroso, Vítor Manuel de Jesus Filipe

Abstract: Facial expression is a way of communication that can be used to interact with computers or other electronic devices, and the recognition of emotion from faces is an emerging practice with application in many fields. There are many cloud-based vision application programming interfaces available that recognize emotion from facial images and video. In this article, the performances of two well-known APIs were compared using a public dataset of 980 images of facial emotions. For these experiments, a client program was developed which iterates over the image set, calls the cloud services, and caches the results of the emotion detection for each image. The performance was evaluated in each class of emotions using prediction accuracy. It has been found that the prediction accuracy for each emotion varies according to the cloud service being used. Similarly, each service provider presents a strong variation of performance according to the class being analyzed, as can be seen in more detail in this article.
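
The per-class evaluation described above can be sketched as follows. This is only an illustration of computing prediction accuracy separately for each emotion class from cached results; the function name `per_class_accuracy`, the label strings, and the sample data are assumptions, and no real cloud API is called.

```python
from collections import defaultdict


def per_class_accuracy(true_labels, predicted_labels):
    """Return {emotion: fraction of images of that emotion predicted correctly}."""
    totals = defaultdict(int)   # number of images per true emotion class
    correct = defaultdict(int)  # number of correct predictions per class
    for truth, predicted in zip(true_labels, predicted_labels):
        totals[truth] += 1
        if truth == predicted:
            correct[truth] += 1
    return {emotion: correct[emotion] / totals[emotion] for emotion in totals}


# Made-up ground truth and cached predictions from one hypothetical service.
truth = ["happy", "happy", "sad", "sad", "sad", "angry"]
cached = ["happy", "sad", "sad", "sad", "happy", "angry"]
accuracy = per_class_accuracy(truth, cached)
```

Running the same function on the cached results of each service would give one accuracy table per provider, which is the form of comparison the abstract reports.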

Submitted 22 March, 2023; originally announced March 2023.

Comments: 10 pages, 6 figures

arXiv:2103.10080 [pdf]

doi 10.1088/2051-672X/abe71f

Comprehensive topography characterization of polycrystalline diamond coatings

Authors: Abhijeet Gujrati, Antoine Sanner, Subarna R. Khanal, Nicolaie Moldovan, Hongjun Zeng, Lars Pastewka, Tevis D. B. Jacobs

Abstract: The surface topography of diamond coatings strongly affects surface properties such as adhesion, friction, wear, and biocompatibility. However, the understanding of multi-scale topography, and its effect on properties, has been hindered by conventional measurement methods, which capture only a single length scale. Here, four different polycrystalline diamond coatings are characterized using transmission electron microscopy to assess the roughness down to the sub-nanometer scale. Then these measurements are combined, using the power spectral density (PSD), with conventional methods (stylus profilometry and atomic force microscopy) to characterize all scales of topography. The results demonstrate the critical importance of measuring topography across all length scales, especially because their PSDs cross over one another, such that a surface that is rougher at a larger scale may be smoother at a smaller scale and vice versa. Furthermore, these measurements reveal the connection between multi-scale topography and grain size, with characteristic scaling behavior at and slightly below the mean grain size, and self-affine fractal-like roughness at other length scales. At small (subgrain) scales, unpolished surfaces exhibit a common form of residual roughness that is self-affine in nature but difficult to detect with conventional methods. This approach of capturing topography from the atomic- to the macro-scale is termed comprehensive topography characterization, and all of the topography data from these surfaces has been made available for further analysis by experimentalists and theoreticians. Scientifically, this investigation has identified four characteristic regions of topography scaling in polycrystalline diamond materials.

Submitted 18 March, 2021; originally announced March 2021.

Comments: 13 pages, 6 figures

Journal ref: Surf. Topogr.: Metrol. Prop. 9, 014003 (2021)

arXiv:1909.12913 [pdf]

Student Engagement Detection Using Emotion Analysis, Eye Tracking and Head Movement with Machine Learning

Authors: Prabin Sharma, Shubham Joshi, Subash Gautam, Sneha Maharjan, Salik Ram Khanal, Manuel Cabral Reis, João Barroso, Vítor Manuel de Jesus Filipe

Abstract: With the increase of distance learning in general, and e-learning in particular, having a system capable of determining the engagement of students is of paramount importance, and one of the biggest challenges for teachers, researchers, and policy makers alike. Here, we present a system to detect the engagement level of the students. It uses only information provided by the typical built-in web-camera present in a laptop computer, and was designed to work in real time. We combine information about the movements of the eyes and head, and facial emotions to produce a concentration index with three classes of engagement: "very engaged", "nominally engaged" and "not engaged at all". The system was tested in a typical e-learning scenario, and the results show that it correctly identifies each period of time where students were "very engaged", "nominally engaged" and "not engaged at all". Additionally, the results also show that the students with the best scores also have higher concentration indexes.

Submitted 23 March, 2023; v1 submitted 18 September, 2019; originally announced September 2019.

Comments: 9 pages, 9 Figures, 2 tables

arXiv:1907.12491 [pdf]

doi 10.1073/pnas.1913126116

Linking energy loss in soft adhesion to surface roughness

Authors: Siddhesh Dalvi, Abhijeet Gujrati, Subarna R. Khanal, Lars Pastewka, Ali Dhinojwala, Tevis D. B. Jacobs

Abstract: A mechanistic understanding of adhesion in soft materials is critical in the fields of transportation (tires, gaskets, seals), biomaterials, micro-contact printing, and soft robotics. Measurements have long demonstrated that the apparent work of adhesion coming into contact is consistently lower than the intrinsic work of adhesion for the materials, and that there is adhesion hysteresis during separation, commonly explained by viscoelastic dissipation. Still lacking is a quantitative experimentally validated link between adhesion and measured topography. Here, we used in situ measurements of contact size to investigate the adhesion behavior of soft elastic polydimethylsiloxane (PDMS) hemispheres (modulus ranging from 0.7 to 10 MPa) on four different polycrystalline diamond substrates with topography characterized across eight orders of magnitude, including down to the Ångström-scale. The results show that the reduction in apparent work of adhesion is equal to the energy required to achieve conformal contact. Further, the energy loss during contact and removal is equal to the product of intrinsic work of adhesion and the true contact area. These findings provide a simple mechanism to quantitatively link the widely-observed adhesion hysteresis to roughness rather than viscoelastic dissipation.

Submitted 2 December, 2019; v1 submitted 29 July, 2019; originally announced July 2019.

Comments: Proceedings of the National Academy of Sciences (2019)

Showing 1–9 of 9 results for author: Khanal, S R