Search | arXiv e-print repository

Mastery Guided Non-parametric Clustering to Scale-up Strategy Prediction

Authors: Anup Shakya, Vasile Rus, Deepak Venugopal

Abstract: Predicting the strategy (sequence of concepts) that a student is likely to use in problem-solving helps Adaptive Instructional Systems (AISs) better adapt themselves to different types of learners based on their learning abilities. This can lead to a more dynamic, engaging, and personalized experience for students. To scale up training a prediction model (such as LSTMs) over large-scale education… ▽ More Predicting the strategy (sequence of concepts) that a student is likely to use in problem-solving helps Adaptive Instructional Systems (AISs) better adapt themselves to different types of learners based on their learning abilities. This can lead to a more dynamic, engaging, and personalized experience for students. To scale up training a prediction model (such as LSTMs) over large-scale education datasets, we develop a non-parametric approach to cluster symmetric instances in the data. Specifically, we learn a representation based on Node2Vec that encodes symmetries over mastery or skill level since, to solve a problem, it is natural that a student's strategy is likely to involve concepts in which they have gained mastery. Using this representation, we use DP-Means to group symmetric instances through a coarse-to-fine refinement of the clusters. We apply our model to learn strategies for Math learning from large-scale datasets from MATHia, a leading AIS for middle-school math learning. Our results illustrate that our approach can consistently achieve high accuracy using a small sample that is representative of the full dataset. Further, we show that this approach helps us learn strategies with high accuracy for students at different skill levels, i.e., leveraging symmetries improves fairness in the prediction model. △ Less

Submitted 4 January, 2024; originally announced January 2024.

Comments: Proceedings of 37th AAAI Conference on Artificial Intelligence Artificial Intelligence for Education. arXiv admin note: substantial text overlap with arXiv:2308.03892

arXiv:2401.02703 [pdf, other]

Verifying Relational Explanations: A Probabilistic Approach

Authors: Abisha Thapa Magar, Anup Shakya, Somdeb Sarkhel, Deepak Venugopal

Abstract: Explanations on relational data are hard to verify since the explanation structures are more complex (e.g. graphs). To verify interpretable explanations (e.g. explanations of predictions made in images, text, etc.), typically human subjects are used since it does not necessarily require a lot of expertise. However, to verify the quality of a relational explanation requires expertise and is hard to… ▽ More Explanations on relational data are hard to verify since the explanation structures are more complex (e.g. graphs). To verify interpretable explanations (e.g. explanations of predictions made in images, text, etc.), typically human subjects are used since it does not necessarily require a lot of expertise. However, to verify the quality of a relational explanation requires expertise and is hard to scale-up. GNNExplainer is arguably one of the most popular explanation methods for Graph Neural Networks. In this paper, we develop an approach where we assess the uncertainty in explanations generated by GNNExplainer. Specifically, we ask the explainer to generate explanations for several counterfactual examples. We generate these examples as symmetric approximations of the relational structure in the original data. From these explanations, we learn a factor graph model to quantify uncertainty in an explanation. Our results on several datasets show that our approach can help verify explanations from GNNExplainer by reliably estimating the uncertainty of a relation specified in the explanation. △ Less

Submitted 5 January, 2024; originally announced January 2024.

Comments: Published in Proceedings of 2023 IEEE Conference on Big Data

arXiv:2312.08287 [pdf, other]

doi 10.1109/ICDM58522.2023.00165

On the verification of Embeddings using Hybrid Markov Logic

Authors: Anup Shakya, Abisha Thapa Magar, Somdeb Sarkhel, Deepak Venugopal

Abstract: The standard approach to verify representations learned by Deep Neural Networks is to use them in specific tasks such as classification or regression, and measure their performance based on accuracy in such tasks. However, in many cases, we would want to verify more complex properties of a learned representation. To do this, we propose a framework based on a probabilistic first-order language, nam… ▽ More The standard approach to verify representations learned by Deep Neural Networks is to use them in specific tasks such as classification or regression, and measure their performance based on accuracy in such tasks. However, in many cases, we would want to verify more complex properties of a learned representation. To do this, we propose a framework based on a probabilistic first-order language, namely, Hybrid Markov Logic Networks (HMLNs) where we specify properties over embeddings mixed with symbolic domain knowledge. We present an approach to learn parameters for the properties within this framework. Further, we develop a verification method to test embeddings in this framework by encoding this task as a Mixed Integer Linear Program for which we can leverage existing state-of-the-art solvers. We illustrate verification in Graph Neural Networks, Deep Knowledge Tracing and Intelligent Tutoring Systems to demonstrate the generality of our approach. △ Less

Submitted 13 December, 2023; originally announced December 2023.

Comments: 6 pages, Proceedings of 23rd IEEE International Conference on Data Mining 2023 (ICDM'23)

arXiv:2308.03892 [pdf, other]

doi 10.5281/zenodo.8115669

Scalable and Equitable Math Problem Solving Strategy Prediction in Big Educational Data

Authors: Anup Shakya, Vasile Rus, Deepak Venugopal

Abstract: Understanding a student's problem-solving strategy can have a significant impact on effective math learning using Intelligent Tutoring Systems (ITSs) and Adaptive Instructional Systems (AISs). For instance, the ITS/AIS can better personalize itself to correct specific misconceptions that are indicated by incorrect strategies, specific problems can be designed to improve strategies and frustration… ▽ More Understanding a student's problem-solving strategy can have a significant impact on effective math learning using Intelligent Tutoring Systems (ITSs) and Adaptive Instructional Systems (AISs). For instance, the ITS/AIS can better personalize itself to correct specific misconceptions that are indicated by incorrect strategies, specific problems can be designed to improve strategies and frustration can be minimized by adapting to a student's natural way of thinking rather than trying to fit a standard strategy for all. While it may be possible for human experts to identify strategies manually in classroom settings with sufficient student interaction, it is not possible to scale this up to big data. Therefore, we leverage advances in Machine Learning and AI methods to perform scalable strategy prediction that is also fair to students at all skill levels. Specifically, we develop an embedding called MVec where we learn a representation based on the mastery of students. We then cluster these embeddings with a non-parametric clustering method where we progressively learn clusters such that we group together instances that have approximately symmetrical strategies. The strategy prediction model is trained on instances sampled from these clusters. This ensures that we train the model over diverse strategies and also that strategies from a particular group do not bias the DNN model, thus allowing it to optimize its parameters over all groups. Using real world large-scale student interaction datasets from MATHia, we implement our approach using transformers and Node2Vec for learning the mastery embeddings and LSTMs for predicting strategies. We show that our approach can scale up to achieve high accuracy by training on a small sample of a large dataset and also has predictive equality, i.e., it can predict strategies equally well for learners at diverse skill levels. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: 12 pages, 7 figures Published as a full paper in the 16th International Conference on Educational Data Mining 2023

arXiv:2210.05425 [pdf]

COVID-19-related Nepali Tweets Classification in a Low Resource Setting

Authors: Rabin Adhikari, Safal Thapaliya, Nirajan Basnet, Samip Poudel, Aman Shakya, Bishesh Khanal

Abstract: Billions of people across the globe have been using social media platforms in their local languages to voice their opinions about the various topics related to the COVID-19 pandemic. Several organizations, including the World Health Organization, have developed automated social media analysis tools that classify COVID-19-related tweets into various topics. However, these tools that help combat the… ▽ More Billions of people across the globe have been using social media platforms in their local languages to voice their opinions about the various topics related to the COVID-19 pandemic. Several organizations, including the World Health Organization, have developed automated social media analysis tools that classify COVID-19-related tweets into various topics. However, these tools that help combat the pandemic are limited to very few languages, making several countries unable to take their benefit. While multi-lingual or low-resource language-specific tools are being developed, they still need to expand their coverage, such as for the Nepali language. In this paper, we identify the eight most common COVID-19 discussion topics among the Twitter community using the Nepali language, set up an online platform to automatically gather Nepali tweets containing the COVID-19-related keywords, classify the tweets into the eight topics, and visualize the results across the period in a web-based dashboard. We compare the performance of two state-of-the-art multi-lingual language models for Nepali tweet classification, one generic (mBERT) and the other Nepali language family-specific model (MuRIL). Our results show that the models' relative performance depends on the data size, with MuRIL doing better for a larger dataset. The annotated data, models, and the web-based dashboard are open-sourced at https://github.com/naamiinepal/covid-tweet-classification. △ Less

Submitted 11 October, 2022; originally announced October 2022.

Comments: Accepted at the 7th Social Media Mining for Health (#SMM4H) Workshop, co-located at Coling 2022

arXiv:2103.06049 [pdf]

Search Disaster Victims using Sound Source Localization

Authors: Abhish Khanal, Deepak Chand, Prakash Chaudhary, Subash Timilsina, Sanjeeb Prasad Panday, Aman Shakya, Rom Kant Pandey

Abstract: Sound Source Localization (SSL) are used to estimate the position of sound sources. Various methods have been used for detecting sound and its localization. This paper presents a system for stationary sound source localization by cubical microphone array consisting of eight microphones placed on four vertical adjacent faces which is mounted on three wheel omni-directional drive for the inspection… ▽ More Sound Source Localization (SSL) are used to estimate the position of sound sources. Various methods have been used for detecting sound and its localization. This paper presents a system for stationary sound source localization by cubical microphone array consisting of eight microphones placed on four vertical adjacent faces which is mounted on three wheel omni-directional drive for the inspection and monitoring of the disaster victims in disaster areas. The proposed method localizes sound source on a 3D space by grid search method using Generalized Cross Correlation Phase Transform (GCC-PHAT) which is robust when operating in real life scenario where there is lack of visibility. The computed azimuth and elevation angle of victimized human voice are fed to embedded omni-directional drive system which navigates the vehicle automatically towards the stationary sound source. △ Less

Submitted 10 March, 2021; originally announced March 2021.

Comments: 9 pages, 17 figures, 17th ISCRAM Conference Blacksburg, VA, USA

Journal ref: Iscram 2020 1022-1030

arXiv:2101.08215 [pdf]

SAR and Optical data fusion based on Anisotropic Diffusion with PCA and Classification using Patch-based with LBP

Authors: Achala Shakya, Mantosh Biswas, Mahesh Pal

Abstract: SAR (VV and VH polarization) and optical data are widely used in image fusion to use the complimentary information of each other and to obtain the better-quality image (in terms of spatial and spectral features) for the improved classification results. This paper uses anisotropic diffusion with PCA for the fusion of SAR and optical data and patch-based SVM Classification with LBP (LBP-PSVM). Fusio… ▽ More SAR (VV and VH polarization) and optical data are widely used in image fusion to use the complimentary information of each other and to obtain the better-quality image (in terms of spatial and spectral features) for the improved classification results. This paper uses anisotropic diffusion with PCA for the fusion of SAR and optical data and patch-based SVM Classification with LBP (LBP-PSVM). Fusion results with VV polarization performed better than VH polarization using considered fusion method. For classification, the performance of LBP-PSVM using S1 (VV) with S2, S1 (VH) with S2 is compared with SVM classifier (without patch) and PSVM classifier (with patch), respectively. Classification results suggests that the LBP-PSVM classifier is more effective in comparison to SVM and PSVM classifiers for considered data. △ Less

Submitted 20 January, 2021; originally announced January 2021.

arXiv:2004.02168 [pdf, other]

Comparative Analysis of Multiple Deep CNN Models for Waste Classification

Authors: Dipesh Gyawali, Alok Regmi, Aatish Shakya, Ashish Gautam, Surendra Shrestha

Abstract: Waste is a wealth in a wrong place. Our research focuses on analyzing possibilities for automatic waste sorting and collecting in such a way that helps it for further recycling process. Various approaches are being practiced managing waste but not efficient and require human intervention. The automatic waste segregation would fit in to fill the gap. The project tested well known Deep Learning Netw… ▽ More Waste is a wealth in a wrong place. Our research focuses on analyzing possibilities for automatic waste sorting and collecting in such a way that helps it for further recycling process. Various approaches are being practiced managing waste but not efficient and require human intervention. The automatic waste segregation would fit in to fill the gap. The project tested well known Deep Learning Network architectures for waste classification with dataset combined from own endeavors and Trash Net. The convolutional neural network is used for image classification. The hardware built in the form of dustbin is used to segregate those wastes into different compartments. Without the human exercise in segregating those waste products, the study would save the precious time and would introduce the automation in the area of waste management. Municipal solid waste is a huge, renewable source of energy. The situation is win-win for both government, society and industrialists. Because of fine-tuning of the ResNet18 Network, the best validation accuracy was found to be 87.8%. △ Less

Submitted 14 August, 2020; v1 submitted 5 April, 2020; originally announced April 2020.

Comments: 6 pages, 13 figures

Journal ref: 5th International Conference on Advanced Engineering and ICT-Convergence 2020

arXiv:2002.09901 [pdf]

A Nepali Rule Based Stemmer and its performance on different NLP applications

Authors: Pravesh Koirala, Aman Shakya

Abstract: Stemming is an integral part of Natural Language Processing (NLP). It's a preprocessing step in almost every NLP application. Arguably, the most important usage of stemming is in Information Retrieval (IR). While there are lots of work done on stemming in languages like English, Nepali stemming has only a few works. This study focuses on creating a Rule Based stemmer for Nepali text. Specifically,… ▽ More Stemming is an integral part of Natural Language Processing (NLP). It's a preprocessing step in almost every NLP application. Arguably, the most important usage of stemming is in Information Retrieval (IR). While there are lots of work done on stemming in languages like English, Nepali stemming has only a few works. This study focuses on creating a Rule Based stemmer for Nepali text. Specifically, it is an affix strip** system that identifies two different class of suffixes in Nepali grammar and strips them separately. Only a single negativity prefix (Na) is identified and stripped. This study focuses on a number of techniques like exception word identification, morphological normalization and word transformation to increase stemming performance. The stemmer is tested intrinsically using Paice's method and extrinsically on a basic tf-idf based IR system and an elementary news topic classifier using Multinomial Naive Bayes Classifier. The difference in performance of these systems with and without using the stemmer is analysed. △ Less

Submitted 23 February, 2020; originally announced February 2020.

Comments: 5 pages, 2 figures, 3 tables

ACM Class: I.7.2

Journal ref: Proceedings of the 4th International IT Conference on ICT with Smart Computing and 9th National Students' Conference on Information Technology, (NaSCoIT 2018), Kathmandu, Nepal, ISSN No 2505-1075, pp. 16 (December 2018)

arXiv:2002.07783 [pdf, ps, other]

doi 10.1063/5.0004862

Dynamic percolation of ferromagnetic regions in phase separated manganites using non-uniform electric fields

Authors: Ambika Shakya, Amlan Biswas

Abstract: Thin films of the manganite (La$_{1-y}$Pr$_y$)$_{1-x}$Ca$_x$MnO$_3$ exhibit dynamic phase coexistence with micrometer scale, fluid-like ferromagnetic metallic (FMM) regions interspersed in a charge-order insulating (COI) background. It has been previously reported that a uniform electric field realigns the fluid-like FMM regions due to a phenomenon similar to dielectrophoresis. Here we report that… ▽ More Thin films of the manganite (La$_{1-y}$Pr$_y$)$_{1-x}$Ca$_x$MnO$_3$ exhibit dynamic phase coexistence with micrometer scale, fluid-like ferromagnetic metallic (FMM) regions interspersed in a charge-order insulating (COI) background. It has been previously reported that a uniform electric field realigns the fluid-like FMM regions due to a phenomenon similar to dielectrophoresis. Here we report that non-uniform electric fields have a stronger effect on the FMM regions as expected from the dielectrophoresis model. The dynamic percolation of the FMM regions is observed over a wider range of temperatures compared to the results in a uniform field. Additionally, in a non-uniform electric field, the time required for dynamic percolation along the magnetic hard axis ($t_{\mathrm{B}}$) decreased with increasing applied voltage ($V_{\mathrm{A}}$) as a power law, $V_{\mathrm{A}}^{-δ}$ with $δ\approx 5$ while $δ< 2$ for a uniform electric field. Our results in a non-uniform electric field provide strong evidence in favor of the dielectrophoresis model and a unique method for manipulating micrometer-sized ferromagnetic regions using electric fields. △ Less

Submitted 18 February, 2020; originally announced February 2020.

Comments: 6 pages, 5 figures

Showing 1–10 of 10 results for author: Shakya, A