Analyzing Students’ Answers Using Association Rule Mining Based on Feature Selection
Journal of Southwest Jiaotong University 53.5 (2018)
Posted: 17 Mar 2019
Date Written: 2018
Educational Institutions tend to find ways to improve academic performance by implementing different analytical tools and techniques. The explosive size of educational databases has many advantages and one of them is they can be used to enhance the performance of all academic staff and the institutions in the result. Educational Data Mining (EDM) as a new trend of data mining has faced a huge number of researches and tested almost all data mining techniques and algorithms which result in emerging enhancements in all educational fields. One of the most important functions in data mining is association rules mining. The association rules mining finds the association between dataset features and produces the association as rules. The resulting rules describe many hidden patterns inside the dataset and this feature is very helpful in EDM. Since all educational databases have many features which make the resulting association rules too many, so there is a need for using Feature Selection (FS) algorithm to reduce the features and use important features only. FS has become increasingly important as the size and dimensionality of educational datasets increase. FS is one of most important data mining research areas. It aims to identify several features that describe the dataset better than the original set of features. This objective can be achieved by removing redundant and less correlated features according to importance criteria in FS. Various selection algorithms have been proposed for filter-based FS. ReliefF is one of the most important algorithms that have been successfully implemented in many FS applications. This study presents a new model that can be utilized by any university that seeks to improve the quality of education by analyzing data and identifying factors that affect academic results to increase students’ chances of success. We focus on Relief-based algorithms (RBAs) according to the technique of mining association rules. The main concept of the proposed algorithm is to find features that are closely correlative with the class attribute through the mining of association rules. Experimental results based on several real datasets confirm that the proposed model can obtain a smaller and promising feature subset compared with other models and techniques. The outcomes of this research can help improve policies in higher education.
Keywords: Educational Data Mining (EDM), Association Rules, Apriori, Feature Selection, ReliefF, Weka
Suggested Citation: Suggested Citation