IJCAIT Vol 12, Issue 1, 2020



Educational Data Mining: A state-of-the-art survey on tools and techniques used in EDM
Authors: Ginika Mahajan, Bhavna Saini, India


Over the last decade, limitless data is generating in the field of education. To process this huge raw data, enormous potential is required beyond the manual and time consuming tasks.With the span of time, analytics and data mining is used to extract useful information from large data repositories. Educational Data Mining (EDM) exploits statistical, machine learning and data mining in the domain of education to analyze and predict the educational data using various approaches. EDM seeks to use online modes of learning to better understand learners and their learning, and develop computational approaches to analyze the facts and figures so as to benefit learners. Various machine learning algorithms and research tools are used in Educational Data Mining for analysis and prediction on different types of educational data. This paper presents a survey of applications and tools used in Educational Data Mining. Also, it presents detailed review of current trends in EDM where techniques and results of recent work done in this field are compared.


Keywords: Educational data mining; Data Mining; Applications;

References
[1] Hung, H. C., Liu, I. F., Liang, C. T., & Su, Y. S. (2020). Applying Educational Data Mining to Explore Students’ Learning Patterns in the Flipped Learning Approach for Coding Education. Symmetry, 12(2), 213.
[2] Bondoc, C. M. D., &Malawit, T. G. (2020). Classifying relevant video tutorials for the school’s learning management system using support vector machine algorithm. Global Journal of Engineering and Technology Advances, 2(3), 001-009.
[3] Romero, C., & Ventura, S. (2010). Educational data mining: a review of the state of the art. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 40(6), 601-618.
[4] Baker, R. S., &Inventado, P. S. (2014). Educational data mining and learning analytics. In Learning analytics (pp. 61-75). Springer, New York, NY.
[5] Maheshwari, E., Roy, C., Pandey, M., &Rautray, S. S. (2020). Prediction of Factors Associated with the Dropout Rates of Primary to High School Students in India Using Data Mining Tools. In Frontiers in Intelligent Computing: Theory and Applications (pp. 242-251). Springer, Singapore.
[6] Tsiakmaki, M., Kostopoulos, G., Kotsiantis, S., &Ragos, O. (2020). Implementing AutoML in Educational Data Mining for Prediction Tasks. Applied Sciences, 10(1), 90.
[7] Hou, H. T. (2015). Integrating cluster and sequential analysis to explore learners’ flow and behavioral patterns in a simulation game with situated-learning context for science courses: A video-based process exploration. Computers in human behavior, 48, 424-435.
[8] Acharya, A., & Sinha, D. (2014). Application of feature selection methods in educational data mining. International Journal of Computer Applications, 103(2). [9]Amrieh, E. A., Hamtini, T., &Aljarah, I. (2016). Mining educational data to predict student’s academic performance using ensemble methods. International Journal of Database Theory and Application, 9(8), 119-136.
[10] Romero, C., Ventura, S., & De Bra, P. (2004). Knowledge discovery with genetic programming for providing feedback to courseware authors. User Modeling and User-Adapted Interaction, 14(5), 425-464.
[11]Tsiakmaki, M., Kostopoulos, G., Koutsonikos, G., Pierrakeas, C., Kotsiantis, S., &Ragos, O. (2018, July). Predicting University Students' Grades Based on Previous Academic Achievements. In 2018 9th International Conference on Information, Intelligence, Systems and Applications (IISA) (pp. 1-6). IEEE.
[12] Romero, C., Romero, J. R., & Ventura, S. (2014). A survey on pre-processing educational data. In Educational data mining (pp. 29-64). Springer, Cham.
[13] Liu, S., &d'Aquin, M. (2017, April). Unsupervised learning for understanding student achievement in a distance learning setting. In 2017 IEEE Global Engineering Education Conference (EDUCON) (pp. 1373-1377). IEEE.
[14] Hofmann, M., &Klinkenberg, R. (Eds.). (2016). RapidMiner: Data mining use cases and business analytics applications. CRC Press.
[15] Berthold, M. R., Cebron, N., Dill, F., Gabriel, T. R., Kötter, T., Meinl, T., ... &Wiswedel, B. (2009). KNIME-the Konstanz information miner: version 2.0 and beyond. AcM SIGKDD explorations Newsletter, 11(1), 26-31.
[16] Demšar, J., Curk, T., Erjavec, A., Gorup, C., Hocevar, T., Milutinovic, M., ... &Štajdohar, M. (2013). Orange: data mining toolbox in Python. The Journal of Machine Learning Research, 14(1), 2349-2353.
[17] SPSS Inc. (2005). SPSS Base 14.0 user's guide. Prentice Hall.
[18] Alcalá-Fdez, J., Fernández, A., Luengo, J., Derrac, J., García, S., Sánchez, L., & Herrera, F. (2011). Keel data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. Journal of Multiple-Valued Logic & Soft Computing, 17.
[19] Meng, X., Bradley, J., Yavuz, B., Sparks, E., Venkataraman, S., Liu, D., ... & Xin, D. (2016). Mllib: Machine learning in apache spark. The Journal of Machine Learning Research, 17(1), 1235-1241.
[20] Rodrigo, M., Mercedes, T., d Baker, R. S., McLaren, B. M., Jayme, A., &Dy, T. T. (2012). Development of a Workbench to Address the Educational Data Mining Bottleneck. International Educational Data Mining Society.
[21] Zhu, N. Q. (2013). Data visualization with D3. js cookbook. Packt Publishing Ltd.
[22] Koedinger, K. R., Baker, R. S., Cunningham, K., Skogsholm, A., Leber, B., & Stamper, J. (2010). A data repository for the EDM community: The PSLC DataShop. Handbook of educational data mining, 43, 43-56.
[23] Sharma, Manik, Samriti Sharma, and Gurvinder Singh. "Performance Analysis of Statistical and Supervised Learning Techniques in Stock Data Mining." Data 3.4 (2018): 54.
[24] Kaur, P., Sharma, M., Mittal, M.: Big data and machine learning based secure healthcare framework. In: International Conference on Computational Intelligence and Data Science (ICCIDS 2018), Elsevier Procedia Computer Science, vol. 132, pp. 1049–1059 (2018).
[25] Ambigavathi, M., &Sridharan, D. (2020). A Survey on Big Data in Healthcare Applications. In Intelligent Communication, Control and Devices (pp. 755-763). Springer, Singapore.
[26] Sharma, Manik, and Prableen Kaur. "A Comprehensive Analysis of Nature-Inspired Meta-Heuristic Techniques for Feature Selection Problem." Archives of Computational Methods in Engineering (2020): 1-25.
[27] Pramanik, M.I., Lau, R.Y.K., Demirkan, H., Azad, M.A.K.: Smart health: big data enabled health paradigm within smart cities. J. Expert Syst. Appl. 87, 370–383 (2017).
[28] Sharma, R., Kamble, S. S., Gunasekaran, A., Kumar, V., & Kumar, A. (2020). A systematic literature review on machine learning applications for sustainable agriculture supply chain performance. Computers & Operations Research, 104926.
[29] Balducci, F., Impedovo, D. and Pirlo, G., 2018. Machine Learning Applications on Agricultural Datasets for Smart Farm Enhancement. Machines, 6(3), p.38. [30] Kaur, Prableen, and Manik Sharma. "Analysis of data mining and soft computing techniques in prospecting diabetes disorder in human beings: a review." Int. J. Pharm. Sci. Res 9 (2018): 2700-2719.
[31] Borodin, V., Bourtembourg, J., Hnaien, F. and Labadie, N., 2016. Handling uncertainty in agricultural supply chain management: A state of the art. European Journal of Operational Research, 254(2), pp.348-359.
[32] Sharma, R., Kamble, S. S., Gunasekaran, A., Kumar, V., & Kumar, A. (2020). A systematic literature review on machine learning applications for sustainable agriculture supply chain performance. Computers & Operations Research, 104926.
[33] Dinçer, H., Yüksel, S., Canbolat, Z. N., &Pinarbasi, F. (2020). Data Mining-Based Evaluating the Customer Satisfaction for the Mobile Applications: An Analysis on Turkish Banking Sector by Using IT2 Fuzzy DEMATEL. In Tools and Techniques for Implementing International E-Trading Tactics for Competitive Advantage (pp. 320-339). IGI Global.
[34] Sharma, M., G. Singh, and R. Singh. "Stark assessment of lifestyle based human disorders using data mining based learning techniques." IRBM 38.6 (2017): 305-324.
[35] Shahbazi, F. (2020). Using Decision Tree Classification Algorithm to Design and Construct the Credit Rating Model for Banking Customers. IOSR Journal of Business and Management, 21(3, Series 2), 24-28.
[36] Dixit, V. S., & Gupta, S. (2020). Personalized Recommender Agent for E-Commerce Products Based on Data Mining Techniques. In Intelligent Systems, Technologies and Applications (pp. 77-90). Springer, Singapore.
[37] Xu, D., Ruan, C., Korpeoglu, E., Kumar, S., &Achan, K. (2020, January). Product Knowledge Graph Embedding for E-commerce. In Proceedings of the 13th International Conference on Web Search and Data Mining (pp. 672-680).


© 2012 onwards International Journal of Computer Applications & Information Technology

International Journal of Computer Applications & Information Technology
All rights reserved are reserved to Chief Editor IJCAIT.

For any Technical Support editorijcait@gmail.com