IJITCS Vol. 14, No. 1, 8 Feb. 2022
Cover page and Table of Contents: PDF (size: 343KB)
Full Text (PDF, 343KB), PP.1-8
Views: 0 Downloads: 0
Data normalization, data scaling, Anfis, classification, regression, scikit-learn
Machine learning and artificial intelligence techniques are more and more in our lives and studies in this field are increasing day by day. Data is vital for these studies. In order to draw meaningful conclusions from the available data, new methods are proposed and successful results are obtained. The preparation of the obtained data is very important in the studies to be carried out. Data preprocessing is very important in the preparation of data. The most critical stage of the data preprocessing process is the scaling or normalization of the data. Machine learning libraries such as scikit-learn and programming languages such as R provide the necessary libraries to scale data. However, it is not known exactly which normalization method will be applied and which will yield more successful results. The success of these normalization methods has been investigated on many different methods, but such a study has not been done on the adaptive neural fuzzy inference system (ANFIS). The aim of this study is to examine the success of normalization methods on ANFIS in terms of both classification and regression problems. So, for studies using the Anfis method, guidance will be provided on which normalization process will give better results in the data preprocessing stage. Four different normalization methods in the scikit-learn library were applied on the Diabets and Forestfire datasets in the UCI database. The results are presented separately for both classification and regression. It has been determined that min-max normalization in classification problems and working with original data in regression problems are more successful.
Mesut. Polatgil, "Investigation of the Effect of Normalization Methods on ANFIS Success: Forestfire and Diabets Datasets", International Journal of Information Technology and Computer Science(IJITCS), Vol.14, No.1, pp.1-8, 2021. DOI: 10.5815/ijitcs.2022.01.01
[1] S. Mohsin, Fuzzy Logic based Health Care System using Wireless Body Area Network Prakashgoud Patil, International Journal of Computer Applications. 80 (2013) 975–8887.
[2] M.A.S. Machado, T.D.R.G. Moreira, L.F.A.M. Gomes, A.M. Caldeira, D.J. Santos, A Fuzzy Logic Application in Virtual Education, Procedia Computer Science. 91 (2016) 19–26. https://doi.org/10.1016/J.PROCS.2016.07.037.
[3] N. Hachicha, B. Jarboui, P. Siarry, A fuzzy logic control using a differential evolution algorithm aimed at modelling the financial market dynamics, Information Sciences. 181 (2011) 79–91. https://doi.org/10.1016/J.INS.2010.09.010.
[4] T. Vasileva-Stojanovska, M. Vasileva, T. Malinovski, V. Trajkovik, An ANFIS model of quality of experience prediction in education, Applied Soft Computing. 34 (2015) 129–138. https://doi.org/10.1016/J.ASOC.2015.04.047.
[5] L. Sarangi, M. Narayan Mohanty, S. Patnaik ITER, Design of ANFIS Based E-Health Care System for Cardio Vascular Disease Detection, (2017). https://doi.org/10.1007/978-3-319-49568-2_63.
[6] S. Barak, J.H. Dahooie, T. Tichý, Wrapper ANFIS-ICA method to do stock market timing and feature selection on the basis of Japanese Candlestick, Expert Systems with Applications. 42 (2015) 9221–9235. https://doi.org/10.1016/J.ESWA.2015.08.010.
[7] T. Jayalakshmi, A. Santhakumaran, Statistical Normalization and Back Propagationfor Classification, International Journal of Computer Theory and Engineering. (2011) 89–93. https://doi.org/10.7763/IJCTE.2011.V3.288.
[8] Amit Pandey, Achin Jain,"Comparative Analysis of KNN Algorithm using Various Normalization Techniques", International Journal of Computer Network and Information Security(IJCNIS), Vol.9, No.11, pp.36-42, 2017.DOI: 10.5815/ijcnis.2017.11.04
[9] P. Cihan, O. Kalipsız, E. Gökçe, Hayvan Hastalığı Teşhisinde Normalizasyon Tekniklerinin Yapay Sinir Ağı Ve Özellik Seçim Performansına Etkisi, Turkish Studies. 12 (2017) 59–70. https://doi.org/10.7827/TurkishStudies.11902.
[10] S. Yavuz, M. Deveci, İstatiksel Normalizasyon Tekniklerinin Yapay Sinir Ağın Performansına Etkisi, Erciyes Üniversitesi İktisadi ve İdari Bilimler Fakültesi Dergisi. 0 (2015) 167–187. https://dergipark.org.tr/en/pub/erciyesiibd/78019 (accessed May 25, 2021).
[11] G. Aksu, C. Oktay Güzeller, M.T. Eser, The Effect of the Normalization Method Used in Different Sample Sizes on the Success of Artificial Neural Network Model, International Journal of Assessment Tools in Education. 6 (2019) 170–192. https://doi.org/10.21449/ijate.479404.
[12] B.K. Singh, N.I.T. Raipur, K. Verma, A.S. Thoke, Investigations on Impact of Feature Normalization Techniques on Classifier’s Performance in Breast Tumor Classification, International Journal of Computer Applications. 116 (2015) 975–8887.
[13] D. Singh, B. Singh, Investigating the impact of data normalization on classification performance, Applied Soft Computing. 97 (2020) 105524. https://doi.org/10.1016/J.ASOC.2019.105524.
[14] A. Ali, N. Senan, The Effect of Normalization in Violence Video Classification Performance, IOP Conference Series: Materials Science and Engineering. 226 (2017) 012082. https://doi.org/10.1088/1757-899X/226/1/012082.
[15] J.S.R. Jang, ANFIS: Adaptive-Network-Based Fuzzy Inference System, IEEE Transactions on Systems, Man and Cybernetics. 23 (1993) 665–685. https://doi.org/10.1109/21.256541.
[16] J.S. Roger Jang, C.T. Sun, Functional Equivalence Between Radial Basis Function Networks and Fuzzy Inference Systems, IEEE Transactions on Neural Networks. 4 (1993) 156–159. https://doi.org/10.1109/72.182710.
[17] I.S. Sitanggang, S. Roseli, L. Syaufina, Information Technology and Computer Science, Information Technology and Computer Science. 9 (2018) 13–20. https://doi.org/10.5815/ijitcs.2018.09.02.
[18] S. Islam Ayon, M. Milon Islam, Information Engineering and Electronic Business, Information Engineering and Electronic Business. 2 (2019) 21–27. https://doi.org/10.5815/ijieeb.2019.02.03.
[19] R. Vaitheeshwari, V. Sathieshkumar, Performance analysis of epileptic seizure detection system using neural network approach, ICCIDS 2019 - 2nd International Conference on Computational Intelligence in Data Science, Proceedings. (2019). https://doi.org/10.1109/ICCIDS.2019.8862158.
[20] C. Chen, N. Kitbutrawat, S. Kajita, H. Yamaguchi, T. Higashino, Modeling BLE propagation above the ceiling for smart HVAC systems, Proceedings - 2019 15th International Conference on Intelligent Environments, IE 2019. (2019) 68–71. https://doi.org/10.1109/IE.2019.00012.
[21] S.V. Khond, Effect of Data Normalization on Accuracy and Error of Fault Classification for an Electrical Distribution System, Https://Doi.Org/10.1080/23080477.2020.1799135. 8 (2020) 117–124. https://doi.org/10.1080/23080477.2020.1799135.
[22] K. Manimekalai, A. Kavitha, MISSING VALUE IMPUTATION AND NORMALIZATION TECHNIQUES IN MYOCARDIAL INFARCTION, ICTACT JOURNAL ON SOFT COMPUTING. (2018) 3. https://doi.org/10.21917/ijsc.2018.0230.
[23] J. Hyma, P. Reddy, A. Damodaram, Performance analysis of Heterogeneous Data Normalization with a New Privacy Metric, Dat a Mining Journal of Comput Er Science IJCSIS Journal of Comput Er Science IJCSIS Sept Ember. (2018). https://sites.google.com/site/ijcsis/ (accessed October 9, 2021).
[24] S. Kappal, Data Normalization Using Median Median Absolute Deviation MMAD based Z-Score for Robust Predictions vs. Min – Max Normalization, London Journal of Research in Science: Natural and Formal. (2019). https://research.journalspress.com/index.php/science/article/view/444 (accessed October 9, 2021).
[25] X.H. Cao, I. Stojkovic, Z. Obradovic, A robust data scaling algorithm to improve classification accuracies in biomedical data, BMC Bioinformatics 2016 17:1. 17 (2016) 1–10. https://doi.org/10.1186/S12859-016-1236-X.