Iliou, Theodoros; Konstantopoulou, Georgia; Lymperopoulou, Christina; Anastasopoulos, Konstantinos; Anastassopoulos, George; Margounakis, Dimitrios; Lymberopoulos, Dimitrios
In: Artificial Intelligence Applications and Innovations, pp. 512–519, Springer International Publishing, Cham, 2019, ISBN: 978-3-030-19823-7.
As real world data tends to be incomplete, noisy and inconsistent, data preprocessing is an important issue for data mining. Data preparation includes data cleaning, data integration, data transformation and data reduction. In this paper, Iliou preprocessing method is compared with Principal Component Analysis in suicide prediction according to family history. The dataset consists of 360 students, aged 18 to 24, who were experiencing family history problems. The performance of Iliou and Principal Component Analysis data preprocessing methods was evaluated using the 10-fold cross validation method assessing ten classification algorithms, IB1, J48, Random Forest, MLP, SMO, JRip, RBF, Na"ive Bayes, AdaBoostM1 and HMM, respectively. Experimental results illustrate that Iliou data preprocessing algorithm outperforms Principal Component Analysis data preprocessing method, achieving 100% against 71.34% classification performance, respectively. According to the classification results, Iliou preprocessing method is the most suitable for suicide prediction.