Melda Pita Uli Sitompul
Opim Salim Sitompul
Zakarias Situmorang

Abstract

Clustering is a data mining method that groups data so that items in the same group share similar characteristics and differ from items in other groups. One widely used method is K-Means, which assigns each data point to the nearest cluster centroid by the shortest Euclidean distance. A weakness of K-Means is that the number of clusters must be determined beforehand; different choices produce different groupings and change the resulting cluster distribution. To overcome this issue, the elbow method is employed to assess the similarity level within clusters by comparing the Root Mean Square Standard Deviation (RMSSTD), which measures cluster homogeneity, with R-Squared (RS), which measures heterogeneity between clusters. The method considers how the RMSSTD and RS values change and where the two intersect. The difference between the RMSSTD of cluster 1 and cluster 2 was 0.066, and the difference between the RS of cluster 1 and cluster 2 was -0.304. Based on these figures, the largest difference was found at cluster 2. Accordingly, the tourist destinations in East Asia that are most frequently visited or of most interest to visitors are grouped into cluster 2, which comprises criteria 6, 7, 8, and 10: resort destinations, picnic areas, beaches, and religious institutions.
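The two validity indices named in the abstract can be illustrated with a minimal sketch. It assumes the commonly used definitions: RMSSTD as the pooled within-cluster standard deviation, and RS as the share of the total sum of squares explained by the clustering. The abstract does not give the authors' exact formulas, so the function names `sum_sq` and `rmsstd_and_rs` and the toy data are hypothetical.

```python
from math import sqrt

def sum_sq(points):
    """Sum of squared deviations from the mean, over all dimensions."""
    n = len(points)
    dims = len(points[0])
    means = [sum(p[j] for p in points) / n for j in range(dims)]
    return sum((p[j] - means[j]) ** 2 for p in points for j in range(dims))

def rmsstd_and_rs(points, labels):
    """Return (RMSSTD, RS) for a labelled partition (assumed definitions)."""
    # Group the points by their cluster label.
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    dims = len(points[0])
    ss_within = sum(sum_sq(c) for c in clusters.values())
    ss_total = sum_sq(points)
    # Pooled degrees of freedom: (n_i - 1) per dimension per cluster.
    dof = sum(dims * (len(c) - 1) for c in clusters.values())
    rmsstd = sqrt(ss_within / dof) if dof > 0 else 0.0
    rs = (ss_total - ss_within) / ss_total if ss_total > 0 else 0.0
    return rmsstd, rs

# Toy data (not from the paper): two well-separated groups.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
rmsstd_1, rs_1 = rmsstd_and_rs(points, [0, 0, 0, 0])  # one cluster
rmsstd_2, rs_2 = rmsstd_and_rs(points, [0, 0, 1, 1])  # two clusters
```

A lower RMSSTD indicates more homogeneous clusters and a higher RS indicates better separation between clusters. Under these assumptions, an elbow-style comparison looks at how both values change from one cluster count to the next, for example `rmsstd_1 - rmsstd_2` and `rs_1 - rs_2`.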

Article Details

How to Cite
Sitompul, M. P. U., Sitompul, O. S., & Situmorang, Z. (2022). Optimization of Determination Against K-Means Cluster Algorithm using Elbow Creation. Jurnal Teknik Informatika C.I.T Medicom, 14(1), 1–9. https://doi.org/10.35335/cit.Vol14.2022.176.pp1-9