Optimization of the Number of Clusters in Grouping Characteristics of Hajj Pilgrim Candidates in Tebing Tinggi City Using the K-Means Algorithm and Elbow Method

Authors

  • Mhd. Septian Universitas Pembangunan Panca Budi
  • Muhammad Irfan Sarif Universitas Pembangunan Panca Budi

Keywords:

K-Means, Elbow Method, Clustering, Hajj Pilgrims, Optimization, Data Mining, Tebing Tinggi

Abstract

Every year, thousands of prospective Hajj pilgrims from Tebing Tinggi City register and participate in manasik (Hajj rituals) guidance. The diversity of their characteristics, including age, gender, education, occupation, and frequency of guidance, demands different service strategies. This study aims to group prospective Hajj pilgrims based on demographic characteristics using the K-Means algorithm with cluster number optimization via the Elbow method. The data used is the registration data of prospective Hajj pilgrims for 2023 from KBIHU in Tebing Tinggi City, totaling **189 records**. The variables used include age, gender, education, and occupation. The results show that the optimal number of clusters is **K=3** with an inertia value of 152.47 at K=3. Cluster 1 (Elderly) is dominated by pilgrims aged over 50 years with secondary education and occupations as housewives/retirees. Cluster 2 (Productive Adults) consists of pilgrims aged 30-50 years with higher education and occupations as civil servants/private employees. Cluster 3 (Youth) comprises pilgrims under 30 years old with student status. This study provides recommendations for different guidance strategies for each cluster: an intensive approach with visual/audio media for Cluster 1, flexible blended learning for Cluster 2, and independent digital-based guidance for Cluster 3.

References

Herdiana, I., et al. (2025). A More Precise Elbow Method for Optimum K-means Clustering. arXiv preprint, arXiv:2502.00851.

Hidayat, T. (2022). Klasterisasi data jamaah umroh menggunakan metode K-Means Clustering (Studi Kasus di AET Travel Indonesia). Masters thesis, Universitas Putra Indonesia YPTK.

Kodinariya, T. M., & Makwana, P. R. (2013). Review on determining number of Cluster in K-Means Clustering. International Journal of Advanced Research in Computer Science and Management Studies, 1(6), 2321-7782.

Mardi, Y. (2017). Data Mining : Klasifikasi Menggunakan Algoritma C4.5. Jurnal Edik Informatika, 2(2), 213-219.

Muningsih, E., & Yogyakarta, A. B. S. I. (2017). Optimasi jumlah cluster k-means dengan metode elbow untuk pemetaan pelanggan. Prosiding Seminar Nasional ELINVO, September, 105--114.

Patro, S. G. K., & sahu, K. K. (2015). Normalization: A Preprocessing Stage. Iarjset, 20--22.

Purnima, B., & Arvind, K. (2014). EBK-Means: A Clustering Technique based on Elbow Method and K-Means in WSN. International Journal of Computer Applications, 105(9), 17--24.

Shella, D. M., & Eviyanti, A. (2025). Implementation of the K-Means Algorithm for Clustering the KBIH Jabal Nur Hajj Pilgrim Area. UMSIDA Preprints Server.

Wakhidah, N. (2010). Clustering Menggunakan K-Means Algorithm. Jurnal Transformatika, 8(1), 33.

Winaryanti, H. S., Hadi, H. P., & Rachmawanto, E. H. (2024). Customer Segmentation Using K-Means Clustering with RFM Method (Case Study: PT. Dewangga Travindo Semarang). Journal of Applied Intelligent System, 9(1).

Yusuf, A. (2025). Computational Implementation of Linear Reference Elbow (LRE) Method for Optimal Cluster Determination in k-Means Algorithm. Journal of Physics: Conference Series, 3139, 012038.

Downloads

Published

2025-10-27