Classification of Study Program Competitiveness Levels in SNBT Using the Random Forest Method Based on Capacity and Regional Distribution Data
Keywords:
Random Forest, SNBT Competitiveness, Study Program Classification, Seat Capacity, Regional Distribution.Abstract
The high level of competition in the Selection Nasional Berdasarkan Tes (SNBT) creates uncertainty for prospective students in choosing appropriate study programs, potentially leading to selection failure and uneven distribution of applicants across regions. This study aims to build a classification model to determine the competitiveness level of study programs in the SNBT by integrating data on capacity quotas and regional distributions using the Random Forest algorithm. The methodology follows a quantitative approach with a predictive computational workflow, utilizing secondary data from official national selection records. The dataset was automatically categorized into three tier Low, Medium, and High competitiveness using a quantile-based approach derived from available seats and application numbers. The predictive performance evaluated through a multi-class confusion matrix demonstrated an overall classification accuracy, precision, recall, and F1-score of 68.00% across all competitiveness tiers, indicating a balanced decision threshold free from class bias. Furthermore, feature importance analysis based on Mean Decrease Impurity revealed that institutional seat capacity is the primary computational driver, contributing 53.60% of the decision weight, followed by historical applicant volume at 29.40%, while localized geographic variables (provinces and regions) account for a combined 17.00%. These empirical findings confirm that while market supply-and-demand indicators dictate the baseline of SNBT competition, geographic factors function as critical spatial catalysts that mathematically refine student distribution choices.
References
Siregar, A. M., & Siregar, R. R. (2022). Analisis Minat Calon Mahasiswa dalam Memilih Program Studi pada Seleksi Nasional Berdasarkan Tes (SNBT). Jurnal Pendidikan dan Konseling (JPDK), 4(6), 1100-1108.https://doi.org/10.31004/jpdk.v4i6.8924
Pratama, M. A., & Setiawan, B. (2023). Faktor-Faktor yang Mempengaruhi Kegagalan Calon Mahasiswa dalam UTBK-SNBT: Sebuah Studi Eksploratif. Jurnal Ilmiah Edutic, 10(1), 45-54.https://doi.org/10.21107/edutic.v10i1.18942
Handayani, S., & Nugroho, A. (2021). Ketimpangan Spasial Distribusi Pendaftar Perguruan Tinggi Negeri di Indonesia. Jurnal Geografi dan Edukasi, 18(2), 89-98.https://doi.org/10.31227/osf.io/g4j8z
Lestari, D. P. (2024). Peran Guru BK dalam Strategi Pemilihan Program Studi Jalur SNBT Berdasarkan Peta Keketatan Prodi. Jurnal Bimbingan dan Konseling Terintegrasi, 6(1), 12-21.https://doi.org/10.24036/0024844-0-2024
Wijaya, K., & Ramadhan, F. (2023). Sistem Pendukung Keputusan Pemilihan Program Studi Menggunakan Metode Klasifikasi Data Historis Keketatan Seleksi. Jurnal Teknologi Informasi dan Sistem Informasi, 10(3), 321-330.https://doi.org/10.33965/jti.v10i3.4412
Suryadi, E., & Fatmawati, K. (2022). Penerapan Machine Learning di Bidang Pendidikan: Telaah Literatur Sistematis Terhadap Prediksi Kelulusan Siswa. Jurnal Edukasi Matematika dan Komputer, 4(2), 77-85.https://doi.org/10.37058/jemk.v4i2.5113
Gunawan, I., & Hidayat, R. (2021). Pengaruh Faktor Geografis dan Akreditasi terhadap Minat Pendaftar Perguruan Tinggi di Indonesia. Jurnal Analisis Pendidikan, 23(1), 54-66.https://doi.org/10.21831/jap.v23i1.39121
Rahmawati, A., & Utomo, S. (2023). Analisis Daya Tampung dan Distribusi Kewilayahan Terhadap Keketatan Seleksi Mahasiswa Baru. Jurnal Manajemen dan Kebijakan Pendidikan, 11(2), 145-156.https://doi.org/10.21831/jamp.v11i2.58913
Kusuma, W. A., & Saputra, D. (2024). Integrasi Pemetaan Spasial dalam Prediksi Keketatan Jurusan Perguruan Tinggi Menggunakan Data Terbuka Pemerintah. Jurnal Sains Data Indonesia, 5(1), 33-42.https://doi.org/10.46336/jsdi.v5i1.512
Chen, X., & Ishwaran, H. (2021). Random Forests in Educational Data Mining: A Review of Recent Applications and Performance. IEEE Transactions on Learning Technologies, 14(4), 512-525.https://doi.org/10.1109/TLT.2021.3102341
Wahyuni, S., & Rosmansyah, Y. (2022). Komparasi Algoritma Random Forest dan Naive Bayes untuk Klasifikasi Keketatan Seleksi PTN. Jurnal Ilmu Komputer dan Informatika, 8(2), 210-219.https://doi.org/10.22216/jiki.v8i2.7214
Hidayat, T., & Nugroho, S. (2023). Pemetaan Pola Pemilihan Perguruan Tinggi Berdasarkan Klaster Kewilayahan Menggunakan Algoritma Kombinasi. Jurnal Komputasi Akademik, 15(1), 102-114.https://doi.org/10.30865/klik.v4i2.1105
Ramadhan, R., & Putri, L. A. (2022). Pengaruh Kuota Daya Tampung dan Trend Minat Pendaftar Terhadap Rasio Keketatan Jalur Tes Nasional. Jurnal Evaluasi Pendidikan, 13(2), 185-194.https://doi.org/10.21009/jep.v13i2.29841
Utami, M. D., & Santoso, H. (2024). Analisis Spasial Aksesibilitas Geografis Terhadap Ketimpangan Pilihan Program Studi Favorit di PTN. Jurnal Geografi Indonesia, 36(1), 45-56.https://doi.org/10.22146/gji.84112
Saputra, A., & Arifin, Z. (2023). Aplikasi Data Mining Berbasis Prediksi untuk Membantu Calon Mahasiswa Menghindari Kegagalan Jalur UTBK. Jurnal Sistem Informasi Bisnis, 13(2), 204-213.https://doi.org/10.21456/vol13iss2pp204-213
Fitriani, N., & Rosadi, T. (2021). Klasifikasi Tingkat Persaingan Program Studi Menggunakan Pembelajaran Mesin Berbasis Komparasi Algoritma Klasifikasi. Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), 5(4), 670-678.https://doi.org/10.29207/resti.v5i4.3129
Alim, S., & Budiman, A. (2023). Penerapan Algoritma Random Forest dalam Penentuan Skala Prioritas Seleksi Berkas Akademis Otomatis. Jurnal Informatika Kedokteran dan Pendidikan, 11(3), 290-301.https://doi.org/10.35842/jtis.v11i3.467
Wati, M., & Indriyani, R. (2024). Penggunaan Metode Ensemble Random Forest untuk Optimasi Akurasi Klasifikasi Data Multivariat Pendidikan. Jurnal Algoritma dan Komputasi, 18(1), 12-23.https://doi.org/10.2312/algoritm.v18i1.7912
Kurniawan, D., & Wardani, N. K. (2022). Model Analitik Keketatan Pendaftaran Universitas Berdasarkan Parameter Kuota dan Demografi Menggunakan Pohon Keputusan Random Forest. Jurnal Tekno-Insentif, 16(2), 134-145.https://doi.org/10.36787/jti.v16i2.641
Setiawan, H., & Baskoro, F. (2023). Pemanfaatan Variabel Kewilayahan dalam Model Klasifikasi Spasial Berbasis Machine Learning: Sebuah Tinjauan Sistematis. Jurnal Telematika dan Komputasi Terapan, 7(2), 99-111.https://doi.org/10.31328/jointecs.v7i2.4132
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Robin Antoni, Muhammad Iqbal

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.




