Theoretical Review: Understanding Test Reliability in Learning
DOI:
https://doi.org/10.59525/gej.1669
Keywords:
Learning Evaluation, Psychometrics, Reliability
Abstract
Evaluation is a crucial instrument in the instructional cycle of the Independent Curriculum, both for measuring the achievement of learning objectives and for providing pedagogical feedback. In practice, however, producing valid and consistent evaluation tools remains difficult: test scores fluctuate because measurement error is poorly controlled. This article examines the basic concept of reliability in depth, illustrates how it works, and reconstructs the techniques used to estimate the reliability coefficient of a test instrument. The method is library research, analyzing classic and contemporary literature in educational evaluation and psychometrics with a descriptive-qualitative approach. The review indicates that reliability reflects the consistency, dependability, and stability of measurement results, and that it can be estimated through three main approaches: stability (test-retest), equivalence (parallel forms), and internal consistency (such as split-half, KR-20/KR-21, and Cronbach's alpha). A simulated split-half calculation using the Spearman-Brown formula on a multiple-choice test yielded a reliability coefficient of 0.79. Under Guilford's criteria this value falls in the high category, and it exceeds the minimum threshold of 0.70 for learning evaluation instruments, so the instrument can be considered reliable.
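The split-half estimate mentioned in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' actual calculation: the response matrix below is hypothetical, the odd/even item split is one common convention assumed here, and the category bands in `guilford_category` follow the cut-offs commonly attributed to Guilford in evaluation textbooks, with the boundary handling chosen for this example. The Spearman-Brown step itself is the standard correction, r_full = 2 r_half / (1 + r_half).

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation between two equal-length score lists."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman_brown(r_half):
    """Step a half-test correlation up to a full-length reliability estimate."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(responses):
    """Split items into odd and even positions, correlate the half scores,
    then apply the Spearman-Brown correction.

    responses: one row per student, one 0/1 entry per item.
    """
    odd_scores = [sum(row[0::2]) for row in responses]   # items 1, 3, 5, ...
    even_scores = [sum(row[1::2]) for row in responses]  # items 2, 4, 6, ...
    return spearman_brown(pearson(odd_scores, even_scores))


def guilford_category(r):
    """Assumed bands (commonly cited form of Guilford's criteria)."""
    if r >= 0.80:
        return "very high"
    if r >= 0.60:
        return "high"
    if r >= 0.40:
        return "moderate"
    if r >= 0.20:
        return "low"
    return "very low"


# Hypothetical data: 6 students x 6 multiple-choice items (1 = correct).
responses = [
    [1, 1, 1, 1, 1, 0],
    [1, 1, 1, 0, 1, 1],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 1, 1, 0],
    [1, 0, 0, 0, 0, 0],
    [0, 0, 0, 1, 0, 0],
]

r_full = split_half_reliability(responses)
print(f"split-half reliability: {r_full:.2f} ({guilford_category(r_full)})")
```

The abstract does not report the half-test correlation behind its 0.79 result. Under the formula above, a half-test correlation of roughly 0.65 would step up to about 0.79, which `guilford_category` places in the "high" band.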
License
Copyright (c) 2026 Yunisa Habibah, Ilda Pulungan, Rizki Hannum, Nabila Zahra NST, Nabilla Chairunnisa, Ahmad Adiwan Bincar

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



