Validitas dan reliabilitas instrumen asesmen kinerja literasi sains pelajaran fisika berbasis STEM
Supahar Supahar, Universitas Negeri Yogyakarta, Indonesia
Abstract
Kata kunci:validitas isi, validitas empiris, asesmen kinerja, literasi sains, STEM
VALIDITY AND RELIABILITY INSTRUMENT OF SCIENTIFIC LITERACY PERFORMANCE ASSESSMENT IN PHYSICS TEACHING BASED ON STEM
Abstract
This research is part of the development of scientific literacy performance assessment based on STEM in teaching physics. The aim of this research is to reveal the validity (content and also empiric) and reliability of scientific literacy performance assessment instrument based on STEM. The kind of instruments were developed are observational sheet and multiple choice test. The content validity of observational sheet was revealed by used the Aiken’s V Coefficient. The content validity of multiple choice tests was revealed by used Content Validity Index (CVI) which proposed by Lawshe. The empirical validity and reliability of multiple choice tests was revealed by used Item Response Theory Analysis. The reliability of observational sheet was revealed by used ICC (Item Correlation Coefficient) Analysis. The results of this study are the validity from the contents and empirical trials from the developed instruments. The observation sheet from scoring rubric and self-assessment has been valid with Aiken’s V value that exceeds the standard of 0,75. The reliability of the scoring rubric has Alfa Reliability> 0.8 and Excellent of ICC. Validity values from The written test is shown with CVI of 1 and the MNSQ INFIT value which match to the Rasch model. Based on the TIC and SEM graphs, the written test is stated to be reliable for use in students with moderate to high categories (-0.7 to 6.7). STEM-based Science Literacy performance assessment with caloric material is appropriate to use.
Keywords: content validity, empirical validity, performance assessment, scientific literacy, STEM
Keywords
Full Text:
Fulltext PDFReferences
Aiken, L. R. (1985). Three Coefficients for Analyzing the Reliability and Validity of Ratings. Educational and Psychological Measurement, 45(1), 131–142. https://doi.org/10.1177/0013164485451012
Azwar, S. (2012). Reliabiltas dan validitas(4th ed.). Yogyakarta: Pustaka Pelajar.
Azwar, S. (2015). Metode penelitian. Yogyakarta: Pustaka Pelajar.
Badan Standar Nasional Pendidikan. (2006). Panduan penyusunan kurikulum tingkat satuan pendidikan jenjang pendidikan dasar dan menengah. Jakarta: BSNP. Retrieved from http://bsnp-indonesia.org/id/wp-content/uploads/kompetensi/Panduan_Umum_KTSP.pdf
Bashooir, K., & Supahar, S. (2016). Analisis aspek kinerja literasi sainspada materi kalor Fisika. UPEJ Unnes Physics Education Journal, 5(1). Retrieved from https://journal.unnes.ac.id/sju/index.php/upej/article/view/12711
Breiner, J. M., Harkness, S. S., Johnson, C. C., & Koehler, C. M. (2012). What is STEM? A discussion about conceptions of STEM in education and partnerships. School Science and Mathematics, 112(1), 3–11. https://doi.org/10.1111/j.1949-8594.2011.00109.x
Chiappetta, E. L., & Koballa, T. R. (2010). Science instruction in the middle and secondary schools: developing fundamental knowledge and skills (7th ed.). USA: Pearson Education, Inc.
Cicchetti, D. V. (1994). Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychological Assessment, 6(4), 284–290. https://doi.org/10.1037/1040-3590.6.4.284
Departemen Pendidikan dan Kebudayaan. (2001). Kamus besar bahasa Indonesia (3rd ed.). Jakarta: Balai Pustaka.
Department of Education. (2009). Report of the STEM review. Retrieved from https://www.education-ni.gov.uk/sites/default/files/publications/de/Report of the STEM Review 2009_1.PDF
Gonzalez, H. B., & Kuenzi, J. J. (2012). Science, technology, engineering, and mathematics (STEM) education: a primer. Retrieved from https://fas.org/sgp/crs/misc/R42642.pdf
Hernandez, P. R., Bodin, R., Elliott, J. W., Ibrahim, B., Rambo-Hernandez, K. E., Chen, T. W., & de Miranda, M. A. (2014). Connecting the STEM dots: measuring the effect of an integrated engineering design intervention. International Journal of Technology and Design Education, 24(1), 107–120. https://doi.org/10.1007/s10798-013-9241-0
Ismail, I., Permanasari, A., & Setiawan, W. (2016). Efektivitas virtual lab berbasis STEM dalam meningkatkan literasi sains siswa dengan perbedaan gender. Jurnal Inovasi Pendidikan IPA, 2(2), 190. https://doi.org/10.21831/jipi.v2i2.8570
Kartowagiran, B., & Jaedun, A. (2016). Model asesmen autentik untuk menilai hasil belajar siswa sekolah menengah pertama (SMP): implementasi asesmen autentik di SMP. Jurnal Penelitian Dan Evaluasi Pendidikan, 20(2), 131. https://doi.org/10.21831/pep.v20i2.10063
Lawshe, C. H. (1975). A quantitative approach to content validity. Personnel Psychology, 28(4), 563–575. https://doi.org/10.1111/j.1744-6570.1975.tb01393.x
Mardapi, D. (2012). Pengukuran, penilaian dan evaluasi pendidikan. Yogyakarta: Nuha Medika.OECD. (2014). PISA 2012 results: what students know and can do student performance in mathematics, reading and science volume I. Paris: OECD Publishing.
P21. (2009). 21st century skills map.Retrieved from http://www.p21.org/storage/documents/21st_century_skills_english_map.pdf
Presiden Republik Indonesia. Undang-Undang Republik Indonesia nomor 20 tahun 2003 tentang Sistem Pendidikan Nasional (2003). Indonesia.Reeve, E. M. (2013). Implementing science, technology, mathematics, and engineering (STEM) education in Thailand and in ASEAN. Retrieved from http://dpst-apply.ipst.ac.th/specialproject/images/IPST_Global/document/Implementing STEM in ASEAN -IPST May 7 2013 -Final.pdf
Retnawati, H. (2016). Validitas reliabilitas dan karakteristik butir. Yogyakarta: Parama Publishing.
Shrout, P. E., & Fleiss, J. L. (1979). Intraclass correlations: uses in assessing rater reliability. Psychological Bulletin, 86(2), 420–428. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/18839484
Streiner, D. L. (2003). Starting at the beginning: an introduction to coefficient alpha and internal consistency. Journal of Personality Assessment, 80(1), 99–103. https://doi.org/10.1207/S15327752JPA8001_18
Subali, B., & Suyata, P. (2012). Pengembangan item tes konvergen dan divergen: penyelidikan validitasnya Secara empiris. Yogyakarta: Diandra Pustaka Indonesia.
Sumintono, B., & Widhiarso, W. (2015). Aplikasi pemodelan RASCH pada assessment pendidikan. Cimahi: Tim Komunikata Publishing House.
Supahar, & Prasetyo, Z. K. (2015). Pengembangan instrumen penilaian kinerja kemampuan inkuiri peserta didik pada mata pelajaran fisika SMA. Jurnal Penelitian Dan Evaluasi Pendidikan, 19(1), 96–108. Retrieved from https://journal.uny.ac.id/index.php/jpep/article/view/4560
Supahar, S. (2014). The estimation of inquiry performance test items of high school physics subject with quest program. In International Conference on Research, Implementation And Education of Mathematics And Sciences. Yogyakarta: Yogyakarta State University.
Supahar, S. (2015). Applying content validity ratios (CVR) to the quantitative content validity of physics learning achievement tests. In International Conference on Research, Implementation And Education of Mathematics And Sciences. Yogyakarta: Yogyakarta State University.
Wagner, T. (2008). The global achievement gap. New York: Basic Book.
Yore, L. D., & Treagust, D. F. (2006). Current realities and future possibilities: language and science literacy—empowering research and informing instruction. International Journal of Science Education, 28(2–3), 291–314. https://doi.org/10.1080/09500690500336973
DOI: https://doi.org/10.21831/pep.v22i2.19590
Refbacks
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Find Jurnal Penelitian dan Evaluasi Pendidikan on:
ISSN 2338-6061 (online) || ISSN 2685-7111 (print)
View Journal Penelitian dan Evaluasi Pendidikan Visitor Statistics