Is there any item or test bias in the Business English Test at Universitas Terbuka?
Downloads
Ahmadi, A., & Jalili, T. (2014). A confirmatory study of Differential Item Functioning on EFL reading comprehension. Applied Research on English Language, 3(2), 55–68.
Andrich, D., & Marais, I. (2019). Fit of Responses to the Model III—Differential Item Functioning (pp. 199–208). https://doi.org/10.1007/978-981-13-7496-8_16
Argianti, A., & Retnawati, H. (2020). Characteristics of Math national-standardized school exam test items in junior high school: What must be considered? Jurnal Penelitian Dan Evaluasi Pendidikan, 24(2). https://doi.org/10.21831/pep.v24i2.32547
Babu, N., & Kohli, P. (2023). Commentary: Reliability in research. Indian Journal of Ophthalmology, 71(2), 400. https://doi.org/10.4103/ijo.IJO_2016_22
Bademci, V. (2022). Correcting Fallacies about Validity as the Most Fundamental Concept in Educational and Psychological Measurement. International E-Journal of Educational Studies, 6(12), 148–154. https://doi.org/10.31458/iejes.1140672
Baker, F. B., & Kim, S.-H. (2017). Item Characteristic Curve Models (pp. 17–34). https://doi.org/10.1007/978-3-319-54205-8_2
Balluerka, N., Plewis, I., Gorostiaga, A., & Padilla, J.-L. (2014). Examining Sources of DIF in Psychological and Educational Assessment Using Multilevel Logistic Regression. Methodology, 10(2), 71–79. https://doi.org/10.1027/1614-2241/a000076
Bilyakovska, O. (2022). TEST AS AN EFFECTIVE MEANS OF ASSESSING THE QUALITY OF STUDENTS’ KNOWLEDGE. Academic Notes Series Pedagogical Science, 1(204), 16–20. https://doi.org/10.36550/2415-7988-2022-1-204-16-20
Bodea, C., & Kerner, A. (2021). The Gender Credibility Gap: All-Male Boards and Substantive Gender Representation in Central Banking. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.3780220
Bormanaki, H. B., & Ajideh, P. (2022). Item performance across native language groups on the Iranian National University Entrance English Exam: a nationwide study. Language Testing in Asia, 12(1), 29. https://doi.org/10.1186/s40468-022-00185-2
Büyükkidik, S. (2023). Purification procedures used for the detection of gender DIF: Item bias in a foreign language test. International Journal of Assessment Tools in Education, 10(4), 765–780. https://doi.org/10.21449/ijate.1250358
Cai, L. S., & Albano, A. D. (2018). Examining Sources of Gender DIF in Mathematics Knowledge of Future Teachers Using Cross-Classified IRT Models. In Exploring the Mathematical Education of Teachers Using TEDS-M Data (pp. 543–561). Springer International Publishing. https://doi.org/10.1007/978-3-319-92144-0_19
Canay, I. A., Mogstad, M., & Mountjoy, J. (2022). On the Use of Outcome Tests for Detecting Bias in Decision Making. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.4156834
Chubbuck, K., Curley, W. E., & King, T. C. (2016). Who’s on First? Gender Differences in Performance on the SAT ® Test on Critical Reading Items With Sports and Science Content. ETS Research Report Series, 2016(2), 1–116. https://doi.org/10.1002/ets2.12109
Dewi, D. M., Saingan, A. F., & Fahmi, Y. (2022). Kontribusi Teknologi Informasi dan Komunikasi terhadap Rata-Rata Lama Sekolah di Pulau Jawa. PAKAR Pendidikan, 20(1), 24–36. https://doi.org/10.24036/pakar.v20i1.248
Dewi, H. H., Damio, S. M., & Sukarno, S. (2023). Item analysis of reading comprehension questions for English proficiency test using Rasch model. REID (Research and Evaluation in Education), 9(1), 24–36. https://doi.org/10.21831/reid.v9i1.53514
Diouf, I., & Pépin, D. (2017). Gender and central banking. Economic Modelling, 61, 193–206. https://doi.org/10.1016/j.econmod.2016.12.006
Dubbelman, M. A., Verrijp, M., Facal, D., Sánchez‐Benavides, G., Brown, L. J. E., der Flier, W. M., Jokinen, H., Lee, A., Leroi, I., Lojo‐Seoane, C., Milošević, V., Molinuevo, J. L., Pereiro Rozas, A. X., Ritchie, C., Salloway, S., Stringer, G., Zygouris, S., Dubois, B., Epelbaum, S., … Sikkes, S. A. M. (2020). The influence of diversity on the measurement of functional impairment: An international validation of the Amsterdam IADL Questionnaire in eight countries. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring, 12(1). https://doi.org/10.1002/dad2.12021
Effiom, A. P. (2021). Test fairness and assessment of differential item functioning of mathematics achievement test for senior secondary students in Cross River state, Nigeria using item response theory. Global Journal of Educational Research, 20(1), 55–62. https://doi.org/10.4314/gjedr.v20i1.6
Eliaumra, E., Samaela, D. P., & Muhdin, N. K. (2022). Developing diagnostic test assessment to measure creative thinking skills of Biology preservice teacher students. REID (Research and Evaluation in Education), 8(2), 152–168. https://doi.org/10.21831/reid.v8i2.50885
Förster, M., & Happ, R. (2019). The relationship among gender, interest in economic topics, media use, and the economic knowledge of students at vocational schools. Citizenship, Social and Economics Education, 18(3), 143–157. https://doi.org/10.1177/2047173419892209
Fulcher, G. (2016). Context and Inference in Language Testing. In The Dynamic Interplay between Context and the Language Learner (pp. 225–241). Palgrave Macmillan UK. https://doi.org/10.1057/9781137457134_12
Gibbons, L., Crane, P. K., Mehta, K. M., Pedraza, O., Tang, Y., Manly, J. J., Narasimhalu, K., Teresi, J., Jones, R. N., & Mungas, D. (2011). Multiple, correlated covariates associated with differential item functioning (DIF): Accounting for language DIF when education levels differ across languages. Ageing Research, 2(1), 4. https://doi.org/10.4081/ar.2011.e4
Granocchio, E., De Salvatore, M., Bonanomi, E., & Sarti, D. (2023). Sex‐related differences in reading achievement. Journal of Neuroscience Research, 101(5), 668–678. https://doi.org/10.1002/jnr.24913
Hagquist, C., & Andrich, D. (2017). Recent advances in analysis of differential item functioning in health research using the Rasch model. Health and Quality of Life Outcomes, 15(1), 181. https://doi.org/10.1186/s12955-017-0755-0
Hagquist, C., Due, P., Torsheim, T., & Välimaa, R. (2019). Cross-country comparisons of trends in adolescent psychosomatic symptoms – a Rasch analysis of HBSC data from four Nordic countries. Health and Quality of Life Outcomes, 17(1), 27. https://doi.org/10.1186/s12955-019-1097-x
Hirnstein, M., Stuebs, J., Moè, A., & Hausmann, M. (2023). Sex/Gender Differences in Verbal Fluency and Verbal-Episodic Memory: A Meta-Analysis. Perspectives on Psychological Science, 18(1), 67–90. https://doi.org/10.1177/17456916221082116
Ito, S., Nagao, H., Kurokawa, T., Kasuya, T., & Inoue, J. (2019). Bayesian inference of grain growth prediction via multi-phase-field models. Physical Review Materials, 3(5), 053404. https://doi.org/10.1103/PhysRevMaterials.3.053404
Joo, S.-H., Lee, P., & Stark, S. (2022). Bayesian Approaches for Detecting Differential Item Functioning Using the Generalized Graded Unfolding Model. Applied Psychological Measurement, 46(2), 98–115. https://doi.org/10.1177/01466216211066606
Kans, M., & Claesson, L. (2022). Gender-Related Differences for Subject Interest and Academic Emotions for STEM Subjects among Swedish Upper Secondary School Students. Education Sciences, 12(8), 553. https://doi.org/10.3390/educsci12080553
Kheder, K., & Rouabhia, R. (2023). GENDER DIFFERENCES IN LEARNING LANGUAGES. European Journal of Applied Linguistics Studies, 6(2). https://doi.org/10.46827/ejals.v6i2.456
Kruchinina, O. V., Stankova, E. P., & Galperina, E. I. (2020). Development of Spatiotemporal EEG Organization in Males and Females Aged 8–30 Years during Comprehension of Oral and Written Texts. Human Physiology, 46(3), 244–256. https://doi.org/10.1134/S036211972003010X
Kruger, D. J. (2008). Male Financial Consumption is Associated with Higher Mating Intentions and Mating Success. Evolutionary Psychology, 6(4), 147470490800600. https://doi.org/10.1177/147470490800600407
Leventhal, B., & Gregg, N. (2022). Reliability and Measurement Error. In Reliability and Measurement Error. Routledge. https://doi.org/10.4324/9781138609877-REE28-1
Li, L., & Becker, B. J. (2021). Assessing Differential Bundle Functioning Using Meta‐Analysis. Journal of Educational Measurement, 58(4), 492–514. https://doi.org/10.1111/jedm.12303
Lord, F. M. (1980). Applications of Item Response Theory To Practical Testing Problems. Routledge. https://doi.org/10.4324/9780203056615
Mach, T. (2023). Literatur im DaF-Unterricht: einige kritische Anmerkungen. AUC PHILOLOGICA, 2022(3), 119–133. https://doi.org/10.14712/24646830.2023.6
Magdolen, M., Behren, S. von, Hobusch, J., Chlond, B., & Vortisch, P. (2020). Comparison of Response Bias in an Intercultural Context – Evaluation of Psychological Items in Travel Behavior Research. Transportation Research Procedia, 48, 2891–2905. https://doi.org/10.1016/j.trpro.2020.08.231
Makkink, A. W., & Vincent-Lambert, C. (2020). The development of ‘SATLAB’: A tool designed to limit assessment bias in simulation-based learning. South African Journal of Pre-Hospital Emergency Care, 1(1), 26–34. https://doi.org/10.24213/1-1-3024
Moradi, E., Ghabanchi, Z., & Pishghadam, R. (2022). Reading comprehension test fairness across gender and mode of learning: insights from IRT-based differential item functioning analysis. Language Testing in Asia, 12(1), 39. https://doi.org/10.1186/s40468-022-00192-3
Nedungadi, S., Brown, C. E., & Paek, S. H. (2022). Differential Item Functioning Analysis of the Fundamental Concepts for Organic Reaction Mechanisms Inventory. Journal of Chemical Education, 99(8), 2834–2842. https://doi.org/10.1021/acs.jchemed.2c00242
Nurrahman, A., Sukirno, S., Pratiwi, D. S., Iskandar, J., Rahim, A., & Rahmaini, I. S. (2022). Developing student social attitude self-assessment instruments: A study in vocational high school. REID (Research and Evaluation in Education), 8(1), 1–12. https://doi.org/10.21831/reid.v8i1.45100
Otaya, L. G., Kartowagiran, B., & Retnawati, H. (2020). The construct validity and reliability of the lesson plan assessment instrument in primary schools. Jurnal Prima Edukasia, 8(2), 126–134. https://doi.org/10.21831/jpe.v8i2.33275
Otok, B. W., Suharsono, A., Purhadi, Standsyah, R. E., & Azies, H. Al. (2021). A meta confirmatory factor analysis of the underdeveloped areas in the Java Island. 020002. https://doi.org/10.1063/5.0059540
Patnala, V., Salla, G. R., Prabhakar, S., Singh, R. P., & Annapureddy, V. (2024). Analysing the Grain size and asymmetry of the particle distribution using auto-correlation technique. Applied Physics A, 130(3), 191. https://doi.org/10.1007/s00339-024-07332-x
Penfield, R. D., & Camilli, G. (2006). 5 Differential Item Functioning and Item Bias (pp. 125–167). https://doi.org/10.1016/S0169-7161(06)26005-X
Prieto, G., & Nieto, E. (2014). Influence of DIF on differences in performance of Italian and Asian individuals on a reading comprehension test of Spanish as a foreign language (negative emotionality) in Hong Kong. Journal of Applied Measurement, 15(2), 176–188.
Ra, J., & Rhee, K. J. (2018). Detection of Gender related DIF in the Foreign Language Classroom Anxiety Scale. Educational Sciences: Theory & Practice. https://doi.org/10.12738/estp.2018.1.0606
Rasooli, A., Zandi, H., & DeLuca, C. (2019). Conceptualising fairness in classroom assessment: exploring the value of organisational justice theory. Assessment in Education: Principles, Policy & Practice, 26(5), 584–611. https://doi.org/10.1080/0969594X.2019.1593105
Retnawati, H. (2016). Proving content validity of self-regulated learning scale (The comparison of Aiken index and expanded Gregory index). REID (Research and Evaluation in Education), 2(2), 155–164. https://doi.org/10.21831/reid.v2i2.11029
Rinaldi, P., Pasqualetti, P., Volterra, V., & Caselli, M. C. (2023). Gender differences in early stages of language development. Some evidence and possible explanations. Journal of Neuroscience Research, 101(5), 643–653. https://doi.org/10.1002/jnr.24914
Roever, C. (2007). DIF in the Assessment of Second Language Pragmatics. Language Assessment Quarterly, 4(2), 165–189. https://doi.org/10.1080/15434300701375733
Sauer, J., Sonderegger, A., & Hoyos Álvarez, M. A. (2018). The influence of cultural background of test participants and test facilitators in online product evaluation. International Journal of Human-Computer Studies, 111, 92–100. https://doi.org/10.1016/j.ijhcs.2017.12.001
Setiawan, A., Cendana, W., Ayres, M., Yuldashev, A. A., & Setyawati, S. P. (2023). Development and validation of a self-assessment-based instrument to measure elementary school students’ attitudes in online learning. REID (Research and Evaluation in Education), 9(2), 184–197. https://doi.org/10.21831/reid.v9i2.52083
Setiawati, F. A., Ayriza, Y., Retnowati, E., & Amelia, R. N. (2017). The Response Patterns of the Career Interest Instrument Based on Holland’s Theory. ANIMA Indonesian Psychological Journal, 32(3), 128–147. https://doi.org/10.24123/aipj.v32i3.628
Shykhnenko, K. I. (2020). OPTIMISING ASSESSMENT SYSTEM IN THE ESP COURSE THROUGH THE USE of THE METHODS OF DIFFERENTIAL ITEM FUNCTIONING AND DIFFERENTIAL TEST FUNCTIONING IN FINAL TEST DESIGN. Zhytomyr Ivan Franko State University Journal. Рedagogical Sciences, 2(101), 156–165. https://doi.org/10.35433/pedagogy.2(101).2020.156-165
Sumin, S., Sukmawati, F., & Nurdin, N. (2022). Gender differential item functioning on the Kentucky Inventory of Mindfulness Skills instrument using logistic regression. REID (Research and Evaluation in Education), 8(1), 55–66. https://doi.org/10.21831/reid.v8i1.50809
Szmańda, J. B., & Witkowski, K. (2021). Morphometric Parameters of Krumbein Grain Shape Charts—A Critical Approach in Light of the Automatic Grain Shape Image Analysis. Minerals, 11(9), 937. https://doi.org/10.3390/min11090937
Terluin, B., Brouwers, E. P. M., Marchand, M. A. G., & de Vet, H. C. W. (2018). Assessing the equivalence of Web-based and paper-and-pencil questionnaires using differential item and test functioning (DIF and DTF) analysis: a case of the Four-Dimensional Symptom Questionnaire (4DSQ). Quality of Life Research, 27(5), 1191–1200. https://doi.org/10.1007/s11136-018-1816-5
Walker, C. M., & Gocer Sahin, S. (2023). Differential functioning. In International Encyclopedia of Education(Fourth Edition) (pp. 249–259). Elsevier. https://doi.org/10.1016/B978-0-12-818630-5.10035-1
Wallace, M. P. (2018). Fairness and Justice in L2 Classroom Assessment : Perceptions from Test Takers. The Journal of AsiaTEFL, 15(4), 1051–1064. https://doi.org/10.18823/asiatefl.2018.15.4.11.1051
Wallin, G., Chen, Y., & Moustaki, I. (2023). DIF Analysis with Unknown Groups and Anchor Items. Psychometrika, 89(1), 267–295. https://doi.org/10.1007/s11336-024-09948-7
Wallin, G., Chen, Y., & Moustaki, I. (2024). DIF Analysis with Unknown Groups and Anchor Items. Psychometrika, 89(1), 267–295. https://doi.org/10.1007/s11336-024-09948-7
Waschl, N., & Burns, N. R. (2020). Sex differences in inductive reasoning: A research synthesis using meta-analytic techniques. Personality and Individual Differences, 164, 109959. https://doi.org/10.1016/j.paid.2020.109959
Wilsa, A. W., Rusilowati, A., Susilaningsih, E., Jaja, J., & Nurpadillah, V. (2023). Validity, reliability, and item characteristics of cell material science literacy assessment instruments. Jurnal Penelitian Dan Evaluasi Pendidikan, 27(2), 177–188. https://doi.org/10.21831/pep.v27i2.61577
Wu, S., Barr, D. J., Gann, T. M., & Keysar, B. (2013). How culture influences perspective taking: differences in correction, not integration. Frontiers in Human Neuroscience, 7. https://doi.org/10.3389/fnhum.2013.00822
Wulandari, R. D., Laksono, A. D., Rohmah, N., & Ashar, H. (2023). Regional differences in primary healthcare utilization in Java Region—Indonesia. PLOS ONE, 18(3), e0283709. https://doi.org/10.1371/journal.pone.0283709
Yavuz Temel, G. (2023). A Simulation and Empirical Study of Differential Test Functioning (DTF). Psych, 5(2), 478–496. https://doi.org/10.3390/psych5020032
Yüksel, S., Demir, P., & Alkan, A. (2019). Factors causing occurrence of artificial dif: A simulation study for dichotomous data. Communications in Statistics - Simulation and Computation, 48(7), 2004–2011. https://doi.org/10.1080/03610918.2018.1429622
Copyright (c) 2025 Agus Santoso, Heri Retnawati, Timbul Pardede, Dyah Paminta Rahayu, Munaya Nikma Rosyada, Rugaya Tuanaya, Rimajon Sotlikova, Begimbetova Guldana Atymtaevna

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
The authors who publish this journal agree to the following requirements. The author retains the copyright regarding the work being simultaneously licensed below Creative Commons Attribution ShareAlike License.

Jurnal Diksi by Faculty of Languages, Arts, and Culture, Universitas Negeri Yogyakarta is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Based on a work at http://journal.uny.ac.id/index.php/diksi














