مدل راش چند وجهی در آزمون‌های عملی: مورد مطالعه آزمون سُرایش

ناجی, سید هدی; مقدم زاده, علی; ایزانلو, بلال; خدایی, ابراهیم

doi:10.22034/emes.2024.2003752.2479

مدل راش چند وجهی در آزمون‌های عملی: مورد مطالعه آزمون سُرایش

نوع مقاله : مقاله پژوهشی

نویسندگان

سید هدی ناجی ¹

علی مقدم زاده ²

بلال ایزانلو ³

ابراهیم خدایی ⁴

¹ دانشجوی دکتری، گروه آموزشی روش‌ها و برنامه‌های آموزشی، دانشکده روانشناسی و علوم تربیتی، دانشگاه تهران، ایران

² دانشیار گروه آموزشی روش‌ها و برنامه‌های آموزشی، دانشکده روانشناسی و علوم تربیتی، دانشگاه تهران

³ استادیار دانشکده روان‌شناسی و علوم تربیتی دانشگاه خوارزمی

⁴ استاد گروه آموزشی روش‌ها و برنامه‌های آموزشی، دانشکده روانشناسی و علوم تربیتی، دانشگاه تهران

10.22034/emes.2024.2003752.2479

چکیده

هدف: هدف این پژوهش تحلیل داده‌های حاصل از آزمون‌های عملی سازمان سنجش با استفاده از مدل راش چندوجهی و مقایسه آن با نتایج حاصل از تحلیل‌های کلاسیک است.
روش پژوهش: روش پژوهش کمی و از نوع توصیفی-تحلیلی است. مشارکت کنندگان شامل تمامی داوطلبان رشته آهنگسازی حاضر در آزمون عملی سُرایش بودند. داده مورد تحلیل از فرم‌های ارزیابی پر شده توسط چهار ارزیاب برای هر داوطلب بدست آمده است.
یافته‌ها: یافته نشان می‌دهد اگر چه ضریب همبستگی میان ارزیاب‌ها بالا بوده (بیش از 90/0)، توافق ارزیاب‌ها براساس ضریب کاپا، در حد متوسط است. همچنین براساس نتایج راش چند وجهی، پارامتر سخت‌گیری ارزیاب‌ها در حد متوسط بوده است.
نتیجه‌گیری: تمایز بین ضریب همبستگی و کاپا، نشان می‌دهد که نمی‌توان از هر یک از این شاخص‌ها به تنهایی برای تحلیل ارزیاب استفاده کرد، همچنین این ضرایب، وضعیت گروهی ارزیاب‌ها را نشان می‌دهند در حالی که راش چند وجهی وضعیت هر یک از ارزیاب‌ها را نشان می‌دهد. نتایج راش چند وجهی نشان داد که ارزیاب‌ها دچار خطای سخت‌گیری و سهل‌گیری نیستند.

کلیدواژه‌ها

راش چند وجهی ـ مدل راش ـ نظریه کلاسیک اندازه‌گیری ـ ارزیاب ـ آزمون عملی ـ کاپا

موضوعات

سنجش و اندازه‌گیری آموزش عالی

عنوان مقاله English

Multi-Faceted Rasch Model in Practical Tests: Study Subject: Sorayesh Test

نویسندگان English

Seyyedeh Hoda Naji ¹

Ali Moghadamzadeh ²

Balal Izanloo ³

Ebrahim Khodaie ⁴

¹ Ph.D student. Department of Curriculum Planning and Educational Methods, Faculty of Psychology and Education, University of Tehran, Tehran, Iran

² Associate Professor Department of Curriculum Planning and Educational Methods, Faculty of Psychology and Education, University of Tehran, Tehran, Iran

³ University of Kharazmi, Faculty of Psychology and Education , Karaj, Iran

⁴ professor Department of Curriculum Planning and Educational Methods, Faculty of Psychology and Education, University of Tehran, Tehran, Iran

چکیده English

Objective: The purpose of this research is to analyze the data obtained from the practical tests conducted by National Organization of Educational Testing, using the Multi-Faceted Rasch model and comparing it with the results of classical analysis methods.
Methods: The research method is quantitative a type of descriptive-analysis method. The participants included all the songwriting candidates taking the Sorayesh practical test. The analyzed data has been obtained from the evaluation forms filled out by four evaluators for each candidate.
Results: The findings show that although the correlation coefficient among raters was high (more than 0/90), the agreement of the raters in terms of the Kappa coefficient was average. Furthermore, based on the Multi-Faceted Rasch model, the raters strictness parameter was moderate.
Conclusion: The difference between the correlation and the Kappa coefficient shows that these two indicators cannot be used alone to analyze the rater. Also, these indicators present the group status of the raters while the Multi-Faceted Rasch model shows the individual status of each rater. The results of the Multi-Faceted Rasch model indicated that the raters didn't exhibit strictness or leniency errors.

کلیدواژه‌ها English

Keywords: Multi

Faceted Rasch

Rasch model

Classical Test Theory

Rater

practical test

Kappa

References

Allen, M. J., & Yen, W. M. (2002). Introduction to Measurement Theory. Prospect Heights, IL: Waveland Press. Translated in persian, 2013, SAMT

American Educational Research Association, American Psychological Association, & National Council on Measurement in Education (Eds.). (2014). Standards for educational and psychological testing. American Educational Research Association. Tranlated in persian, 2019, Tehran university.

Andrich D. & Marais I. (2019). A course in rasch measurement theory: measuring in the educational social and health sciences. Springer. https://doi.org/1007/10/978-981-13-7496-8

de Ayala, R. J. (2022). The theory and practice of item response theory (2st ed.). Guilford Press.

Eckes, T. (2015). Introduction to Many-Facet Rasch Measurement: Analyzing and Evaluating Rater-Mediated Assessments (2nd ed.). New York: Peter Lang.

Embretson, Susan & Reise, S.. (2000). Item Response Theory For Psychologists. Tranlated in persian, Roshd, 2009, Tehran.

Ezanloo, B., & Hajatpour, S. (2023). An Investigation of the Evaluators' Ratings of the Performance Exams in the Field of Arts Using Multi-Faceted Rasch Model. Educational Measurement and Evaluation Studies, 13(42), 100-123. doi: 22034/10/emes.528161/2023.2244

Hambleton, Ronald K. & Swaminathan, Hariharan. & Rogers, H. Jane. (1991). Fundamentals of item response theory. Newbury Park, Calif: Sage Publications, Translated in persian, 2010, Alameh tabai university.

Hombo, C. M., Donoghue, J. R., & Thayer, D. T. (2001). A simulation study of the effect ofrater designs on ability estimation (ETS Research Report No. RR-01-05). Princeton, NJ:Educational Testing Service.

Keeves J. P. (1997). Introduction: Advances in Measurement in Education. In Keeves J. P. (Ed.), Educational research methodology and measurement: an international handbook (2nd ed.) (pp. 705-712). Pergamon.

Lee, M., & Cha, D. (2016). A comparison of generalizability theory and many facet Rasch measurement in an analysis of mathemetics creative problem solving test. Journal of Curriculum Evaluation, 19(2), 251–279

Li, G., Pan, Y., & Wang, W. (2021). Using Generalizability Theory and Many-Facet Rasch Model to Evaluate In-Basket Tests for Managerial Positions. Frontiers in psychology, 12, 660553. https://doi.org/3389/10/fpsyg.660553/2021

Polat, M., Sölpük Turhan, N., & Toraman, Çetin . (2022). Comparison of Classical Test Theory vs. F Theory in writing assessment. Pegem Journal of Education and Instruction, 12(2), 213–225. https://doi.org/47750/10/pegegog.02/12.21

Robitzsch, A., & Steinfeld, J. (2018). Item response models for human ratings: Overview, estimation methods, and implementation in R. Psychological Test and Assessment Modeling, 60(1), 101–139.

Taylor, C. S. (2013). Validity and validation. Oxford University Press, Translated in persian, 2010, Alameh tabai university.

Wind, S., & Hua, C. (2022). Rasch Measurement Theory Analysis in R (1st ed.). Chapman and Hall/CRC. https://doi.org/1201/10/9781003174660

Wolf. R.M. (1997). Rating Scales. In Keeves J. P. (Ed.), Educational research methodology and measurement: an international handbook (2nd ed.) (pp. 958-965). Pergamon.

دوره 14، شماره 47
پاییز 1403
صفحه 7-26

XML

اصل مقاله 611.62 K

تعداد مشاهده مقاله 518
تعداد دریافت فایل اصل مقاله 445

مطالعات اندازه گیری و ارزشیابی آموزشی

مدل راش چند وجهی در آزمون‌های عملی: مورد مطالعه آزمون سُرایش

Multi-Faceted Rasch Model in Practical Tests: Study Subject: Sorayesh Test

References

دوره 14، شماره 47پاییز 1403صفحه 7-26

فایل ها

هم رسانی

ارجاع به این مقاله

آمار

دوره 14، شماره 47
پاییز 1403
صفحه 7-26