Publications-Theses

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

題名 廣義估計方程式在題組式測驗的應用
Generalized estimation equation in Testlet-based educational testing
作者 李介中
Lee, Chieh Chung
貢獻者 張源俊
Chang, Yuan Chin
李介中
Lee, Chieh Chung
關鍵詞 試題反應理論
試題訊息量
題組反應理論
題組式測驗
廣義估計方程式
SCORIGHT
日期 2017
上傳時間 2-Feb-2018 10:48:03 (UTC+8)
摘要 在測驗含有題組(testlet)結構時,由於違反了試題反應理論(Item Response Theory, IRT)中局部獨立性的假設,使得IRT的估計方法產生偏誤,過去研究的解決方式為在IRT模型中多加入一個參數,將題組的影響力納入模型中,此即為題組反應理論(Testlet Response Theory, TRT),在貝氏(Bayesian)的架構下,此方法的計算則可透過SCORIGHT軟體來達成。本研究旨在透過另一種方法,即廣義方程式(Generalized Estimation Equation, GEE)去處理測驗中的題組效果。GEE過去常被使用於分析縱貫式(longitudinal)的資料,本研究使用此方法來捕捉題組測驗下作答結果的相關性,並經重新參數化調整係數後使其能對受試者能力值進行估計。
電腦模擬的結果顯示GEE能有效的處理題組效果帶來的影響。在GEE和貝氏題組模型的比較上,GEE對於程度好和程度差的受試者有較佳的估計效果;而貝氏題組模型則對於程度中等的受試者表現較好,此外我們也針對GEE的估計效率進行了實驗,結果顯示先將受試者依能力分組再進行GEE估計能提升GEE的估計效率。
在文章中,我們也展示了使用GEE計算題組訊息量的方式,做為題組式測驗下評估該測驗對於各能力區間的受試者在估計準確度上的參考。
If the tests have testlet structure, the bias may arise when using traditional Item Response Theory(IRT) estimation methods due to the violations to the assumption of local independence. To deal with the testlet effect, previous studies introduced a new parameter to the classical IRT model which called Testlet Response Theory(TRT). Under the Bayesian framework, the estimation can be accomplished on the SCORIGHT program. The purpose of this paper is to use another method named Generalized Estimation Equation(GEE) to model testlet response data. GEE was commonly used to analyze the longitudinal data. We use this method to capture the information from the correlated items and estimated ability of the examinees through re-parametrization.
Simulation results indicate that GEE can deal with the testlet effect effectively. On the comparison between GEE and Bayesian testlet model, GEE does better on estimation of the examinees who have high or low ability level. In contrast, Bayesian testlet model does better on estimation of medium ability level. In addition, we design the experiment to test the efficiency of GEE. The results show that group the examinees according to their ability before doing the GEE estimation can improve the efficiency of GEE.
In this paper, we also demonstrate the method to calculate testlet information using GEE which can be taken as reference for assessing estimation accuracy of each ability level in testlet-based testing.
參考文獻 中文部分
余民寧. (1992). 試題反應理論的介紹 (二)--基本概念和假設. 研習資訊, 9, 5-9.
陳柏熹, 黃宏宇, & 王文中. (2008). 題組之相關特性對電腦化適性測驗測量精準度的影響. 測驗學刊, 55(1), 129-150.
英文部分
Dobson, A. J., & Barnett, A. (2008). An introduction to generalized linear models: CRC press.
Leisch, F., Weingessel, A., & Hornik, K. (1998). On the generation of correlated artificial binary data.
Liang, K.-Y., & Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika, 13-22.
Lord, F. M., Novick, M. R., & Birnbaum, A. (1968). Statistical theories of mental test scores.
Park, C. G., Park, T., & Shin, D. W. (1996). A simple method for generating correlated binary variates. The American Statistician, 50(4), 306-310.
Sireci, S. G., Thissen, D., & Wainer, H. (1991). On the reliability of testlet‐based tests. Journal of Educational measurement, 28(3), 237-247.
Wainer, H., Bradlow, E. T., & Wang, X. (2007). Testlet response theory and its applications: Cambridge University Press.
Wainer, H., & Kiely, G. L. (1987). Item clusters and computerized adaptive testing: A case for testlets. Journal of Educational measurement, 24(3), 185-201.
Wainer, H., & Thissen, D. (1996). How is reliability related to the quality of test scores? What is the effect of local dependence on reliability? Educational Measurement: Issues and Practice, 15(1), 22-29.
Wang, X., Bradlow, E. T., & Wainer, H. (2004). User`s guide for SCORIGHT (version 3.0): A computer program for scoring tests built of testlets including a module for covariate analysis. ETS Research Report Series, 2004(2).
Yen, W. M. (1993). Scaling performance assessments: Strategies for managing local item dependence. Journal of Educational measurement, 30(3), 187-213.
描述 碩士
國立政治大學
統計學系
104354018
資料來源 http://thesis.lib.nccu.edu.tw/record/#G1043540181
資料類型 thesis
dc.contributor.advisor 張源俊zh_TW
dc.contributor.advisor Chang, Yuan Chinen_US
dc.contributor.author (Authors) 李介中zh_TW
dc.contributor.author (Authors) Lee, Chieh Chungen_US
dc.creator (作者) 李介中zh_TW
dc.creator (作者) Lee, Chieh Chungen_US
dc.date (日期) 2017en_US
dc.date.accessioned 2-Feb-2018 10:48:03 (UTC+8)-
dc.date.available 2-Feb-2018 10:48:03 (UTC+8)-
dc.date.issued (上傳時間) 2-Feb-2018 10:48:03 (UTC+8)-
dc.identifier (Other Identifiers) G1043540181en_US
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/115723-
dc.description (描述) 碩士zh_TW
dc.description (描述) 國立政治大學zh_TW
dc.description (描述) 統計學系zh_TW
dc.description (描述) 104354018zh_TW
dc.description.abstract (摘要) 在測驗含有題組(testlet)結構時,由於違反了試題反應理論(Item Response Theory, IRT)中局部獨立性的假設,使得IRT的估計方法產生偏誤,過去研究的解決方式為在IRT模型中多加入一個參數,將題組的影響力納入模型中,此即為題組反應理論(Testlet Response Theory, TRT),在貝氏(Bayesian)的架構下,此方法的計算則可透過SCORIGHT軟體來達成。本研究旨在透過另一種方法,即廣義方程式(Generalized Estimation Equation, GEE)去處理測驗中的題組效果。GEE過去常被使用於分析縱貫式(longitudinal)的資料,本研究使用此方法來捕捉題組測驗下作答結果的相關性,並經重新參數化調整係數後使其能對受試者能力值進行估計。
電腦模擬的結果顯示GEE能有效的處理題組效果帶來的影響。在GEE和貝氏題組模型的比較上,GEE對於程度好和程度差的受試者有較佳的估計效果;而貝氏題組模型則對於程度中等的受試者表現較好,此外我們也針對GEE的估計效率進行了實驗,結果顯示先將受試者依能力分組再進行GEE估計能提升GEE的估計效率。
在文章中,我們也展示了使用GEE計算題組訊息量的方式,做為題組式測驗下評估該測驗對於各能力區間的受試者在估計準確度上的參考。
zh_TW
dc.description.abstract (摘要) If the tests have testlet structure, the bias may arise when using traditional Item Response Theory(IRT) estimation methods due to the violations to the assumption of local independence. To deal with the testlet effect, previous studies introduced a new parameter to the classical IRT model which called Testlet Response Theory(TRT). Under the Bayesian framework, the estimation can be accomplished on the SCORIGHT program. The purpose of this paper is to use another method named Generalized Estimation Equation(GEE) to model testlet response data. GEE was commonly used to analyze the longitudinal data. We use this method to capture the information from the correlated items and estimated ability of the examinees through re-parametrization.
Simulation results indicate that GEE can deal with the testlet effect effectively. On the comparison between GEE and Bayesian testlet model, GEE does better on estimation of the examinees who have high or low ability level. In contrast, Bayesian testlet model does better on estimation of medium ability level. In addition, we design the experiment to test the efficiency of GEE. The results show that group the examinees according to their ability before doing the GEE estimation can improve the efficiency of GEE.
In this paper, we also demonstrate the method to calculate testlet information using GEE which can be taken as reference for assessing estimation accuracy of each ability level in testlet-based testing.
en_US
dc.description.tableofcontents 第一章 緒論 1
第二章 文獻回顧 4
第一節 試題反應理論(Item Response Theory, IRT) 4
第二節 題組反應理論(Testlet Response Theory, TRT) 5
第三節 廣義估計方程式(Generalized Estimation Equation, GEE) 6
第三章 研究方法 8
第一節 估計方法 8
一、 最大概似估計(Maximum likelihood estimation) 8
二、 貝氏題組模型 (Bayesian testlet model) 9
三、 廣義估計方程式(Generalized estimating equation) 10
四、 分組GEE 12
五、 多階段分組GEE 12
第二節 模擬流程 13
一、 題目設定與題目訊息量 13
二、 考生能力設定 15
三、 產生作答結果 15
四、 參數估計、極端值處理與評估指標 17
第四章 結果 18
第一節 在不同能力水準和題組結構下比較MLE與GEE 18
第二節 工作相關矩陣之設定對GEE估計之影響 19
第三節 GEE與SCORIGHT估計能力值θ之情形 21
第四節 不同分組方法對於分組GEE估計結果的影響 24
第五節 能力值θ呈特定分布下之估計情形 27
第五章 結論與建議 32
第一節 結論 32
第二節 研究限制與未來方向 33
參考文獻 35
附錄 36
zh_TW
dc.format.extent 1387414 bytes-
dc.format.mimetype application/pdf-
dc.source.uri (資料來源) http://thesis.lib.nccu.edu.tw/record/#G1043540181en_US
dc.subject (關鍵詞) 試題反應理論zh_TW
dc.subject (關鍵詞) 試題訊息量zh_TW
dc.subject (關鍵詞) 題組反應理論zh_TW
dc.subject (關鍵詞) 題組式測驗zh_TW
dc.subject (關鍵詞) 廣義估計方程式zh_TW
dc.subject (關鍵詞) SCORIGHTen_US
dc.title (題名) 廣義估計方程式在題組式測驗的應用zh_TW
dc.title (題名) Generalized estimation equation in Testlet-based educational testingen_US
dc.type (資料類型) thesisen_US
dc.relation.reference (參考文獻) 中文部分
余民寧. (1992). 試題反應理論的介紹 (二)--基本概念和假設. 研習資訊, 9, 5-9.
陳柏熹, 黃宏宇, & 王文中. (2008). 題組之相關特性對電腦化適性測驗測量精準度的影響. 測驗學刊, 55(1), 129-150.
英文部分
Dobson, A. J., & Barnett, A. (2008). An introduction to generalized linear models: CRC press.
Leisch, F., Weingessel, A., & Hornik, K. (1998). On the generation of correlated artificial binary data.
Liang, K.-Y., & Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika, 13-22.
Lord, F. M., Novick, M. R., & Birnbaum, A. (1968). Statistical theories of mental test scores.
Park, C. G., Park, T., & Shin, D. W. (1996). A simple method for generating correlated binary variates. The American Statistician, 50(4), 306-310.
Sireci, S. G., Thissen, D., & Wainer, H. (1991). On the reliability of testlet‐based tests. Journal of Educational measurement, 28(3), 237-247.
Wainer, H., Bradlow, E. T., & Wang, X. (2007). Testlet response theory and its applications: Cambridge University Press.
Wainer, H., & Kiely, G. L. (1987). Item clusters and computerized adaptive testing: A case for testlets. Journal of Educational measurement, 24(3), 185-201.
Wainer, H., & Thissen, D. (1996). How is reliability related to the quality of test scores? What is the effect of local dependence on reliability? Educational Measurement: Issues and Practice, 15(1), 22-29.
Wang, X., Bradlow, E. T., & Wainer, H. (2004). User`s guide for SCORIGHT (version 3.0): A computer program for scoring tests built of testlets including a module for covariate analysis. ETS Research Report Series, 2004(2).
Yen, W. M. (1993). Scaling performance assessments: Strategies for managing local item dependence. Journal of Educational measurement, 30(3), 187-213.
zh_TW