摩根量表分析：多元計分試題下受試者潛在特質排序之相關探討

Publications-Theses

Article View/Open

pdf(357)

Publication Export

Google Scholar^TM

題名	摩根量表分析：多元計分試題下受試者潛在特質排序之相關探討 Mokken scale analysis: A study of ordering respondents on the latent trait for polytomous items
作者	江怡萱 Chiang, Yi Hsuan
貢獻者	江振東江怡萱 Chiang, Yi Hsuan
關鍵詞	試題反應理論摩根量表單調同質性模型單調概似比性質部分計分模型 item response theory Mokken scale monotone homogeneity model monotone likelihood ratio partial credit model
日期	2013
上傳時間	14-Jul-2014 11:29:27 (UTC+8)
摘要	架構於摩根量表(Mokken scale)的單調同質性模型(The Monotone Homogeneity Model, MHM)為試題反應理論(Item Response Theory, IRT)中假設條件較寬鬆的模型。Grayson (1988) 與Huynh (1994) 證明在單調同質性模型成立下，受試者對二元計分試題的回答總分與潛在特質間具有單調概似比(Monotone Likelihood Ratio, MLR)性質，並可推得總分對於潛在特質具有隨機排序(Stochastic Ordering of the Latent Trait by the Total Test Score, SOL)性質。然而在多元計分試題，Hemker等人(1996、1997)指出僅屬於有母數試題反應理論的部分計分模型(Partial Credit Model, PCM)與其特例評定量表模型(Rating Scale Model, RSM)具MLR與SOL性質。由於有母數試題反應理論模型均為單調同質性模型之特例，因此可透過有母數試題反應理論模型，生成符合摩根量表單調同質性模型的試題反應。假設受試者對於多元計分試題的反應可藉由部分計分模型加以描述，則無論維持原始的多元計分資料形式，獲得每位受試者的多元計分總分，抑或是將每題多元計分試題化為二元計分的方式，得到受試者的二元計分總分，這兩種總分對於潛在特質都應具有隨機排序性質，而可用來衡量受試者潛在特質的大小順序。經模擬結果發現，使用多元計分總分排序受試者潛在特質的整體正確率，與使用二元計分總分排序受試者潛在特質的整體正確率，都有七成五以上，然而二元計分總分排序受試者潛在特質的整體正確率變動較大，較不穩定，而且與如何從多元計分轉化為二元計分的方式有關。再者，由於使用多元計分總分與二元計分總分分別能排序的受試者不完全一致，使用整體正確率作為比較多元計分總分與二元計分總分何者在排序受試者潛在特質上正確程度較高並不完全合適。因此我們也定義條件正確率作為另一種評比的指標，其主要目的是想要比較兩種計分總分均可排序的受試者中，分別排序正確的比例。模擬結果也顯示，整體而言，多元計分總分在排序受試者潛在特質上較二元計分總分準確。 The monotone homogeneity model (MHM) of Mokken scale is the most general item response theory (IRT) model and all parametric IRT models are its special cases. For dichotomous items, the total test score has monotone likelihood ratio (MLR) in the latent trait, and which in turn implies stochastic ordering of the latent trait by the total test score (SOL) under the MHM. However, for polytomous items, MLR only holds for the partial credit model (PCM) and its special case, the rating scale model (RSM). When analyzing polytomous items, some researchers use the total test score calculated by the polytomous item scores to order respondents on the latent trait. The others combine some of the polytomous item scores in each item, treat these items as new dichotomous items, and calculate the total test score by the new dichotomous items to order the latent traits of the respondents. Results of the simulation study show that, when item responses satisfy the partial credit model, both the total test scores calculated by the polytomous item scores and by the new dichotomous items have more than 75% accuracy rate of ordering respondents on the latent trait. Nevertheless, the accuracy rates of ordering respondents on the latent trait by the dichotomous total test score are more variable. In order to compare which of the total test score is better for ordering the latent traits of the respondents on the same basis, we define another accuracy rate, conditional accuracy rate. It shows that the conditional accuracy rate of the polytomous total test score tends to be higher than that of the dichotomous total test score as well.
參考文獻	1. Han, K. T. (2007). WinGen: Windows software that generates IRT parameters and item responses. Applied Psychological Measurement, 31(5), 457-459. 2. Hemker, B. T., Sijtsma, K., Molenaar, I. W., & Junker, B. W. (1996). Polytomous IRT models and monotone likelihood ratio of the total score. Psychometrika, 61(4), 679–693. 3. Huynh, H. (1994). A new proof for monotone likelihood ratio for the sum of independent Bernoulli random variables. Psychometrika, 59(1), 77-79. 4. Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47(2), 149-174. 5. Sijtsma, K., & Molenaar, I. W. (Eds.). (2002). Introduction to nonparametric item response theory (Vol. 5). Sage. 6. Sijtsma, K., & Verweij, A. C. (1992). Mokken scale analysis: Theoretical considerations and an application to transitivity tasks. Applied Measurement in Education, 5(4), 355-373. 7. Stochl, J., Jones, P. B., & Croudace, T. J. (2012). Mokken scale analysis of mental health and well-being questionnaire item responses: a non-parametric IRT method in empirical research for applied health researchers. BMC Medical Research Methodology, 12(1), 74. 8. Van der Ark, L. A. (2005). Stochastic ordering of the latent trait by the sum score under various polytomous IRT models. Psychometrika, 70(2), 283–304. 9. Van der Ark, L.A. (2007). Mokken scale analysis in R. Journal of Statistical Software, 20(11), 1–19. 10. Van der Ark, L. A. (2012). New developments in Mokken scale analysis in R. Journal of Statistical Software, 48(5), 1–27. 11. Van der Ark, L. A., & Bergsma, W. P. (2010). A note on stochastic ordering of the latent trait using the sum of polytomous item scores. Psychometrika, 75(2), 272–279. 12. Van Schuur, W. (2011). Ordinal item response theory: Mokken scale analysis (Vol. 169). SAGE Publications. 13. Watson, R., van der Ark, L. A., Lin, L. C., Fieo, R., Deary, I. J., & Meijer, R. R. (2012). Item response theory: How Mokken scaling can be used in clinical practice. Journal of Clinical Nursing, 21(19pt20), 2736-2746.
描述	碩士國立政治大學統計研究所 101354008 102
資料來源	http://thesis.lib.nccu.edu.tw/record/#G0101354008
資料類型	thesis

dc.contributor.advisor	江振東	zh_TW
dc.contributor.author (Authors)	江怡萱	zh_TW
dc.contributor.author (Authors)	Chiang, Yi Hsuan	en_US
dc.creator (作者)	江怡萱	zh_TW
dc.creator (作者)	Chiang, Yi Hsuan	en_US
dc.date (日期)	2013	en_US
dc.date.accessioned	14-Jul-2014 11:29:27 (UTC+8)	-
dc.date.available	14-Jul-2014 11:29:27 (UTC+8)	-
dc.date.issued (上傳時間)	14-Jul-2014 11:29:27 (UTC+8)	-
dc.identifier (Other Identifiers)	G0101354008	en_US
dc.identifier.uri (URI)	http://nccur.lib.nccu.edu.tw/handle/140.119/67470	-
dc.description (描述)	碩士	zh_TW
dc.description (描述)	國立政治大學	zh_TW
dc.description (描述)	統計研究所	zh_TW
dc.description (描述)	101354008	zh_TW
dc.description (描述)	102	zh_TW
dc.description.abstract (摘要)	架構於摩根量表(Mokken scale)的單調同質性模型(The Monotone Homogeneity Model, MHM)為試題反應理論(Item Response Theory, IRT)中假設條件較寬鬆的模型。Grayson (1988) 與Huynh (1994) 證明在單調同質性模型成立下，受試者對二元計分試題的回答總分與潛在特質間具有單調概似比(Monotone Likelihood Ratio, MLR)性質，並可推得總分對於潛在特質具有隨機排序(Stochastic Ordering of the Latent Trait by the Total Test Score, SOL)性質。然而在多元計分試題，Hemker等人(1996、1997)指出僅屬於有母數試題反應理論的部分計分模型(Partial Credit Model, PCM)與其特例評定量表模型(Rating Scale Model, RSM)具MLR與SOL性質。由於有母數試題反應理論模型均為單調同質性模型之特例，因此可透過有母數試題反應理論模型，生成符合摩根量表單調同質性模型的試題反應。假設受試者對於多元計分試題的反應可藉由部分計分模型加以描述，則無論維持原始的多元計分資料形式，獲得每位受試者的多元計分總分，抑或是將每題多元計分試題化為二元計分的方式，得到受試者的二元計分總分，這兩種總分對於潛在特質都應具有隨機排序性質，而可用來衡量受試者潛在特質的大小順序。經模擬結果發現，使用多元計分總分排序受試者潛在特質的整體正確率，與使用二元計分總分排序受試者潛在特質的整體正確率，都有七成五以上，然而二元計分總分排序受試者潛在特質的整體正確率變動較大，較不穩定，而且與如何從多元計分轉化為二元計分的方式有關。再者，由於使用多元計分總分與二元計分總分分別能排序的受試者不完全一致，使用整體正確率作為比較多元計分總分與二元計分總分何者在排序受試者潛在特質上正確程度較高並不完全合適。因此我們也定義條件正確率作為另一種評比的指標，其主要目的是想要比較兩種計分總分均可排序的受試者中，分別排序正確的比例。模擬結果也顯示，整體而言，多元計分總分在排序受試者潛在特質上較二元計分總分準確。	zh_TW
dc.description.abstract (摘要)	The monotone homogeneity model (MHM) of Mokken scale is the most general item response theory (IRT) model and all parametric IRT models are its special cases. For dichotomous items, the total test score has monotone likelihood ratio (MLR) in the latent trait, and which in turn implies stochastic ordering of the latent trait by the total test score (SOL) under the MHM. However, for polytomous items, MLR only holds for the partial credit model (PCM) and its special case, the rating scale model (RSM). When analyzing polytomous items, some researchers use the total test score calculated by the polytomous item scores to order respondents on the latent trait. The others combine some of the polytomous item scores in each item, treat these items as new dichotomous items, and calculate the total test score by the new dichotomous items to order the latent traits of the respondents. Results of the simulation study show that, when item responses satisfy the partial credit model, both the total test scores calculated by the polytomous item scores and by the new dichotomous items have more than 75% accuracy rate of ordering respondents on the latent trait. Nevertheless, the accuracy rates of ordering respondents on the latent trait by the dichotomous total test score are more variable. In order to compare which of the total test score is better for ordering the latent traits of the respondents on the same basis, we define another accuracy rate, conditional accuracy rate. It shows that the conditional accuracy rate of the polytomous total test score tends to be higher than that of the dichotomous total test score as well.	en_US
dc.description.tableofcontents	第一章緒論 1 第一節研究背景 1 第二節研究動機與目的 2 第二章文獻探討 3 第一節試題反應理論 3 2.1.1 試題反應函數 3 2.1.2 試題階段式反應函數 4 第二節有母數試題反應理論 5 2.2.1 二元計分試題反應理論模型 5 2.2.2 多元計分試題反應理論模型 7 2.2.3 部分計分模型 8 第三節無母數試題反應理論 10 2.3.1 古特曼量表 10 2.3.2 摩根量表 12 2.3.3 無母數試題反應理論模型 13 第三章研究方法與設計 16 第一節研究方法 16 第二節研究設計 17 第四章研究結果與討論 24 第一節研究結果 24 4.1.1多元計分總分與二元計分總分排序潛在特質之整體正確率 24 4.1.2多元計分總分與二元計分總分排序潛在特質正確率比較 51 第二節延伸討論 68 4.2.1 同時使用兩種計分總分排序受試者潛在特質之可行性 68 4.2.2 其它試題階段式反應函數參數值設定 74 第五章結論與建議 86 第一節研究結論 86 第二節研究限制 89 第三節研究建議 90 第四節未來研究方向 92 參考文獻 94 附錄 96 模擬試題反應資料程式 96 整體正確率與條件正確率程式 97	zh_TW
dc.format.extent	7320567 bytes	-
dc.format.mimetype	application/pdf	-
dc.language.iso	en_US	-
dc.source.uri (資料來源)	http://thesis.lib.nccu.edu.tw/record/#G0101354008	en_US
dc.subject (關鍵詞)	試題反應理論	zh_TW
dc.subject (關鍵詞)	摩根量表	zh_TW
dc.subject (關鍵詞)	單調同質性模型	zh_TW
dc.subject (關鍵詞)	單調概似比性質	zh_TW
dc.subject (關鍵詞)	部分計分模型	zh_TW
dc.subject (關鍵詞)	item response theory	en_US
dc.subject (關鍵詞)	Mokken scale	en_US
dc.subject (關鍵詞)	monotone homogeneity model	en_US
dc.subject (關鍵詞)	monotone likelihood ratio	en_US
dc.subject (關鍵詞)	partial credit model	en_US
dc.title (題名)	摩根量表分析：多元計分試題下受試者潛在特質排序之相關探討	zh_TW
dc.title (題名)	Mokken scale analysis: A study of ordering respondents on the latent trait for polytomous items	en_US
dc.type (資料類型)	thesis	en
dc.relation.reference (參考文獻)	1. Han, K. T. (2007). WinGen: Windows software that generates IRT parameters and item responses. Applied Psychological Measurement, 31(5), 457-459. 2. Hemker, B. T., Sijtsma, K., Molenaar, I. W., & Junker, B. W. (1996). Polytomous IRT models and monotone likelihood ratio of the total score. Psychometrika, 61(4), 679–693. 3. Huynh, H. (1994). A new proof for monotone likelihood ratio for the sum of independent Bernoulli random variables. Psychometrika, 59(1), 77-79. 4. Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47(2), 149-174. 5. Sijtsma, K., & Molenaar, I. W. (Eds.). (2002). Introduction to nonparametric item response theory (Vol. 5). Sage. 6. Sijtsma, K., & Verweij, A. C. (1992). Mokken scale analysis: Theoretical considerations and an application to transitivity tasks. Applied Measurement in Education, 5(4), 355-373. 7. Stochl, J., Jones, P. B., & Croudace, T. J. (2012). Mokken scale analysis of mental health and well-being questionnaire item responses: a non-parametric IRT method in empirical research for applied health researchers. BMC Medical Research Methodology, 12(1), 74. 8. Van der Ark, L. A. (2005). Stochastic ordering of the latent trait by the sum score under various polytomous IRT models. Psychometrika, 70(2), 283–304. 9. Van der Ark, L.A. (2007). Mokken scale analysis in R. Journal of Statistical Software, 20(11), 1–19. 10. Van der Ark, L. A. (2012). New developments in Mokken scale analysis in R. Journal of Statistical Software, 48(5), 1–27. 11. Van der Ark, L. A., & Bergsma, W. P. (2010). A note on stochastic ordering of the latent trait using the sum of polytomous item scores. Psychometrika, 75(2), 272–279. 12. Van Schuur, W. (2011). Ordinal item response theory: Mokken scale analysis (Vol. 169). SAGE Publications. 13. Watson, R., van der Ark, L. A., Lin, L. C., Fieo, R., Deary, I. J., & Meijer, R. R. (2012). Item response theory: How Mokken scaling can be used in clinical practice. Journal of Clinical Nursing, 21(19pt20), 2736-2746.	zh_TW

Publications-Theses

Article View/Open

Publication Export

Google ScholarTM

NCCU Library

Citation Infomation

Related Publications in TAIR

Google Scholar^TM