A Study of Standard Setting for Passing Scores (通過分數之標準設定的研究)

鄭明長; ZHENG, MING-CHANG

Please use this identifier to cite or link to this item: https://ah.lib.nccu.edu.tw/handle/140.119/89462

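Title:	A Study of Standard Setting for Passing Scores (通過分數之標準設定的研究)
Author:	鄭明長 ZHENG, MING-CHANG
Contributors:	余民寧 YU, MIN-NING; 鄭明長 ZHENG, MING-CHANG
Keywords:	passing score; standard setting; classification; education
Date:	1993
Upload date:	2-May-2016
Abstract:	This study has two purposes: (1) to explore possible methods, based on item response theory, for setting passing-score standards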
Description:	Master's thesis, Department of Education, National Chengchi University
Source:	http://thesis.lib.nccu.edu.tw/record/#B2002004311
Type:	thesis
Appears in Collections:	Theses and Dissertations (學位論文)
