chapter 1 introduction 1.1 rationales for studying rater variability 1.2 status quo of studies on rater variability 1.3 an overview of this book 1.4 definition of key terms chapter 2 literature review: studies on rater variability in language performance assessment 2.1 rater variability in language performance assessment 2.2 exploring rater variability using statistical analysis 2.2.1 introduction 2.2.2 rater reliability in classical test theory 2.2.3 rater facet as variance ponent in generalizability theory 2.2.4 rater calibration in many—facet rasch model 2.2.5 summary 2.3 process—oriented approach to investigating rater variability 2.3.1 raters decision—making: the "black box" behind the final ratings 2.3.2 indirect evidence 2.3.3 direct investigation of rating process: insights from verbal protocols 2.4 factors accounting for rater variability 2.4.1 external factors 2.4.2 internal factors 2.4.3 situational factors 2.5 a framework for parison between rater grou 2.6 summary chapter 3 study 1:investigating the scoring reliability of cet—set using many—facet rasch model 3.1 issues in second language speaking assessment 3.2 challenges in test validation 3.3 the context of the study 3.4 objectives of the study 3.5 methods 3.5.1 data 3.5.2 instrument (mfrm) 3.6 data analyses and fins 3.6.1 facet map 3.6.2 candidates 3.6.3 tasks 3.6.4 items 3.6.5 rating scales 3.6.6 raters 3.6.7 bias analysis 3.7 conclusions 3.8 implications 3.9 further research efforts to be made chapter 4 study 2: exploring how raters cognitive and meta—cognitive strategies influence rating accuracy in essay scoring 4.1 subjective scoring: a matter of reliability or validity? 4.2 exploring rating process: looking into rater variability 4.3 rater cognition studies in writing assessment 4.4 methodology 4.4.1 the context of the study 4.4.2 participants 4.4.3 materials 4.4.4 data collection 4.4.5 data analysis 4.5 results and discussion 4.5.1 general patterns of differences in broad categories 4.5.2 in—depth investigation ofdifferences in the major sub—categories 4.6 summary and further discussion 4.7 conclusion chapter 5 conclusions 5.1 summary of fins 5.2 parison of the two studies 5.3 limitations 5.4 further research efforts to be made appendix ⅰ cet—set rating scale appendix ⅱ cet4 rating rubrics for the writing task appendix ⅲ the writing task of the dec.2006 administration of cet4 and range finders appendix ⅳ sample essays appendix ⅴ instructions and training tasks for think—aloud session appendix ⅵ sample transcripts of raters thinking aloud appendix ⅶ co protocols for think—aloud verbal reports appendix ⅷ the co scheme for raters cognitive and meta—cognitive strategies references index
以下为对购买帮助不大的评价