Primer on Large-Scale Assessments of Educational Achievement

Page 75

What Are the Key Decisions in Designing Large-Scale Assessments?

Figures 4.1 and 4.2 show open-ended and multiple-choice reading literacy items developed under the SEA-PLM assessment framework. As discussed in chapter 9, this regional large-scale assessment measures reading literacy using different text types (for example, narrative, descriptive, and persuasive) and drawing on different cognitive processes linked to reading comprehension (such as locating information in a text, interpreting information, and reflecting). The open-ended item in figure 4.1 requires the student to use reading skills to compare pieces of information about two countries. The multiple-choice item in figure 4.2 presents a narrative text and requires the student to locate information about the action that one of the characters takes. In addition to the item development and content review process by subject matter experts, national large-scale assessment teams must pilot the items; the pilot helps determine the psychometric properties of each item and allows those with adequate levels of difficulty and discrimination to be selected. The pilot study is also a good opportunity to check for student understanding of each item in the assessment and address any content-related problems before the final assessment administration (box 4.5).

BOX 4.5. The Importance of Item Piloting Before the final version of test booklets is constructed, it is important to pilot the proposed test items to identify those that provide the most accurate and reliable evidence on what students know and can do. A pilot should be conducted several months in advance of test administration to allow sufficient time for data collection and analysis and to create, print, and distribute the final version of the test. The piloting process will help identify inappropriate items that should be omitted from the final version of the test, that may need revision before they can be included, and that are ready to be included in the final assessment. For instance, items that are extremely easy or extremely difficult for students at the target age or grade may need to be removed. Items that are unclear or have poor-quality distractors may improve if they are revised. It is also important to determine whether items perform similarly across population subgroups; no item should be systematically easier or more difficult for students of a particular sociodemographic group. If it is, it may need to be revised or removed from the final version of the test. It is common to pilot two to three times as many items as will be included in the final version of the test. For instance, if the final instrument will include 30 items per subject, at least 60 items should be piloted for each subject. In addition to supporting the selection of items for the final test form, a well-designed pilot study provides the assessment team with an opportunity to improve the instructions for assessment administrators, determine the time it takes participants to answer the test items, identify student engagement during the assessment, strengthen scoring rubrics for open-ended items, and refine data collection procedures before test administration. Anderson and Morgan (2008) provide an in-depth description of how to plan for, design, and conduct a pilot. Source: Adapted from Anderson and Morgan 2008.

53


Turn static files into dynamic content formats.

Create a flipbook

Articles inside

Glossary of Technical Terms

7min
pages 159-163

Asia Primary Learning Metrics 2019 Assessment

1min
page 155

Primary Learning Metrics 2019 Reading Literacy Assessment

1min
page 154

Benchmarks for Grades 4 and 6

2min
page 149

Metrics 2019 Mathematical Literacy Assessment

4min
pages 152-153

CONFEMEN 2014

1min
page 142

Programme d’Analyse des Systèmes Éducatifs de la CONFEMEN

2min
page 141

Assessment, 2000–18

1min
page 131

Eastern Africa Consortium for Monitoring Educational Quality

2min
page 138

8.9 Translation and Adaptation of International Large-Scale Assessments

2min
page 130

Study, 2001–16

1min
page 125

Mathematics and Science Study

2min
page 124

International Mathematics and Science Study

4min
pages 119-120

References

1min
pages 115-116

Chapter 8. What Are the Main International Large-Scale Student Assessments?

1min
page 117

Achievement Survey

1min
page 109

What Are Other Ways to Communicate Large-Scale Assessment Results?

2min
page 112

Chapter 7. How Can Effective Communication of Large-Scale Assessment Results Be Ensured?

1min
page 105

7.1 Features That a National Large-Scale Assessment Can Highlight

3min
pages 107-108

References

1min
page 104

Assessment According to Sex

2min
page 97

Key Ideas

1min
page 103

Student Achievement?

2min
page 99

6.2 Scoring Rubrics

2min
page 95

6.1 Example of an Item and Its Codebook Information

1min
page 94

References

1min
page 92

Key Ideas

1min
page 91

Administration?

2min
page 89

Structure and Main Changes over Time

1min
page 81

5.1 National Large-Scale Assessment: Student Tracking Form

1min
page 88

References

1min
page 84

What Should Be Included in the Test Administration Manual?

2min
page 79

How Will the Assessment Be Administered?

1min
page 78

4.5 The Importance of Item Piloting

2min
page 75

4.4 Item Writing Guidelines

1min
page 74

Chapter 4. What Are the Key Decisions in Designing Large-Scale Assessments?

1min
page 67

References

1min
page 66

Key Ideas

1min
page 65

3.2 National Large-Scale Assessment Funding Checklist

2min
pages 63-64

Team Personnel

2min
page 60

Católica de Chile in Supporting National and International Large-Scale Assessment Initiatives

9min
pages 56-59

3.1 Saudi Arabia’s Education and Training Evaluation Commission

2min
page 54

Syndicate

1min
page 55

References

4min
pages 49-52

Key Ideas

1min
page 48

Goals in Brazil

2min
page 43

2.6 Communicating National Large-Scale Assessment Results in Peru

2min
page 41

National Assessment

1min
page 39

Practice in Argentina

2min
page 47

What Are Some Common Policy Implications of Large-Scale Assessment Findings?

2min
page 42

9.3 Proportion of Students in Each Reading Proficiency Level in Grade

2min
page 27

2.5 Investing in Technical Expertise in Indonesia

2min
page 40

What Will You Learn from This Primer?

2min
page 32
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.