Monday, February 8, 2010
Race to the Top: Preparing to Choose, Implement and Manage Your Local Instructional Improvement System
•instructional planning;
•gathering information (e.g., through formative, interim, and summative assessments;
•looking at student work and other student data);
•analyzing information with the support of rapid-time reporting;
•using this information to inform decisions on appropriate next instructional steps;
•evaluating the effectiveness of the actions taken
Such systems promote collaborative problem-solving and action planning; they may also integrate instructional data with student-level data such as attendance, discipline, grades, credit accumulation, and student survey results to provide early warning indicators of a student’s risk of educational failure (p.9).
The Race to the Top initiative provides a unique, exciting, and timely opportunity for school districts to achieve the goal of elevating student achievement beyond current levels, and to do so in ways that enhance our competitiveness in the global society. Among the many elements that will contribute to success in achieving this goal is the well-informed selection, implementation and management of a high quality local instructional improvement system. In a nutshell, the opportunity to change achievement by the end of each school year is within our reach when we are empowered with actionable information on student learning that helps us to adjust instruction throughout the year. Learning occurs on a daily basis, thus, it is reasonable to assume that the goal of increasing learning can and should be supported one day at a time. A well-designed, research-based, standards aligned instructional improvement system can help to address this need.
As part of our ongoing outreach initiatives to help school districts and states address Race to the Top goals, ATI recently hosted in Colorado a statewide collaborative seminar “Elevating Student Achievement: Exploring What Works”. The goals of the seminar were to:
•Share evidence regarding what is working in districts to elevate student achievement;
•Identify the critical components of effective initiatives built upon the use of an instructional improvement system and aimed at elevating student achievement;
•Discuss challenges faced in initiatives designed to elevate achievement, and the solutions being implemented by school districts; and
•Consider the various management procedures and technology for addressing challenges that may be encountered in implementing programs designed to elevate achievement.
The seminar was attended by school district leaders, educators, and researchers and included presentations by WestEd’s Local Accountability Professional Development Series Project Director (website), the Director of Student Assessment and the Principal Consultant for the Colorado Department of Education Office of Standards and Assessments (website), and Assessment Technology, Inc. (website). Among the successes of this seminar was the formation of grassroots discussion panels comprised of school district leaders in the areas of curriculum, assessment, planning, technology, and evaluation. These individuals often have the responsibility of coordinating the development of criteria for selecting their local instructional improvement system and for implementation and management of the system. Below is a sampling of contextual highlights and questions addressed by panel members:
•An effective instructional improvement system begins with the specification of goals to be achieved within a specified time period. What challenges make it difficult to specify essential standards to be targeted for instruction within a given time period? How can those challenges be addressed effectively?
•An effective instructional improvement system uses formative assessments and a common set of interim assessments within each grade and subject. What challenges make it difficult to adopt common interim assessments and how can those challenges be addressed?
•In an effective instructional improvement system, instruction is adjusted following each interim assessment by implementing re-teaching and enrichment interventions. What challenges make it difficult to implement re-teaching and enrichment? How can those challenges be effectively addressed?
•Implementing an instructional improvement system requires groups of administrators and teachers to work together to provide an assessment system that can be used to inform instruction for all students. What are some of the management challenges associated with designing, scheduling, and implementing customized formative and interim benchmark assessments? What are the challenges associated with designing and implementing curricular interventions based on information about student learning? How can these challenges be met?
•Re-teaching and enrichment interventions require unanticipated allocations of time and resources to the instructional process. How do you meet the challenges of allocating time and resources to re-teaching and enrichment?
If you would like information regarding how panel members responded to these and other questions that emerged during the seminar, please contact us at ATI and/or watch the seminar video to be posted soon on the ATI website. If you would like to share your comments on these challenges within the forum community, please do so here. And, if you have questions that you would like to pose related to instructional improvement systems and the Race to the Top initiative, please post them here for comment from ATI and the forum community.
Jason K. Feld, Ph.D.
Tuesday, January 12, 2010
Elevating Student Achievement Seminar - Submit a Question
The seminar will be a dialog on instructional interventions and the elevation of student achievement. If you know of any others in your district interested in registering, please feel free to pass the following link on to them - www.ati-online.com/WhatWorks
We’re also encouraging those interested in submitting a question pre-event for seminar panel discussions on the following topics:
- The benefits of and challenges encountered when implementing an instructional improvement initiative.
- The benefits of and challenges encountered when managing an instructional improvement initiative.
We look forward to hearing your questions!
Thursday, January 7, 2010
So what are item parameters, anyway?
The best way to understand what item parameters refer to is to look at an Item Characteristic Curve. On an item characteristic curve, which presents the data for one, specific item, student ability (based on their performance on the assessment as a whole) is plotted on the horizontal axis, with a mean of 0 and a standard deviation of 1. The probability of answering the item correctly is plotted on the vertical axis. Typically the probability of answering correctly is relatively low for low ability students and relatively high for students of higher ability.
The first example (Item #4) is an example of a great item as far as the parameters go. The b-value (difficulty) for that item was 0.689, which is a bit on the difficult side, but not too bad. The important point here is that b-values (item difficulty) are on the same scale as student ability. So what this example is telling us is that students at or above 0.689 standard deviations above the mean are likely to get the answer correct. Students below that point on the ability scale are more likely to answer incorrectly. The b-parameter is also known as the location parameter, because it locates the point on the ability scale where students start demonstrating mastery of the concept.

The a-value (discrimination) refers to how well the item discriminates between different ability levels. It’s how steep the rise is in the curve that shows the probability of answering correctly. Ideally, there is a nice, steep rise in the probability of answering correctly like the one for question 4. That indicates that there is a dramatic change in how likely it is that a student has mastered the concept that’s pin-pointed within a very narrow range of the ability scale. You can be pretty confident that students above 0.689 standard deviations above the mean “get it” and that students below that point generally don’t. The discrimination parameter for question 4 is 1.459.
The next example, Item 5, shows an item that doesn’t discriminate quite as well as Item 4. The a-value on that one is 0.53. It’s also a pretty easy item, with a b-value of -1.07. So, on this one, most students are likely to get it correct, unless they’re more than one standard deviation below the mean of the ability scale.


Monday, December 28, 2009
A Closer Look at the Benchmark Results Page
This is because the DL score, and the student’s associated mastery level, provides a much better picture of a student’s ability. A raw score will simply tell a user what a student got right and what a student got wrong. The DL score factors in, not only what items a student got right and wrong, but also the difficulty and discrimination value (how well does this item discriminate between students of different ability levels) of the items.When a student takes an ATI Benchmark Assessment, they will earn a particular DL score. A DL score is a score that takes the relative difficulty of the assessment into consideration. DL scores on two assessments can be compared in a meaningful way whereas raw scores cannot. For example, 70% correct on a very easy assessment does not mean the same thing as 70% correct on a very difficult assessment. However, a DL (scale) score of 954 on one assessment means the same thing as a DL score of 954 on another assessment, as long as the two assessments have been placed on the same scale. The particular DL score a student earns places them in a particular mastery category. Each state has its own mastery categories (Below the Standard/Unsatisfactory, Approaches the Standard/Partially Proficient, Meets the Standard/Proficient, Exceeds the Standard/Advanced) but they are similar in nature. For example:

Cut scores are then established to determine which Mastery category a student will be placed based on his or her performance on the assessment. The cut scores that define the mastery categories are established for Bencmark 1 using equipercentile equating to align students’ scores to their scores on last year’s state assessment results. The cut scores on all other assessments administered in a school year are established based on the amount of growth in terms of scale scores that is expected from one benchmark to another.
What does this mean for the user? Users can rely on DL scores and their associated mastery catagories, to help identify students to target for interventions, even after one benchmark assessment is given. A teacher’s goal should be to see an increse in DL scores (and mastery catagories) as the year progresses and students learn more of the standards. To acheive this goal, teachers will want to analyze the Class Development Profile Grid, Item Analysis and Risk-Level Report to identified standards on which to focus their re-teaching instruction. Click here to learn more about how these reports can assist with interventions.
Tuesday, December 22, 2009
Lesson Plan Documentation: A Great Use of Instructional Dialogs
A completed lesson plan in a different electronic format may also be attached as a resource.
Generate a short quiz at the end of the Instructional Dialog. This can check the effectiveness of the lesson.
Finally, schedule the dialog which will allow the Instructional Dialog/lesson plan to show up on Galileo’s class calendar for an effortless view of what has been accomplished in the classroom.
Thursday, December 17, 2009
Intervention Alert Report
Tuesday, December 8, 2009
Thoughts on Race to the Top: Collaboration and local control
The guidelines presented to the states to prepare their RTT applications contained two clear themes. On the one hand, state education initiatives are supposed to preserve the “flexibility and autonomy” of LEAs. There is clear recognition of the need for districts to be supported in their efforts to make decisions about curriculum assessments and other issues that are in the best interests of their staff and students. In addition to the call for local control there is also a clear mandate for collaboration. States are encouraged to adopt common standards and collaborate to produce common assessments. One of the questions that state governments face in their preparation of their proposals is how best to balance these two, at times seemingly contradictory, objectives.
One of the ways that collaboration could be facilitated, while at the same time preserving the decision making power of the districts is for the state to make available to districts an item bank in which all of the items are on the same scale. These items could be used on both district interim assessments and the state test. What would this mean for the sake of districts you ask? Such an item bank would afford the opportunity to make sure that the assessments composed of these items, either entirely or in part, can be placed on the same scale. This means that the scores are directly comparable. The 500 on the math test given in the middle of the year could be compared directly to the 550 on the state test at the end of the year. Put another way, in this case, the statement could be made that the ability level required to achieve 550 on the benchmark test is higher than the ability level required to achieve 500 on the state test. Without tests that are on a common scale, such comparisons are not possible. The 550 might represent higher ability than the 500 and then again it might not. Having a common measuring stick could go a long way towards facilitating collaborative work.
A common item bank could also greatly assist smaller districts in their efforts to implement valid and reliable interim assessments for the purpose of informing instruction. The utility of assessment results is greatly aided to the extent that they reflect the kids that actually attend the district schools and the instructional priorities of the district. Research has consistently shown that the items behave differently when students change or when instruction changes. Ongoing analysis of test behavior is critical to making sure results are reliable and valid for the kids with whom it will be used. Such analysis is difficult with districts that have only small numbers of kids. Having a common item bank from which to draw could make it much easier to do the necessary analyses to back up an assessment initiative with a small district.
Achieving these beneficial results does not require that both the state and district tests are comprised entirely of items from this bank. The only requirement is that they both contain a sample of items from the bank. This would leave the district free to include local items reflecting content which may not be of interest to other districts in the state. Easy communication and collaboration need not be sacrificed in order to continue to achieve the flexibility and autonomy that allows districts to make sure that instructional improvement systems meet their priorities.
Anyhow, I had best sign off at this point. This post is already rather lengthy. We would, as always, be interested in hearing the feedback of others about these ideas or about other topics.