Join the GOOGLE +Rubber Room Community

Monday, December 19, 2011

Mike Winerip: 10 Years of Assessing Students With Scientific Exactitude

December 18, 2011

10 Years of Assessing Students With Scientific Exactitude



In the last decade, we have emerged from the Education Stone Age. No longer must we rely on primitive tools like teachers and principals to assess children’s academic progress. Thanks to the best education minds in Washington, Albany and Lower Manhattan, we now have finely calibrated state tests aligned with the highest academic standards. What follows is a look back at New York’s long march to a new age of accountability.
Richard Mills
DECEMBER 2002 The state’s education commissioner, Richard P. Mills, reports to the state Regents: “Students are learning more than ever. Student achievement has improved in relation to the standards over recent years and continues to do so.”
JANUARY 2003 New York becomes one of the first five states to have its testing system approved by federal officials under the new No Child Left Behind law. The Princeton Review rates New York’s assessment program No. 1 in the country.
SPRING 2003 Teachers from around New York complain that the state’s scoring of newly developed high school tests is out of whack, with biology and earth science tests being too easy and the physics test too hard. The state Council of School Superintendents finds the physics scores so unreliable, it sends a letter to colleges for the first time in its history urging them to disregard the test result. Dr. Mills does not flinch, calling the tests “statistically sound” and “in accordance with nationally accepted standards.”
JUNE 2003 Scores on the state algebra test are so poorly calibrated that 70 percent of seniors fail. After a statewide outcry, officials agree to throw out the results. The Princeton Review says that ranking New York first was a mistake. “We’re going to have to come up with a fiasco index for a state like New York that messes up a lot of people’s lives,” a spokesman says.
OCTOBER 2003 A special panel appointed to investigate the state math fiasco concludes that the test “can’t accurately predict performance,” was created “on the cheap” and was full of exam questions that were “poorly worded” and “confusing.”
DECEMBER 2003 The director of state testing resigns. It was his idea to leave, a spokesman says.
MAY 2004 For the fourth year in a row, scores have risen on elementary and middle school state reading and math tests. Dr. Mills urges the Regents: “Look at the data that shows steadily rising achievement of the standards in school districts of all wealth and categories. More children are learning more now than ever before.”
FEBRUARY 2005 Dr. Mills rebukes those who question whether state scores are inflated. “The exams are not the problem,” he said in a report to the Regents. “It’s past time to turn from obsessive criticism of the exam and solve the real problems — the students who are not educated to the standards.”
SPRING 2005 New York City fourth graders make record gains on the state English test, with 59 percent scoring as proficient, compared with 49 percent the year before. “Amazing results” that “should put a smile on the face of everybody in the city,” says Mayor Michael R. Bloomberg, who happily recites the numbers on his way to re-election.
FALL 2005 The federal tests (the National Assessment of Educational Progress), which are considered more rigorous than the state tests, show a drop in New York City reading scores. On the eighth-grade test, 19 percent are proficient in 2005, compared with 22 percent in 2003. Asked if city and state officials had hyped the state test results, Merryl H. Tisch, a Regent, says, “They have never, ever, ever exaggerated.”
Merryl Tisch
SEPTEMBER 2007 New York’s national assessment test results are again dismal; eighth-grade reading scores are lower than they were in 1998.
DECEMBER 2007 In his report to the Regents, Dr. Mills notes, “A rich, scholarly literature has challenged NAEP validity since the early 1990s.” He announces a plan to develop the first new state learning standards since 1996, to further spur academic excellence.
JUNE 2008 Newly released state test scores show another record year for New York children. Math scores for grades three through eight indicate that 80.7 percent are proficient, up from 72.7 in 2007. “Can we trust these results?” Dr. Mills asks. “Yes, we can. New York’s testing system, including grades three through eight tests, passed a rigorous peer review last year by the U.S. Department of Education. State Education Department assessment experts commission independent parallel analyses to double- and sometimes triple-check the work of our test vendor.”
JUNE 2009 In the previous decade, New York students’ average SAT verbal score has dropped to 484 from 494; the math SAT score has dropped to 499 from 506. The national assessment’s fourth-grade reading scores have been stagnant for four years, and the eighth-grade scores are their lowest in a decade.
But somehow, state test scores again soar to record levels. In New York City, 81 percent of students are deemed proficient in math, and 68.8 percent are proficient in English. “This is a big victory for the city,” the schools chancellor, Joel I. Klein, says, “and we should bask in it.” In November the mayor is elected to a third term, again riding the coattails of sweet city scores.
JULY 2010 Finally someone — Dr. Tisch, the chancellor of the Board of Regents — has the sense to stand up at a news conference and say that the state test scores are so ridiculously inflated that only a fool would take them seriously, thereby unmasking the mayor, the chancellor and the former state commissioner. State scores are to be scaled down immediately, so that the 68.8 percent English proficiency rate at the start of the news conference becomes a 42.4 proficiency rate by the end of the news conference. Shael Polakow-Suransky, chief accountability officer for the city, offers the new party line: “We know there has been significant progress, and we know we have a long way to go.” Whether there has been any progress at all during the Bloomberg years is questionable. The city’s fourth-grade English proficiency rate for 2010 is no better than it was in February 2001, nine months before the mayor was first elected.
Mr. Polakow-Suransky says that even if city test scores were inflated, he is not aware of any credible research calling the city’s 64 percent graduation rate into question.
FEBRUARY 2011 The city’s 64 percent graduation rate is called into question. The state announces a new accountability measure: the percentage of high school seniors graduating who are ready for college or a career. By this standard, the graduation rate for New York City in 2009 was 23 percent.
MAY 2011 Embracing the latest new tool in the accountability universe, the governor, state chancellor and education commissioner ramrod a measure through the Board of Regents, mandating that up to 40 percent of teachers’ and principals’ evaluations be based on student test scores.
AUGUST 2011 With new, more rigorous state tests, city scores rise slightly. “We are certainly going in the right direction,” the mayor says.
NOVEMBER 2011 New York is one of two states in the nation to post statistically significant declines on the National Assessment tests. John B. King, the education commissioner, says the state is certainly going in the wrong direction, but has a plan to spur students’ achievement. “The new Common Core Learning Standards will help get them there,” he says.
DEC. 19, 2011 Nearly a quarter of the state’s principals — 1,046 — have signed an online letter protesting the plan to evaluate teachers and principals by test scores. Among the reasons cited is New York’s long tradition of creating tests that have little to do with reality.

E-mail: oneducation
@nytimes.com