Sunday, December 2, 2012

The newest rhetoric on teacher evaluation — and why it is nonsense

Posted by Valerie Strauss on November 13, 2012 at 6:00 am

Carol Burris is the award-winning principal of South Side High School in Rockville Centre, New York, and a frequent Answer Sheet blogger. She just underwent an ordeal as a result of Hurricane Sandy. Here’s what she wrote:

(Mike Stobe/AFP/Getty Images)

Physically we are fine…we evacuated. I am one mile from the bay, but when it surged, about 6 feet entered my home. All in the finished basement. Lost a lot–heating system for the house, hot water and we have no electric. Walls and staircase have to come down. House smells awful. Lost lots of furniture, but also pictures, my husband’s Lionel train set from his Dad and all my research from my dissertation. Stuff you can’t get back. Some leaks, a fence down but because we left we did not lose our cars. We are pretty lucky compared with others.

Yet even while dealing with all of this Carol Burris still keeps writing about the negative effects of school reform. Here she looks at the newest popular rhetoric on teacher evaluation — and explains why it is nonsense. Burris is theco-author of the New York Principals letter of concern regarding the evaluation of teachers by student test scores, which has been signed by more than 1,500 New York principals and more than 5,400 teachers, parents, professors, administrators and citizens. You can read the letter by clicking here.

By Carol Burris

As a high school principal, it is my job to evaluate teachers. I take this responsibility very seriously — it helps ensure that our students receive the rich opportunities to learn that they deserve. With strong teachers, evaluation may entail reaffirming good practice, supporting innovative practice and facilitating ways for them to share their expertise with their colleagues. For novices or those who struggle, we work to improve their practice and, when necessary, to counsel them out or let them go.

It is because instruction is so important that the sweeping generalizations and false assumptions that have fueled recent teacher evaluation policies are of such concern to teachers and school leaders alike. The waves of misinformation about evaluation undermine confidence in our schools and result in “solutions” based on opinion and gut-level hunches, not research evidence. The recent Phi Delta Kappan opinion piece, entitled “Million Dollar Baby,” is an example of the misguided critiques that appear all too often.

Let me begin by saying that I have always been a fan of the Kappan, which skillfully takes scholarly research and makes it accessible to educators who do not have time to pore over academic journals. Despite that fine track record, the generalizations that form the argument in this month’s editor’s note cannot go unaddressed. It is time to get the record straight and address three common fallacies that dominate the new rhetoric on teacher evaluation:

1. Every former teacher evaluation system was the same and that unitary system was terrible. To quote from the opinion piece, “Unfortunately educators must bear the bulk of the blame for allowing such a lousy system to exist.” In reality, there was never one evaluation system nor was every system “lousy.” Rather, each school district has had its own system of teacher evaluation, and some of those have been better than others. That doesn’t mean, of course, that we don’t have substantial room for improvement. But it does mean that it’s ridiculous to start a reform discussion with the contention that all districts should abandon their evaluation system regardless of its track record. I would wager, for instance, that Kappan’s editor would agree that the Montgomery County Maryland School System has a nationally acclaimed system, and that Cincinnati Schools had a system, before Race to the Top, that has been shown to not only improve the craft of teachers but to increase student achievement. Neither system incorporated test scores. In the small districts on Long Island, most of us did an excellent job evaluating teachers—dismissing probationers who do not merit tenure, helping teachers continue to develop, working with and counseling those who needed to improve or to leave the profession, and building on the strength of even our most expert practitioners. Among Long Island principals, you will find few fans of New York State’s new evaluation systems, based on APPR.

2. Tenure is the problem. It is a job for life and it is unique to teaching. The Kappan editorial states that tenure is one of the “unique privileges that teachers enjoy.” But in truth due process before dismissal (tenure) is not unique to teaching. In fact, it is more difficult for a principal to dismiss a custodian due to civil service protection than it is to dismiss a teacher. Civil servants enjoy seniority rights, probation periods, salary schedules, and due process rights for dismissal just like teachers. Civil servants, who are broadly defined as those who work for government, include librarians, police officers, firefighters, transit workers, secretaries, and accountants. Due process should not be understood or practiced as a “job for life,” but it should remove the threat of political or arbitrary dismissals.

There are excellent reasons for such protections. The civil service was established in the late 1800s because prior to its establishment, government jobs were given to political supporters as spoils. The protections were put into place to make sure that public employees were hired on merit and could not be dismissed on the whims of the incoming administration. This remains a concern. Public schools are run by politicians—in some cases by mayors, in other cases by elected boards of education.

As an alternative to tenure, the Kappan editorial suggests that teachers “should receive a contract for a limited period of time, say three or five years”. Although this may sound reasonable, consider the clear consequences. Without the protection of tenure, educators could be dismissed for not pleasing the interests of powerful parents. They could be dismissed in order to bring in friends and relatives of newly elected mayors or board members. Teachers could be pressured to pass students who did not deserve to pass a class or be pressured to not discipline a student when warranted. Presently, there is one person in every district who works on a renewable contract: the superintendent. Nationally, the average time that a superintendent stays in a district is seven years. For an urban superintendent it is fewer than three years. And the constant turnover of superintendents does not serve students or schools well. Tenure promotes stability and community in our schools. Teacher turnover, even when it is the less effective teachers who leave, has a negative effect on student achievement. Likewise it has been found that churn in the principalship is not good for schools. Such instability does not promote excellence and the courage to make the tough decisions that are not politically popular but serve the best interests of students. Again, this isn’t an argument against pursuing ways to streamline the dismissal process; it’s an argument against poorly thought through changes.

3. High-stakes evaluations are fine as long as they do not rely on a single measure. This is the new popular rhetoric. It is a partial acknowledgement of the many problems associated with using students’ test scores and growth models in teacher evaluations, problems that have been repeatedly documented. And yet the Kappan editor and others still insist on the inclusion of students’ test scores in teacher evaluation. Multiple measures are indeed wise, but the effects of including any given measure need to be understood. Current policies do in fact place test scores in a prominent role, one for which they are not valid or reliable and because of which school districts can expect to be (justifiably) challenged in court by dismissed teachers (as explained in another article in the same November issue of the Kappan). The troubling reality is that these policies will promote teaching to standardized tests and a narrowing of the curriculum

The editorial suggests that we also include other untested ingredients, such as student surveys, in the evaluation mix. We should do this, apparently, even though there is as of yet no reliable research base to support the idea. As a high school principal, I thoroughly enjoy working with teenagers. I find their opinions to be frank and refreshing. But I do not think it is fair or wise to give 14 year olds a formal role in teacher evaluation. It is bad enough that we are undermining the student-teacher relationship by basing evaluations on those students test scores.

The magazine’s editor concludes by asserting that “every classroom should have excellent teaching every hour of every day.” I would add that every child should also have an excellent parent who serves them excellent food and provides them with an excellent home in an excellent neighborhood. Let’s also add excellent healthcare and excellent supervision every hour of every day as well. If we could accomplish all of that, we would have the highest achieving students on earth. But the rhetoric itself accomplishes little. What we need are research-based policies supported by lawmakers willing to provide the necessary resources.

In the meantime, while we wait for those wise lawmakers to emerge, perhaps we all could back off and allow teachers to enjoy the same humanity we seem to graciously grant to others. Teachers aren’t perfect, but I must tell you that nearly all of the teachers that I have met over the years are darn good at what they do. And the variation in their skill is no wider than the variation that I have observed in other professions whose evaluations we never seem to discuss. Let’s look to improve evaluation systems as well as other parts of our schools. But could we stay within reasonable bounds of critique based on fact and research? If we do not stop this constant drumbeat of criticism there will be no one left to evaluate with our new excellent-every-hour-every-day evaluation systems.

New teacher evaluations start to hurt students

By Valerie Strauss, The Answer Sheet

LINK

Much of the discussion about the use of student standardized test scores to evaluate teachers has centered on how unfair the “value-added” method is to teachers because it is unreliable and can — and does — label effective teachers as ineffective too often. But there are consequences for the students too, and they are just starting to be seen. This is explained by in this post by Carol Burris, the award-winning principal of South Side High School in Rockville Centre, New York. She is the co-author of the New York Principals letter of concern regarding the evaluation of teachers by student test scores, which has been signed by more than 1,500 New York principals and more than 5,400 teachers, parents, professors, administrators and citizens. You can read the letter by clicking here.

By Carol Burris

The first “growth scores” for the teachers of students in Grades 3-8 have arrived in New York State and they are even more problematic than expected. What is happening in New York is happening across our country. The question that those who love our public schools and their students must confront is what credence will we give them and how will we respond.

We know that of all of the many factors that account for the variance in student scores, the teacher is the greatest in-school factor — she contributes more for example, to student test scores than does the principal or the school size. However, factors other than the teacher account for roughly 85-90% of the variation in students’ test scores. Teachers account for only 10-15% of the variance in scores. Some researchers have argued that even that percentage is too high due to the conflation of teacher contribution with class size and peer effects. What that means is that the urban legend that three excellent teachers in a row will close the achievement gap is not grounded in research but rather in speculation.

The shortcomings of evaluating teachers by test scores were apparent in the recent report of the American Institute for Research (AIR), which developed the New York growth score model. AIR, in its BETA report, shows how as the percentage of students with disabilities and students of poverty in a class or school increases, the average teacher or principal growth score decreases. In short, the larger the share of such students, the more the teacher and principal are disadvantaged by the model. I predict that when the state results are made public, you will see a disproportionate amount of teachers of students with serious learning disabilities and teachers in schools with high levels of poverty labeled ineffective on scores. And that label will be unfair.

Likewise, in the model used this year, teachers who have students whose prior test scores were higher were advantaged, while teachers whose students have lower prior achievement were disadvantaged. This phenomenon, known as peer effects, has been observed in the literature since the 1980s. There is no control for peer effects in the model. We will see patterns of low scores for teachers of disadvantaged students. Over time, the students who need the best teachers and principals will see them leave their schools in order to escape the ‘ineffective’ label.

Perhaps the best critique of the model comes from AIR itself. The BETA report concludes that “the model selected to estimate growth scores for New York State represents a first effort to produce fair and accurate estimates of individual teacher and principal effectiveness based on a limited set of data” (p. 35). Not “our best attempt,” not even a “good first attempt,” but rather a “first effort” at fairness.

And yet, across the state, teachers and principals have received scores telling them that they are ineffective in producing student learning growth.

During the first two weeks of September, Principal Harry Leonadartis surveyed principals around the state to find out if the growth ratings they received for their teachers appeared to be an accurate reflection of their teachers’ skills. More than 500 New York principals responded.

Seventy three percent of respondents said that the “ineffective” label assigned to some of their teachers was either not a very accurate or an inaccurate reflection of that teacher based on their observations and the performance of that teacher’s students. A majority said that the scores overall were not a very accurate reflection of teacher ability. Regarding APPR, the state-imposed evaluation system, 81% regarded it as a tool of limited or no value for the evaluation of teachers. Only 19% had a positive attitude toward APPR with minimal concerns. Over 81% described themselves as either reluctant participants or opposed to APPR. More than 1510 New York principals have signed a letter of opposition to APPR which can be found here.

In the comments section of the survey, several principals reported having excellent special education teachers labeled as “ineffective.” One principal wrote: “Two excellent teachers who volunteer to take on my toughest students got an ineffective. Their hearts were broken. So was mine.” Another principal remarked, “The teachers who were identified as ineffective…have been teaching for more than 15 years, and have cared for students in ways that no test can measure.”

Other principals remarked that teachers who received poor ratings were teacher often praised by students and parents alike. Some principals stated that they would change their teacher’s assignment next year and assign them less needy students so that they could protect these excellent teachers from the ineffective rating. The unintended consequences to students are beginning.

How can an evaluation system in which the evaluators themselves have little faith possibly be productive? The question is, what will we collectively, and individually as school leaders do?

There are principals who are bravely standing up. One of them is Don Sternberg, the leader of the Wantagh Elementary School on Long Island. You can read his letter to parents here.

Don wrote:

Of additional concern to me is the relationship between children and their teacher as we move into an era where teacher job status is based upon student assessment scores. Guess what, some children will become more desirable than others to have in class! And, conversely, others will be less desirable. There, I wrote it! That concept is blasphemy in our school where teachers live to prepare children to be productive learners and members of society. Teachers state-wide are worried that their relationship with students might change when they are evaluated based upon their students’ test scores. Teachers want to educate students, not test prep them for job security.

Additionally, what should be shocking to you as a parent is that state and national databases are being created in order to analyze and store students’ test scores – your child’s assessment results and your child’s school attendance! Do you realize that the state has mandated that classroom teachers must take attendance during every math, ELA, social studies and science lesson – everyone, every day for the entire school year! Those records are sent to the state and become statistically part of the teacher evaluation process. It will no longer be enough that your child ‘was in school.’

Rather, if he or she was at a band lesson or out of the room for extra help in reading and a math lesson was taking place in class, he or she will be noted as absent from that instruction. That will be factored into the teacher evaluation….This is all part of the massive, multi-million tax-payer dollar teacher evaluation processes started by our Commissioner of Education, our governor, and our state legislators and fully supported by statisticians employed by the state and assessment-making companies. No one in Albany is selecting to see the end of the journey; that 98 percent of the students graduating from Wantagh Schools go on to two- and four-year colleges. Their myopic view is focused on the ‘parts’, not the whole. Who will eventually suffer? Your children!

Further upstate New York, an outstanding principal, John Mc Kenna, worked with the Niagara Regional PTA to create a resolution against high-stakes testing and teacher evaluation by test scores to be presented at the New York State PTA convention later this fall. You can read their resolution here:

Poverty matters. It does not seal the fate of a child, but if we are to overcome the disadvantages that it brings, we must level the playing field by providing effective supports to poor students and the teachers who serve them. The “no excuses” philosophy which seeks to blame teachers for the burden our entire society must bear is a cold and shameful response to our most disadvantaged students. The waste of billions of taxpayer dollars on testing, test security, test shredding, intrusive data systems and test-score teaching ratings is a violation of the public trust.

How many more brave educators and parents will stand up, speak and say “no more?”

NYC Rubber Room Reporter and ATR CONNECT

Sunday, December 2, 2012

Carol Burris: The "New" Teacher Evaluation Process is Nonsense

Sunday, December 2, 2012

The newest rhetoric on teacher evaluation — and why it is nonsense

New teacher evaluations start to hurt students

No comments:

Post a Comment