Jersey Jazzman: Oh, Matt...

Wednesday, August 24, 2011

Oh, Matt...

From my perspective, there are two MVPs in the current reform debate: Bruce Baker and Matt DiCarlo.

I have enormous respect for Matt. He commands a great deal of information about a complex topic, he has a strong grasp of research methods, and he has the ability to distill the thorny language of academic research into writing that lay people can not only understand, but use to inform themselves about this critical debate.

Which is why I am so very, very disappointed in his latest post:

Using value-added and other types of growth model estimates in teacher evaluations is probably the most controversial and oft-discussed issue in education policy over the past few years.

Many people (including a large proportion of teachers) are opposed to using student test scores in their evaluations, as they feel that the measures are not valid or reliable, and that they will incentivize perverse behavior, such as cheating or competition between teachers. Advocates, on the other hand, argue that student performance is a vital part of teachers’ performance evaluations, and that the growth model estimates, while imperfect, represent the best available option.

I am sympathetic to both views. In fact, in my opinion, there are only two unsupportable positions in this debate: Certainty that using these measures in evaluations will work; and certainty that it won’t. Unfortunately, that’s often how the debate has proceeded – two deeply-entrenched sides convinced of their absolutist positions, and resolved that any nuance in or compromise of their views will only preclude the success of their efforts. You’re with them or against them. The problem is that it’s the nuance – the details – that determine policy effects.

As Atrios said the other day: when you're confronted with a "Clowns to the left of me, Jokers to the right..." column, watch out.

The issue has never been "certainty"- everybody understands that no measure is perfect, and that there will some inevitable flaws in any system of evaluating teachers. The issue is "appropriateness." It is not appropriate to use test scores in high-stakes decision making when everyone - especially Matt - knows the error rates are far too high.

Even if you create an evaluation system where you mitigate for the huge margins of error (60% spreads?! Seriously?!), you're still left with the question of what you're going to do with the teacher's score once you have it. Fire them? Deny seniority? Pay them less or more? How can anyone possibly be for making these high-stakes decisions when they know the error rates are so high?

Let’s be clear about something: I’m not aware of a shred of evidence – not a shred – that the use of growth model estimates in teacher evaluations improves performance of either teachers or students.
Now, don’t get me wrong – there’s no direct evidence that using VA measures has a positive effect because there’s really no evidence at all. This stuff is all very new, and it will take time before researchers get some idea of the effects. There is some newer evidence that well-designed teacher evaluations can have positive effects on teacher performance (see here, for example), but these systems did not include test-based measures. [emphasis mine]

Matt, buddy - that's what we're talking about, isn't it? That's the entire issue. Nobody is against "well-designed teacher evaluations"; we're against poorly-designed ones. Do you think the evaluations people like Michelle Rhee and Chris Christie and Arne Duncan are selling are any good?

Apparently not:

This situation would seem to call for not simple “yes/no” answers, but rather proceeding carefully, using established methods of policy evaluation and design. That is not what is happening. Thanks in large part to Race to the Top, almost half of public school students in the U.S. are now enrolled in states/districts that already have or will soon have incorporated growth estimates into their evaluations. Most (but not all) of these states and districts are mandating that test-based productivity measures comprise incredibly high proportions of evaluation scores, and most have failed to address key issues such as random error and the accuracy of their data collection systems. Many refused to allow for a year or two of piloting these new systems, while few have commissioned independent evaluations of these systems’ effects on achievement and other outcomes, which means that, in most places, we’ll have no rigorous means of assessing the impact of these systems.[emphasis mine]

Given this, is it so extreme to say "don't use test scores to make high-stakes decisions about teachers"? Is that a position that is just as far out of the rational center as saying "fire and pay teachers based on test scores"?

In my view, this failure to address basic issues reflects extreme polarization between the “sides” in this debate. When positions are black and white, details and implementation get the short end of the stick.

Dude, my side isn't implementing ANYTHING! The corporate "reformers" are doing all the implementing! They want to radically change the way teachers are employed, paid, and fired on the basis of this stuff - not us teachers! And they want to do so without any of the caveats you're suggesting.

And yet, you seem to think it's incumbent on us teachers to give a little here:

On the other “side” of the divide, any admission that growth measures might play even a small, responsible role in evaluations risks the dreaded slippery slope, while a cautious acknowledgment that standardized testing data do provide “actionable” information somehow represents a foot in the door for an evil technocratic regime that will sap public education of all its humanity. [emphasis mine]

Matt, if you were here, I'd make you look me in the eye while I say this:

You admit that their side is pushing for a test score-driven method of evaluating teachers that is full of error. You admit that their side is going to make all sort of high-stakes decisions based on this system, even though we all know that is completely inappropriate and will certainly cause great harm to both the teaching corps and the schools in the coming years.

And yet - even though you admit these people are doing something very, very wrong - you want me to give them the benefit of the doubt, concede to piloting their methods, and not assume this is a "slippery slope"?

Matt, you have got to be kidding me.

What do you think will be the outcome of their "studies"? How "independent" do you think the "researchers" who come up with the conclusions will be? We may as well let BP study the damage from the Gulf oil spill; we may as well let Goldman-Sachs determine whether the markets are rigged (actually, I think we may be letting Wall Street do just that...).

These people have already shown their hand, Matt. There's no doubt what these "studies" will conclude. They have made up their minds and are going to cherry-pick whatever they can to conform with their world view.

How do I know this? Simple: they are doing it right now. If you want me to give them the benefit of the doubt, they're going to have to stop their march to implement a program you and I and everyone else knows has not been studied nearly enough and should not be implemented.

What do you think the odds of that happening are, Matt?

8 comments:

Anonymous said...: "Dude, my side isn't implementing ANYTHING!"

Um, pardon me for noticing, but your side has been implementing its own teacher evaluation system for many decades now ("Are you still alive? You get a raise! Did you get a useless masters' degree in educational philosophy? Another raise!"). But we know for sure that your side's system is a a dumbass way to do things.; August 24, 2011 at 7:21:00 PM PDT
DeclineRedux said...: Value-added is so last year. Check out California's new EQI rankings:
http://www.latimes.com/news/opinion/opinionla/la-ed-test-20110824,0,7688499.story; August 24, 2011 at 7:26:00 PM PDT
Duke said...: Almost there, Anon. You have to add something about "our failing schools" to really bring the platitude home.

Oh, and mention the all-powerful unions. That's always helpful, 'cause look at all the power they have right now...

(sigh); August 24, 2011 at 7:36:00 PM PDT
Unknown said...: Thanks, Anaon for reminding me that the know-nothings from the 19th century didn't disappear in the dustbin of history, but have re-appeared in the 21st century as education reformers and critics.; August 27, 2011 at 4:39:00 AM PDT
Anonymous said...: Neither of you smartasses can say a single thing disproving what I said: your side has "implemented" its idea of merit for a long time, with zero evidence that it accomplishes anything.; August 27, 2011 at 11:21:00 AM PDT
Duke said...: Burden of proof, Anon. Look up "argument from ignorance." You have the burden to prove merit pay will work.

Where's your proof?; August 28, 2011 at 6:44:00 AM PDT
Anonymous said...: Did I say "merit pay"? Nope. I'm just saying that it is idiotic on its face not to take test scores into account just because it's not a perfect measure -- OK, it's not perfect, but it sure as hell isn't perfect to completely ignore student learning either.; August 28, 2011 at 9:42:00 AM PDT
Duke said...: And now I know you're just a gainsayer. Read what I wrote - really read it.

The point is that HIGH-STAKES decisions should not be made with a method that has such high error rates.

I suppose the NRC and ETS and RAND and EPI and NEPC and all the rest are "idiotic" because they say the same thing. I guess you know better than all of them, what with their "statistics" and "scientific method" and "peer review" and all that other worthless junk...

No one "ignores" student learning. No one ignores test scores. But tying pay to test scores is error-prone and will do serious harm to the profession. Serious studies show this over and over again.

I've made my case. The burden of proof is on you to show, through REAL evidence (not the pitter-patter of your heart telling you so), that I am incorrect.

Saying I'm an idiot is not proof. Saying I can't prove a negative is not proof.

Let's raise the bar here a little, OK? Otherwise, why are you here?; August 28, 2011 at 8:23:00 PM PDT