CURMUDGUCATION: More Evidence That Tests Measure SES

Thursday, November 19, 2015

More Evidence That Tests Measure SES

Want more proof, again, some more, of the connection between socio-economic status and standardized test results? Twitter follower Joseph Robertshaw pointed me at a pair of studies by Randy Hoover, PhD, at the Department of Teacher Education, Beeghly College of Education, Youngstown State University.

Hoover is now a professor emeritus, but the validity of standardized testing and the search for a valid and reliable accountability system. He now runs a website called the Teacher Advocate and it's worth a look.

Hoover released two studies-- one in 2000, and one in 2007-- that looked at the validity of the Ohio Achievement Tests and the Ohio Graduate Test, and while there are no surprises here, you can add these to your file of scientific debunking of standardized testing. We're just going to look at the 2007 study, which was in part intended to check on the results of the 2000 study.

The bottom line of the earlier study appears right up front in the first paragraph of the 2007 paper:

The primary finding of this previous study was that student performance on the tests was most significantly (r = 0.80) affected by the non-school variables within the student social-economic living conditions. Indeed, the statistical significance of the predictive power of SES led to the inescapable conclusion that the tests had no academic accountability or validity whatsoever.

The 2007 study wanted to re-examine the findings, check the fairness and validity of the tests, and draw conclusions about what those findings meant to the Ohio School Report Card.

So what did Hoover find? Well, mostly that he was right the first time. He does take the time to offer a short lesson in statistical correlation analysis, which will be helpful if, like me, you are not a research scholar. Basically, the thing to remember is that a perfect correlation is 1.0 (or -1,0). So, getting punched in the nose correlates about 1.0 to feeling pain.

Hoover is out to find the correlation between what he calls the students' "lived experience" to district level performance is 0.78. Which is high.

If you like scatterplot charts (calling Jersey Jazzman), then Hoover has some of those for you, all driving home the same point. For instance, here's one looking at the percent of economically disadvantaged students as a predictor of district performance.

That's an r value of -0.75, which means you can do a pretty good job of predicting how a district will do based on how few or many economically disadvantaged students there are.

Hoover crunched together three factors to create what he calls a Lived Experience Index that shows, in fact, a 0.78 r value. Like Chris Tienken, Hoover has shown that we can pretty well assign a school or district a rating based on their demographics and just skip the whole testing business entirely.

Hoover takes things a step further, and reverse-maths the results to a plot of results with his live experience index factored out-- a sort of crude VAM sauce. He has a chart for those results, showing that there are poor schools performing well and rich schools performing poorly. Frankly, I think he's probably on shakier ground here, but it does support his conclusion about the Ohio school accountability system of the time to be "grossly misleading at best and grossly unfair at worst," a system that "perpetuates the political fiction that poor children can't learn and teachers in schools with poor children can't teach."

That was back in 2007, so some of the landscape such as the Ohio school accountability system (well, public school accountability-- Ohio charters are apparently not accountable to anybody) has changed, along with many reformster advances of the past eight years.

But this research does stand as one more data point regarding standardized tests and their ability to measure SES far better than they measure anything else.

25 comments:

AnonymousNovember 19, 2015 at 12:48 PM
One problem here is that median family income also predicts teacher turnover and teacher experience, things that Hoover does not make any attempt to control for. His results could easily be consistent with those high rates of teacher turnover and low level of experience driving low test scores.

I am curious to see if people here think that having high turnover and low levels of experience in the classroom have a negative impact on learning?
ReplyDelete
Replies
UnknownNovember 19, 2015 at 1:17 PM
My guess, TE, is that most or all of us think that "high rates of teacher turnover" and "low level of experience" are not good for kids -- this is, for most of us, probably the #1 reason to adamantly oppose VAM. Why would a good smart teacher choose to work in the low SES schools under a VAM system ? They wouldn't. Even someone like me, who basically wants to "do good" would not choose to end their careers prematurely by working in a low SES school.
ReplyDelete
Replies
alanbackmanNovember 19, 2015 at 3:44 PM
Hoover and now Greene miss the point. And given some of their familiarity with basic statistics, the miss must be intentional. Of course, SES correlates with standardized test scores. But that's not the point. SES is not a dependent variable but rather a metric within which to look at the underlying data. It's like saying that the price of a car correlates with horsepower. That is probably true as well, though again, it's not relevant since it's unlikely that someone shopping for a $25,000 car is also shopping for one that costs $100,000.

Instead, the more relevant question is what is the range of horsepower for cars that cost under $25,000. And can we learn anything from this ? Similarly, what is the range of test scores from those in the bottom quintile of SES ? And again, what can we learn from this ? In fact, there is an entire branch of statistics which works for this kind of analysis called Bayesian statistics.

Greene and Hoover likely don't want to look at things this way because they would see the same types of things that CREDO and Mathematica have found. If you look within the cohort of urban students (as an example conditioned on geography rather than income), you find that charter schools routinely outperform traditional schools. See links below.

Again, just like the person buying the $25,000 car is not buying a $100,000 car, the poor family in the Bronx is choosing between a failing traditional school and a charter school rather than an affluent school in Greenwich.

Stanford University's CREDO - "Across the 41 cities studied, students in charter schools learned significantly more than their peers attending traditional public schools – 40 more days worth of learning in math, and 28 more in reading."http://www.usnews.com/opinion/knowledge-bank/2015/03/19/new-study-shows-charter-schools-making-a-difference-in-cities

Mathematica - "In our exploratory analysis, for example, we found that study charter schools serving more low income or low achieving students had statistically significant positive effects on math test scores"http://www.mathematica-mpr.com/~/media/publications/PDFs/education/charter_school_impacts.pdf
ReplyDelete
Replies
alanbackmanNovember 19, 2015 at 3:48 PM
Hoover and now Greene miss the point. And given some of their familiarity with basic statistics, the miss must be intentional. Of course, SES correlates with standardized test scores. But that's not the point. SES is not a dependent variable but rather a metric within which to look at the underlying data. It's like saying that the price of a car correlates with horsepower. That is probably true as well, though again, it's not relevant since it's unlikely that someone shopping for a $25,000 car is also shopping for one that costs $100,000.

Instead, the more relevant question is what is the range of horsepower for cars that cost under $25,000. And can we learn anything from this ? Similarly, what is the range of test scores from those in the bottom quintile of SES ? And again, what can we learn from this ? In fact, there is an entire branch of statistics which works for this kind of analysis called Bayesian statistics.

Greene and Hoover likely don't want to look at things this way because they would see the same types of things that CREDO and Mathematica have found. If you look within the cohort of urban students (as an example conditioned on geography rather than income), you find that charter schools routinely outperform traditional schools. See links below.

Again, just like the person buying the $25,000 car is not buying a $100,000 car, the poor family in the Bronx is choosing between a failing traditional school and a charter school rather than an affluent school in Greenwich.

Stanford University's CREDO - "Across the 41 cities studied, students in charter schools learned significantly more than their peers attending traditional public schools – 40 more days worth of learning in math, and 28 more in reading."http://www.usnews.com/opinion/knowledge-bank/2015/03/19/new-study-shows-charter-schools-making-a-difference-in-cities

Mathematica - "In our exploratory analysis, for example, we found that study charter schools serving more low income or low achieving students had statistically significant positive effects on math test scores"http://www.mathematica-mpr.com/~/media/publications/PDFs/education/charter_school_impacts.pdf
ReplyDelete
Replies
Mike MackennaNovember 19, 2015 at 8:36 PM
For the sake of argument, let's concede that charters get kids to score higher on tests. Two questions about this: 1. So what? Is scoring higher on standardized tests proven to lead to anything important? 2. Many charters get those higher scores by insisting on militaristic discipline and narrowing the curriculum. Do we want kids to be taught blind obedience in a limited curriculum just to score high on a test?
ReplyDelete
Replies

Add comment

Pages

Thursday, November 19, 2015

More Evidence That Tests Measure SES

25 comments: