Accountability Without Equity: How School Rankings and Testing Penalize Underfunded Schools

Accountability in public education rests on a reasonable premise: schools should be measured on how well they serve their students, results should be transparent, and schools that are failing should face consequences and receive support. The accountability infrastructure built around that premise — standardized testing, school rating systems, report cards, performance rankings — is now embedded in how public education is governed at the federal, state, and local level.

The problem is not the premise. The problem is that the accountability systems built on that premise measure outcomes that correlate strongly with poverty and funding, assign ratings based on those outcomes, and then attach consequences — reduced enrollment, loss of funding, state intervention, public stigma — to the schools that serve the communities with the least. The result is an accountability infrastructure that consistently identifies underfunded schools as failing without addressing the conditions that produce the outcomes being measured.

What Standardized Tests Measure

Standardized tests in reading and mathematics measure a real thing. Students who score higher on well-designed assessments tend to have stronger academic skills than students who score lower, and those skills matter for educational and economic outcomes. The tests are not meaningless.

But test scores also correlate strongly with factors outside the school’s control. Family income is the most consistent predictor of standardized test performance in the research literature — more consistent than teacher quality, school funding, curriculum, or any other school-level variable. Students from higher-income families score higher on average than students from lower-income families, and that gap appears before children enter school and persists through the educational system.

The mechanisms are well-documented. Higher-income families have more resources to invest in early childhood development — books, educational materials, enriching experiences, stable housing, adequate nutrition, access to healthcare. They are less likely to expose children to the chronic stress of poverty, which research has linked to cognitive development and the capacity to learn. Their children arrive at kindergarten with larger vocabularies, stronger pre-literacy skills, and less exposure to the adverse childhood experiences that make learning more difficult.

Schools do not create these conditions. They inherit them. A school serving a high-poverty community starts every year with students who, on average, have had less preparation and more adversity than students in lower-poverty schools. That is not a statement about the potential of those students. It is a statement about the conditions under which they have been living.

How Rating Systems Work

Most state school rating systems assign schools letter grades or numeric scores based primarily on student test performance. Some systems incorporate growth measures — how much students improved over the course of a year — alongside absolute proficiency levels. Some include graduation rates, chronic absenteeism, and other indicators. But in most systems, absolute test score levels remain the dominant factor in school ratings.

The result is predictable and well-documented. Schools in higher-poverty communities receive lower ratings than schools in lower-poverty communities at rates that reflect the poverty-test score correlation more than they reflect differences in school quality. A school where teachers are working effectively with students who face significant challenges outside school will often receive a lower rating than a school where teachers are working with students who arrive with more advantages — not because the first school is doing worse work, but because the measurement system does not adequately separate school contribution from student background.

EdBuild, a nonprofit that studied school funding and accountability before closing in 2020, documented the relationship between poverty, race, and school ratings across multiple states. Their analyses consistently found that schools serving predominantly low-income and minority students received lower ratings regardless of whether those schools were improving, regardless of what resources they had, and regardless of what their students’ growth trajectories looked like. The ratings reflected who attended the school as much as what the school was doing.

How Ratings Affect Enrollment and Funding

School ratings are not administrative designations visible only to policymakers. They are published, ranked, shared on real estate websites, and used by families making housing and school choice decisions. The consequences for rated schools are real and compound in ways that worsen the conditions the ratings are supposed to address.

When a school receives a low rating, families with the means to choose leave — through school choice programs, through moving to a different attendance zone, through private school enrollment. The families most able to leave are typically higher-income families whose presence in a school contributes to the social capital, PTA fundraising, and peer effects that benefit all students. Their departure concentrates poverty in the rated school, which worsens the conditions that produced the low rating in the first place.

Some funding mechanisms respond to enrollment. Schools that lose students lose funding — per-pupil allocations follow the child in most systems. A school that is rated low, loses enrollment, and loses funding is then less able to retain teachers, maintain programs, and address the conditions producing the low rating. The accountability system has created a feedback loop that accelerates decline rather than supporting improvement.

In states and districts with school choice programs, low ratings drive students toward higher-rated schools — which are typically in wealthier communities — removing resources and engaged families from the rated school while overcrowding or straining the receiving schools. The accountability system redistributes students based on ratings without changing the underlying conditions that produced them.

The Consequences Prescribed for Low-Rated Schools

Under NCLB and, to a lesser degree, under ESSA, schools identified as persistently low-performing faced prescribed interventions. The NCLB interventions escalated from supplemental services to choice programs to restructuring — which could mean replacing staff, converting to a charter school, or state takeover. ESSA gave states more flexibility but retained requirements for identifying and intervening in the lowest-performing schools.

The research on these interventions is mixed at best. Mass staff replacement — removing teachers and administrators from a low-performing school and hiring new ones — has not consistently produced improvement and has sometimes produced worse outcomes in the transition period. Charter conversion has produced a wide range of results, with some charter operators improving outcomes and others performing similarly to or worse than the schools they replaced. State takeover has a poor track record — the most documented state takeover programs, in Michigan, Louisiana, and Tennessee, have not consistently produced the improvements their architects anticipated.

What the interventions share is a focus on the school as the unit of failure and the school as the unit of intervention. They do not address the funding inequality that limits what the school can do. They do not address the poverty concentration that produces the test score gaps being measured. They prescribe changes to the school without changing the conditions that shaped the school’s situation.

Teachers and administrators in low-rated schools describe the accountability environment in consistent terms: a focus on tested subjects at the expense of everything else, pressure to produce short-term score gains through test preparation rather than deep learning, demoralization from being publicly labeled as failing, and difficulty recruiting and retaining staff who do not want to work in a school under sanction. The accountability pressure itself becomes a factor in the school’s difficulty — a dynamic that research on school turnaround has documented repeatedly.

The Growth Measure Problem

The most defensible response to the poverty-test score correlation is to measure growth rather than absolute performance — to ask not “how do students at this school perform?” but “how much did students at this school improve?” A school that moves students from the 20th percentile to the 35th percentile is doing real work, even if its students are not proficient by state standards.

Many states have incorporated growth measures into their accountability systems, and this is a genuine improvement. Growth measures do a better job of capturing school contribution independent of student background. Schools serving high-poverty communities score better on growth measures than on absolute proficiency measures, on average.

But growth measures have not displaced absolute proficiency as the primary basis for school ratings in most states. They supplement it. The political and public perception of school quality remains anchored to test score levels, not growth. Real estate listings cite school ratings and test scores, not growth percentiles. Parents making school choice decisions respond to the public-facing ratings, which remain dominated by absolute performance. The technical improvement represented by growth measures has not fully translated into how schools are perceived and what consequences they face.

What This Establishes

The accountability infrastructure is not neutral with respect to funding inequality. It systematically identifies the schools serving the most disadvantaged students as low-performing, attaches stigma and consequences to those schools, and creates dynamics — enrollment loss, funding loss, staff departure — that compound the difficulties those schools face. It does this without addressing the funding conditions that constrain what those schools can do.

That does not mean accountability is wrong. It means accountability without equity is not accountability — it is a measurement system that produces predictable results based on who a school serves, and then treats those results as evidence of school failure rather than evidence of the conditions schools are operating in.

The final article in this hub surveys what reform proposals are designed to do about the structural conditions that produce these outcomes. If you have direct experience with school rating systems, testing accountability, or the consequences of low school ratings for a community — as a teacher, administrator, student, or parent — the forum is where that experience belongs.


This article was researched and drafted with AI assistance under human review. See our full AI and editorial practices.