10 Year Assessment of Education Reform in DC — a Fraud, says its former director of assessments, Richard Phelps

Stolen from Valerie Jablow:

Looking Back on DC Education Reform 10 Years Later, Part 1: The Grand Tourby Valerie Jablow

[Ed. Note: As DC’s office of the state superintendent of education (OSSE) seeks a waiver of PARCC testing again (recall that OSSE waived PARCC last school year due to the pandemic) and the DC auditor just released a bombshell report of poor stewardship of DC’s education data, it is time to revisit how standardized test data, teacher evaluations, and harsh school penalties were united by ed reformers in DCPS under mayoral control. This first-hand account of what went down in DCPS, the first of two parts by semi-retired educator Richard P. Phelps, appeared in Nonpartisan Education Review in September 2020 and is reprinted here with permission. The author thanks DC budget expert Mary Levy and retired DCPS teacher Erich Martel for their helpful comments in the research of this article.]

By Richard P. Phelps

Ten years ago, I worked as the director of assessments for DCPS. My tenure coincided with Michelle Rhee’s last nine months as chancellor. I departed shortly after Vincent Gray defeated Adrian Fenty in the September 2010 DC mayoral primary.

My primary task was to design an expansion of that testing program that served the IMPACT teacher evaluation system to include all core subjects and all grade levels. Despite its fame (or infamy), the test score aspect of the IMPACT program affected only 13% of teachers, those teaching either reading or math in grades four through eight. Only those subjects and grade levels included the requisite pre- and post-tests required for teacher “value added” measurements (VAM). Not included were most subjects (e.g., science, social studies, art, music, physical education), grades kindergarten to two, and high school.

Chancellor Rhee wanted many more teachers included. So, I designed a system that would cover more than half the DCPS teacher force, from kindergarten through high school. You haven’t heard about it because it never happened. The newly elected Vincent Gray had promised during his mayoral campaign to reduce the amount of testing; the proposed expansion would have increased it fourfold.

VAM affected teachers’ jobs. A low value-added score could lead to termination; a high score, to promotion and a cash bonus. VAM as it was then structured was obviously, glaringly flawed, as anyone with a strong background in educational testing could have seen. Unfortunately, among the many new central office hires from the elite of ed reform circles, none had such a background. (Even a primary grades teacher with the same group of students the entire school day had those students for less than six hours a day, five days a week, for less than half the year. All told, even in the highest exposure circumstances, a teacher interacted with the same group of students for less than a tenth of each student’s waking hours in a year, and for less than a twentieth in the tested subjects of English and math. In the lowest exposure circumstance, a high school teacher might interact with a class of English or math students for less than three percent of a student’s annual hours.

Before posting a request for proposals from commercial test developers for the testing expansion plan, I was instructed to survey two groups of stakeholders—central office managers and school-level teachers and administrators.

Not surprisingly, some of the central office managers consulted requested additions or changes to the proposed testing program where they thought it would benefit their domain of responsibility. The net effect on school-level personnel would have been to add to their administrative burden. Nonetheless, all requests from central office managers would be honored.

The Grand Tour

At about the same time, over several weeks of the late spring and early summer of 2010, along with a bright summer intern, I visited a dozen DCPS schools. The alleged purpose was to collect feedback on the design of the expanded testing program. I enjoyed these meetings. They were informative, animated, and very well attended. School staff appreciated the apparent opportunity to contribute to policy decisions and tried to make the most of it.

Each school greeted us with a full complement of faculty and staff on their days off, numbering a several dozen educators at some venues. They believed what we had told them: that we were in the process of redesigning the DCPS assessment program and were genuinely interested in their suggestions for how best to do it.

At no venue did we encounter stand-pat knee-jerk rejection of education reform efforts. Some educators were avowed advocates for the Rhee administration’s reform policies, but most were basically dedicated educators determined to do what was best for their community within the current context.

The Grand Tour was insightful, too. I learned for the first time of certain aspects of DCPS’s assessment system that were essential to consider in its proper design, aspects of which the higher-ups in the DCPS Central Office either were not aware or did not consider relevant.

The group of visited schools represented DCPS as a whole in appropriate proportions geographically, ethnically, and by education level (i.e., primary, middle, and high). Within those parameters, however, only schools with “friendly” administrations were chosen. That is, we only visited schools with principals and staff openly supportive of the Rhee-Henderson agenda.

But even they desired changes to the testing program, whether or not it was expanded. Their suggestions covered both the annual districtwide DC-CAS (or “comprehensive” assessment system), on which the teacher evaluation system was based, and the DC-BAS (or “benchmarking” assessment system), a series of four annual “no-stakes” interim tests unique to DCPS, ostensibly offered to help prepare students and teachers for the consequential-for-some-school-staff DC-CAS. (Though officially “no stakes,” some principals analyzed results from the DC-BAS to identify students whose scores lay just under the next higher benchmark and encouraged teachers to focus their instructional efforts on them. Moreover, at the high school level, where testing occurred only in grade 10, students who performed poorly on the DC-BAS might be artificially re-classified as held-back 9th graders or advanced prematurely to 11th grade in order to avoid the DC-CAS.)

At each staff meeting I asked for a show of hands on several issues of interest that I thought were actionable. Some suggestions for program changes received close to unanimous support. Allow me to describe several.

***Move DC-CAS test administration later in the school year. Many citizens may have logically assumed that the IMPACT teacher evaluation numbers were calculated from a standard pre-post test schedule, testing a teacher’s students at the beginning of their academic year together and then again at the end. In 2010, however, the DC-CAS was administered in March, three months before school year end. Moreover, that single administration of the test served as both pre- and post-test, posttest for the current school year and pretest for the following school year. Thus, before a teacher even met their new students in late August or early September, almost half of the year for which teachers were judged had already transpired—the three months in the spring spent with the previous year’s teacher and almost three months of summer vacation.

School staff recommended pushing DC-CAS administration to later in the school year. Furthermore, they advocated a genuine pre-post-test administration schedule—pre-test the students in late August–early September and post-test them in late-May–early June—to cover a teacher’s actual span of time with the students.

This suggestion was rejected because the test development firm with the DC-CAS contract required three months to score some portions of the test in time for the IMPACT teacher ratings scheduled for early July delivery, before the start of the new school year. Some small number of teachers would be terminated based on their IMPACT scores, so management demanded those scores be available before preparations for the new school year began. (Even a primary grades teacher with the same group of students the entire school day had those students for less than six hours a day, five days a week, for less than half the year. All told, even in the highest exposure circumstances, a teacher interacted with the same group of students for less than a tenth of each student’s waking hours in a year, and for less than a twentieth in the tested subjects of English and math. In the lowest exposure circumstance, a high school teacher might interact with a class of English or math students for less than three percent of a student’s annual hours.)

The tail wagged the dog.

***Add some stakes to the DC-CAS in the upper grades. Because DC-CAS test scores portended consequences for teachers but none for students, some students expended little effort on the test. Indeed, extensive research on “no-stakes” (for students) tests reveal that motivation and effort vary by a range of factors including gender, ethnicity, socioeconomic class, the weather, and age. Generally, the older the student, the lower the test-taking effort. This disadvantaged some teachers in the IMPACT ratings for circumstances beyond their control: unlucky student demographics.

Central office management rejected this suggestion to add even modest stakes to the upper grades’ DC-CAS; no reason given.

***Move one of the DC-BAS tests to year end. If management rejected the suggestion to move DC-CAS test administration to the end of the school year, school staff suggested scheduling one of the no-stakes DC-BAS benchmarking tests for late May–early June. As it was, the schedule squeezed all four benchmarking test administrations between early September and mid-February. Moving just one of them to the end of the year would give the following year’s teachers a more recent reading (by more than three months) of their new students’ academic levels and needs.

Central office management rejected this suggestion probably because the real purpose of the DC-BAS was not to help teachers understand their students’ academic levels and needs, as the following will explain.

***Change DC-BAS tests so they cover recently taught content. Many DC citizens probably assumed that, like most tests, the DC-BAS interim tests covered recently taught content, such as that covered since the previous test administration. Not so in 2010. The first annual DC-BAS was administered in early September, just after the year’s courses commenced. Moreover, it covered the same content domain—that for the entirety of the school year—as each of the next three DC-BAS tests.

School staff proposed changing the full-year “comprehensive” content coverage of each DC-BAS test to partial-year “cumulative” coverage, so students would only be tested on what they had been taught prior to each test administration.

This suggestion, too, was rejected. Testing the same full-year comprehensive content domain produced a predictable, flattering score rise. With each DC-BAS test administration, students recognized more of the content, because they had just been exposed to more of it, so average scores predictably rose. With test scores always rising, it looked like student achievement improved steadily each year. Achieving this contrived score increase required testing students on some material to which they had not yet been exposed, both a violation of professional testing standards and a poor method for instilling student confidence. (Of course, it was also less expensive to administer essentially the same test four times a year than to develop four genuinely different tests.)

***Synchronize the sequencing of curricular content across the District. DCPS management rhetoric circa 2010 attributed classroom-level benefits to the testing program. Teachers would know more about their students’ levels and needs and could also learn from each other. Yet, the only student test results teachers received at the beginning of each school year was half-a-year old, and most of the information they received over the course of four DC-BAS test administrations was based on not-yet-taught content.

As for cross-district teacher cooperation, unfortunately there was no cross-District coordination of common curricular sequences. Each teacher paced their subject matter however they wished and varied topical emphases according to their own personal preference.

It took DCPS’s chief academic officer, Carey Wright, and her chief of staff, Dan Gordon, less than a minute to reject the suggestion to standardize topical sequencing across schools so that teachers could consult with one another in real time. Tallying up the votes: several hundred school-level District educators favored the proposal, two of Rhee’s trusted lieutenants opposed it. It lost.

***Offer and require a keyboarding course in the early grades. DCPS was planning to convert all its testing from paper-and-pencil mode to computer delivery within a few years. Yet, keyboarding courses were rare in the early grades. Obviously, without systemwide keyboarding training in computer use some students would be at a disadvantage in computer testing.

Suggestion rejected.

In all, I had polled over 500 DCPS school staff. Not only were all of their suggestions reasonable, some were essential in order to comply with professional assessment standards and ethics.

Nonetheless, back at DCPS’s central office, each suggestion was rejected without, to my observation, any serious consideration. The rejecters included Chancellor Rhee, the head of the office of data and accountability—the self-titled “Data Lady,” Erin McGoldrick—and the head of the curriculum and instruction division, Carey Wright, and her chief deputy, Dan Gordon.

Four central office staff outvoted several hundred school staff (and my recommendations as assessment director). In each case, the changes recommended would have meant some additional work on their parts, but in return for substantial improvements in the testing program. Their rhetoric was all about helping teachers and students; but the facts were that the testing program wasn’t structured to help them.

What was the purpose of my several weeks of school visits and staff polling? To solicit “buy in” from school level staff, not feedback.

Ultimately, the new testing program proposal would incorporate all the new features requested by senior central office staff, no matter how burdensome, and not a single feature requested by several hundred supportive school-level staff, no matter how helpful. Like many others, I had hoped that the education reform intention of the Rhee-Henderson years was genuine. DCPS could certainly have benefitted from some genuine reform.

Alas, much of the activity labelled “reform” was just for show, and for padding resumes. Numerous central office managers would later work for the Bill and Melinda Gates Foundation. Numerous others would work for entities supported by the Gates or aligned foundations, or in jurisdictions such as Louisiana, where ed reformers held political power. Most would be well paid.

Their genuine accomplishments, or lack thereof, while at DCPS seemed to matter little. What mattered was the appearance of accomplishment and, above all, loyalty to the group. That loyalty required going along to get along: complicity in maintaining the façade of success while withholding any public criticism of or disagreement with other in-group members.

Unfortunately, in the United States what is commonly showcased as education reform is neither a civic enterprise nor a popular movement. Neither parents, the public, nor school-level educators have any direct influence. Rather, at the national level, U.S. education reform is an elite, private club—a small group of tightly connected politicos and academics—a mutual admiration society dedicated to the career advancement, political influence, and financial benefit of its members, supported by a gaggle of wealthy foundations (e.g., Gates, Walton, Broad, Wallace, Hewlett, Smith-Richardson).

For over a decade, The Ed Reform Club exploited DC for its own benefit. Local elite formed the DC Public Education Fund (DCPEF) to sponsor education projects, such as IMPACT, which they deemed worthy. In the negotiations between the Washington Teachers’ Union and DCPS concluded in 2010, DCPEF arranged a 3-year grant of $64.5 million from the Arnold, Broad, Robertson, and Walton foundations to fund a 5-year retroactive teacher pay raise in return for contract language allowing teacher excessing tied to IMPACT, which Rhee promised would lead to annual student test score increases by 2012. Projected goals were not met; foundation support continued nonetheless.

Michelle Johnson (nee Rhee) chaired the board of a charter school chain in California and occasionally collects $30,000+ in speaker fees but, otherwise, seems to have deliberately withdrawn from the limelight. Despite contributing her own additional scandals after she assumed the DCPS chancellorship, Kaya Henderson ascended to great fame and glory with a “distinguished professorship” at Georgetown; honorary degrees from Georgetown and Catholic universities; gigs with the Chan Zuckerberg Initiative, Broad Leadership Academy, and Teach for All; and board memberships with The Aspen Institute, The College Board, Robin Hood NYC, and Teach For America. Carey Wright is now state superintendent in Mississippi. Dan Gordon runs a 30-person consulting firm, Education Counsel, which strategically partners with major players in U.S. education policy. The manager of the IMPACT teacher evaluation program, Jason Kamras, now works as superintendent of the Richmond, VA public schools.

Arguably the person most directly responsible for the recurring assessment system fiascos of the Rhee-Henderson years, then chief of data and accountability Erin McGoldrick, now specializes in “data innovation” as partner and chief operating officer at an education management consulting firm. Her firm, Kitamba, strategically partners with its own panoply of major players in U.S. education policy. Its list of recent clients includes the DC Public Charter School Board and DCPS.

If the ambitious DC central office folk who gaudily declared themselves leading education reformers were not really, who were the genuine education reformers during the Rhee-Henderson decade of massive upheaval and per-student expenditures three times those in the state of Utah? They were the school principals and staff whose practical suggestions were ignored by central office glitterati. They were whistleblowers like history teacher Erich Martel, who had documented DCPS’s manipulation of student records and phony graduation rates years before the investigation of Ballou High School and was demoted and then “excessed” by Henderson. Or school principal Adell Cothorne, who spilled the beans on test answer sheet “erasure parties” at Noyes Education Campus and lost her job under Rhee.

Real reformers with “skin in the game” can’t play it safe.Valerie Jablow | March 11, 2021 at 4:28 pm | Categories: Uncategorized | URL: https://wp.me/p6Dj0P-3TL

Part Two: Cheating in DCPS

DC Education Reform Ten Years After, 

Part 2: Test Cheats

Richard P Phelps

Ten years ago, I worked as the Director of Assessments for the District of Columbia Public Schools (DCPS). For temporal context, I arrived after the first of the infamous test cheating scandals and left just before the incident that spawned a second. Indeed, I filled a new position created to both manage test security and design an expanded testing program. I departed shortly after Vincent Gray, who opposed an expanded testing program, defeated Adrian Fenty in the September 2010 DC mayoral primary. My tenure coincided with Michelle Rhee’s last nine months as Chancellor. 

The recurring test cheating scandals of the Rhee-Henderson years may seem extraordinary but, in fairness, DCPS was more likely than the average US school district to be caught because it received a much higher degree of scrutiny. Given how tests are typically administered in this country, the incidence of cheating is likely far greater than news accounts suggest, for several reasons: 

·      in most cases, those who administer tests—schoolteachers and administrators—have an interest in their results;

·      test security protocols are numerous and complicated yet, nonetheless, the responsibility of non-expert ordinary school personnel, guaranteeing their inconsistent application across schools and over time; 

·      after-the-fact statistical analyses are not legal proof—the odds of a certain amount of wrong-to-right erasures in a single classroom on a paper-and-pencil test being coincidental may be a thousand to one, but one-in-a-thousand is still legally plausible; and

·      after-the-fact investigations based on interviews are time-consuming, scattershot, and uneven. 

Still, there were measures that the Rhee-Henderson administrations could have adopted to substantially reduce the incidence of cheating, but they chose none that might have been effective. Rather, they dug in their heels, insisted that only a few schools had issues, which they thoroughly resolved, and repeatedly denied any systematic problem.  

Cheating scandals

From 2007 to 2009 rumors percolated of an extraordinary level of wrong-to-right erasures on the test answer sheets at many DCPS schools. “Erasure analysis” is one among several “red flag” indicators that testing contractors calculate to monitor cheating. The testing companies take no responsibility for investigating suspected test cheating, however; that is the customer’s, the local or state education agency. 

In her autobiographical account of her time as DCPS Chancellor, Michelle Johnson (nee Rhee), wrote (p. 197)

“For the first time in the history of DCPS, we brought in an outside expert to examine and audit our system. Caveon Test Security – the leading expert in the field at the time – assessed our tests, results, and security measures. Their investigators interviewed teachers, principals, and administrators.

“Caveon found no evidence of systematic cheating. None.”

Caveon, however, had not looked for “systematic” cheating. All they did was interview a few people at several schools where the statistical anomalies were more extraordinary than at others. As none of those individuals would admit to knowingly cheating, Caveon branded all their excuses as “plausible” explanations. That’s it; that is all that Caveon did. But, Caveon’s statement that they found no evidence of “widespread” cheating—despite not having looked for it—would be frequently invoked by DCPS leaders over the next several years.[1]

Incidentally, prior to the revelation of its infamous decades-long, systematic test cheating, the Atlanta Public Schools had similarly retained Caveon Test Security and was, likewise, granted a clean bill of health. Only later did the Georgia state attorney general swoop in and reveal the truth. 

In its defense, Caveon would note that several cheating prevention measures it had recommended to DCPS were never adopted.[2] None of the cheating prevention measures that I recommended were adopted, either.

The single most effective means for reducing in-classroom cheating would have been to rotate teachers on test days so that no teacher administered a test to his or her own students. It would not have been that difficult to randomly assign teachers to different classrooms on test days.

The single most effective means for reducing school administratorcheating would have been to rotate test administrators on test days so that none managed the test materials for their own schools. The visiting test administrators would have been responsible for keeping test materials away from the school until test day, distributing sealed test booklets to the rotated teachers on test day, and for collecting re-sealed test booklets at the end of testing and immediately removing them from the school. 

Instead of implementing these, or a number of other feasible and effective test security measures, DCPS leaders increased the number of test proctors, assigning each of a few dozen or so central office staff a school to monitor. Those proctors could not reasonably manage the volume of oversight required. A single DC test administration could encompass a hundred schools and a thousand classrooms.

Investigations

So, what effort, if any, did DCPS make to counter test cheating? They hired me, but then rejected all my suggestions for increasing security. Also, they established a telephone tip line. Anyone who suspected cheating could report it, even anonymously, and, allegedly, their tip would be investigated. 

Some forms of cheating are best investigated through interviews. Probably the most frequent forms of cheating at DCPS—teachers helping students during test administrations and school administrators looking at test forms prior to administration—leave no statistical residue. Eyewitness testimony is the only type of legal evidence available in such cases, but it is not just inconsistent, it may be socially destructive. 

I remember two investigations best: one occurred in a relatively well-to-do neighborhood with well-educated parents active in school affairs; the other in one of the city’s poorest neighborhoods. Superficially, the cases were similar—an individual teacher was accused of helping his or her own students with answers during test administrations. Making a case against either elementary school teacher required sworn testimony from eyewitnesses, that is, students—eight-to-ten-year olds. 

My investigations, then, consisted of calling children into the principal’s office one-by-one to be questioned about their teacher’s behavior. We couldn’t hide the reason we were asking the questions. And, even though each student agreed not to tell others what had occurred in their visit to the principal’s office, we knew we had only one shot at an uncorrupted jury pool. 

Though the accusations against the two teachers were similar and the cases against them equally strong, the outcomes could not have been more different. In the high-poverty neighborhood, the students seemed suspicious and said little; none would implicate the teacher, whom they all seemed to like. 

In the more prosperous neighborhood, students were more outgoing, freely divulging what they had witnessed. The students had discussed the alleged coaching with their parents who, in turn, urged them to tell investigators what they knew. During his turn in the principal’s office, the accused teacher denied any wrongdoing. I wrote up each interview, then requested that each student read and sign. 

Thankfully, that accused teacher made a deal and left the school system a few weeks later. Had he not, we would have required the presence in court of the eight-to-ten-year olds to testify under oath against their former teacher, who taught multi-grade classes. Had that prosecution not succeeded, the eyewitness students could have been routinely assigned to his classroom the following school year.

My conclusion? Only in certain schools is the successful prosecution of a cheating teacher through eyewitness testimony even possible. But, even where possible, it consumes inordinate amounts of time and, otherwise, comes at a high price, turning young innocents against authority figures they naturally trusted. 

Cheating blueprints

Arguably the most widespread and persistent testing malfeasance in DCPS received little attention from the press. Moreover, it was directly propagated by District leaders, who published test blueprints on the web. Put simply, test “blueprints” are lists of the curricular standards (e.g., “student shall correctly add two-digit numbers”) and the number of test items included in an upcoming test related to each standard. DC had been advance publishing its blueprints for years.

I argued that the way DC did it was unethical. The head of the Division of Data & Accountability, Erin McGoldrick, however, defended the practice, claimed it was common, and cited its existence in the state of California as precedent. The next time she and I met for a conference call with one of DCPS’s test providers, Discover Education, I asked their sales agent how many of their hundreds of other customers advance-published blueprints. His answer: none.

In the state of California, the location of McGoldrick’s only prior professional experience, blueprints were, indeed, published in advance of test administrations. But their tests were longer than DC’s and all standards were tested. Publication of California’s blueprints served more to remind the populace what the standards were in advance of each test administration. Occasionally, a standard considered to be of unusual importance might be assigned a greater number of test items than the average, and the California blueprints signaled that emphasis. 

In Washington, DC, the tests used in judging teacher performance were shorter, covering only some of each year’s standards. So, DC’s blueprints showed everyone well in advance of the test dates exactly which standards would be tested and which would not. For each teacher, this posed an ethical dilemma: should they “narrow the curriculum” by teaching only that content they knew would be tested? Or, should they do the right thing and teach all the standards, as they were legally and ethically bound to, even though it meant spending less time on the to-be-tested content? It’s quite a conundrum when one risks punishment for behaving ethically.

Monthly meetings convened to discuss issues with the districtwide testing program, the DC Comprehensive Assessment System (DC-CAS)—administered to comply with the federal No Child Left Behind (NCLB) Act. All public schools, both DCPS and charters, administered those tests. At one of these regular meetings, two representatives from the Office of the State Superintendent of Education (OSSE) announced plans to repair the broken blueprint process.[3]

The State Office employees argued thoughtfully and reasonably that it was professionally unethical to advance publish DC test blueprints. Moreover, they had surveyed other US jurisdictions in an effort to find others that followed DC’s practice and found none. I was the highest-ranking DCPS employee at the meeting and I expressed my support, congratulating them for doing the right thing. I assumed that their decision was final.

I mentioned the decision to McGoldrick, who expressed surprise and speculation that it might have not been made at the highest level in the organizational hierarchy. Wasting no time, she met with other DCPS senior managers and the proposed change was forthwith shelved. In that, and other ways, the DCPS tail wagged the OSSE dog. 

* * *

It may be too easy to finger ethical deficits for the recalcitrant attitude toward test security of the Rhee-Henderson era ed reformers. The columnist Peter Greene insists that knowledge deficits among self-appointed education reformers also matter: 

“… the reformistan bubble … has been built from Day One without any actual educators inside it. Instead, the bubble is populated by rich people, people who want rich people’s money, people who think they have great ideas about education, and even people who sincerely want to make education better. The bubble does not include people who can turn to an Arne Duncan or a Betsy DeVos or a Bill Gates and say, ‘Based on my years of experience in a classroom, I’d have to say that idea is ridiculous bullshit.’”

“There are a tiny handful of people within the bubble who will occasionally act as bullshit detectors, but they are not enough. The ed reform movement has gathered power and money and set up a parallel education system even as it has managed to capture leadership roles within public education, but the ed reform movement still lacks what it has always lacked–actual teachers and experienced educators who know what the hell they’re talking about.”

In my twenties, I worked for several years in the research department of a state education agency. My primary political lesson from that experience, consistently reinforced subsequently, is that most education bureaucrats tell the public that the system they manage works just fine, no matter what the reality. They can get away with this because they control most of the evidence and can suppress it or spin it to their advantage.

In this proclivity, the DCPS central office leaders of the Rhee-Henderson era proved themselves to be no different than the traditional public-school educators they so casually demonized. 

US school systems are structured to be opaque and, it seems, both educators and testing contractors like it that way. For their part, and contrary to their rhetoric, Rhee, Henderson, and McGoldrick passed on many opportunities to make their system more transparent and accountable.

Education policy will not improve until control of the evidence is ceded to genuinely independent third parties, hired neither by the public education establishment nor by the education reform club.

The author gratefully acknowledges the fact-checking assistance of Erich Martel and Mary Levy.

Access this testimonial in .pdf format

Citation:  Phelps, R. P. (2020, September). Looking Back on DC Education Reform 10 Years After, Part 2: Test Cheats. Nonpartisan Education Review / Testimonials. https://nonpartisaneducation.org/Review/Testimonials/v16n3.htm


[1] A perusal of Caveon’s website clarifies that their mission is to help their clients–state and local education departments–not get caught. Sometimes this means not cheating in the first place; other times it might mean something else. One might argue that, ironically, Caveon could be helping its clients to cheat in more sophisticated ways and cover their tracks better.

[2] Among them: test booklets should be sealed until the students open them and resealed by the students immediately after; and students should be assigned seats on test day and a seating chart submitted to test coordinators (necessary for verifying cluster patterns in student responses that would suggest answer copying).

[3] Yes, for those new to the area, the District of Columbia has an Office of the “State” Superintendent of Education (OSSE). Its domain of relationships includes not just the regular public schools (i.e., DCPS), but also other public schools (i.e., charters) and private schools. Practically, it primarily serves as a conduit for funneling money from a menagerie of federal education-related grant and aid programs

What did Education Reform in DC Actually Mean?

Short answer: nothing that would actually help students or teachers. But it’s made for well-padded resumes for a handful of insiders.

This is an important review, by the then-director of assessment. His criticisms echo the points that I have been making along with Mary Levy, Erich Martel, Adell Cothorne, and many others.

Nonpartisan Education Review / Testimonials

Access this testimonial in .pdf format

Looking Back on DC Education Reform 10 Years After, 

Part 1: The Grand Tour

Richard P Phelps

Ten years ago, I worked as the Director of Assessments for the District of Columbia Public Schools (DCPS). My tenure coincided with Michelle Rhee’s last nine months as Chancellor. I departed shortly after Vincent Gray defeated Adrian Fenty in the September 2010 DC mayoral primary

My primary task was to design an expansion of that testing program that served the IMPACT teacher evaluation system to include all core subjects and all grade levels. Despite its fame (or infamy), the test score aspect of the IMPACT program affected only 13% of teachers, those teaching either reading or math in grades four through eight. Only those subjects and grade levels included the requisite pre- and post-tests required for teacher “value added” measurements (VAM). Not included were most subjects (e.g., science, social studies, art, music, physical education), grades kindergarten to two, and high school.

Chancellor Rhee wanted many more teachers included. So, I designed a system that would cover more than half the DCPS teacher force, from kindergarten through high school. You haven’t heard about it because it never happened. The newly elected Vincent Gray had promised during his mayoral campaign to reduce the amount of testing; the proposed expansion would have increased it fourfold.

VAM affected teachers’ jobs. A low value-added score could lead to termination; a high score, to promotion and a cash bonus. VAM as it was then structured was obviously, glaringly flawed,[1] as anyone with a strong background in educational testing could have seen. Unfortunately, among the many new central office hires from the elite of ed reform circles, none had such a background.

Before posting a request for proposals from commercial test developers for the testing expansion plan, I was instructed to survey two groups of stakeholders—central office managers and school-level teachers and administrators.

Not surprisingly, some of the central office managers consulted requested additions or changes to the proposed testing program where they thought it would benefit their domain of responsibility. The net effect on school-level personnel would have been to add to their administrative burden. Nonetheless, all requests from central office managers would be honored. 

The Grand Tour

At about the same time, over several weeks of the late Spring and early Summer of 2010, along with a bright summer intern, I visited a dozen DCPS schools. The alleged purpose was to collect feedback on the design of the expanded testing program. I enjoyed these meetings. They were informative, animated, and very well attended. School staff appreciated the apparent opportunity to contribute to policy decisions and tried to make the most of it.

Each school greeted us with a full complement of faculty and staff on their days off, numbering a several dozen educators at some venues. They believed what we had told them: that we were in the process of redesigning the DCPS assessment program and were genuinely interested in their suggestions for how best to do it. 

At no venue did we encounter stand-pat knee-jerk rejection of education reform efforts. Some educators were avowed advocates for the Rhee administration’s reform policies, but most were basically dedicated educators determined to do what was best for their community within the current context. 

The Grand Tour was insightful, too. I learned for the first time of certain aspects of DCPS’s assessment system that were essential to consider in its proper design, aspects of which the higher-ups in the DCPS Central Office either were not aware or did not consider relevant. 

The group of visited schools represented DCPS as a whole in appropriate proportions geographically, ethnically, and by education level (i.e., primary, middle, and high). Within those parameters, however, only schools with “friendly” administrations were chosen. That is, we only visited schools with principals and staff openly supportive of the Rhee-Henderson agenda. 

But even they desired changes to the testing program, whether or not it was expanded. Their suggestions covered both the annual districtwide DC-CAS (or “comprehensive” assessment system), on which the teacher evaluation system was based, and the DC-BAS (or “benchmarking” assessment system), a series of four annual “no-stakes” interim tests unique to DCPS, ostensibly offered to help prepare students and teachers for the consequential-for-some-school-staff DC-CAS.[2]

At each staff meeting I asked for a show of hands on several issues of interest that I thought were actionable. Some suggestions for program changes received close to unanimous support. Allow me to describe several.

1. Move DC-CAS test administration later in the school year. Many citizens may have logically assumed that the IMPACT teacher evaluation numbers were calculated from a standard pre-post test schedule, testing a teacher’s students at the beginning of their academic year together and then again at the end. In 2010, however, the DC-CAS was administered in March, three months before school year end. Moreover, that single administration of the test served as both pre- and post-test, posttest for the current school year and pretest for the following school year. Thus, before a teacher even met their new students in late August or early September, almost half of the year for which teachers were judged had already transpired—the three months in the Spring spent with the previous year’s teacher and almost three months of summer vacation. 

School staff recommended pushing DC-CAS administration to later in the school year. Furthermore, they advocated a genuine pre-post-test administration schedule—pre-test the students in late August–early September and post-test them in late-May–early June—to cover a teacher’s actual span of time with the students.

This suggestion was rejected because the test development firm with the DC-CAS contract required three months to score some portions of the test in time for the IMPACT teacher ratings scheduled for early July delivery, before the start of the new school year. Some small number of teachers would be terminated based on their IMPACT scores, so management demanded those scores be available before preparations for the new school year began.[3] The tail wagged the dog.

2. Add some stakes to the DC-CAS in the upper grades. Because DC-CAS test scores portended consequences for teachers but none for students, some students expended little effort on the test. Indeed, extensive research on “no-stakes” (for students) tests reveal that motivation and effort vary by a range of factors including gender, ethnicity, socioeconomic class, the weather, and age. Generally, the older the student, the lower the test-taking effort. This disadvantaged some teachers in the IMPACT ratings for circumstances beyond their control: unlucky student demographics. 

Central office management rejected this suggestion to add even modest stakes to the upper grades’ DC-CAS; no reason given. 

3. Move one of the DC-BAS tests to year end. If management rejected the suggestion to move DC-CAS test administration to the end of the school year, school staff suggested scheduling one of the no-stakes DC-BAS benchmarking tests for late May–early June. As it was, the schedule squeezed all four benchmarking test administrations between early September and mid-February. Moving just one of them to the end of the year would give the following year’s teachers a more recent reading (by more than three months) of their new students’ academic levels and needs.

Central Office management rejected this suggestion probably because the real purpose of the DC-BAS was not to help teachers understand their students’ academic levels and needs, as the following will explain.

4. Change DC-BAS tests so they cover recently taught content. Many DC citizens probably assumed that, like most tests, the DC-BAS interim tests covered recently taught content, such as that covered since the previous test administration. Not so in 2010. The first annual DC-BAS was administered in early September, just after the year’s courses commenced. Moreover, it covered the same content domain—that for the entirety of the school year—as each of the next three DC-BAS tests. 

School staff proposed changing the full-year “comprehensive” content coverage of each DC-BAS test to partial-year “cumulative” coverage, so students would only be tested on what they had been taught prior to each test administration.

This suggestion, too, was rejected. Testing the same full-year comprehensive content domain produced a predictable, flattering score rise. With each DC-BAS test administration, students recognized more of the content, because they had just been exposed to more of it, so average scores predictably rose. With test scores always rising, it looked like student achievement improved steadily each year. Achieving this contrived score increase required testing students on some material to which they had not yet been exposed, both a violation of professional testing standards and a poor method for instilling student confidence. (Of course, it was also less expensive to administer essentially the same test four times a year than to develop four genuinely different tests.)

5. Synchronize the sequencing of curricular content across the District. DCPS management rhetoric circa 2010 attributed classroom-level benefits to the testing program. Teachers would know more about their students’ levels and needs and could also learn from each other. Yet, the only student test results teachers received at the beginning of each school year was half-a-year old, and most of the information they received over the course of four DC-BAS test administrations was based on not-yet-taught content.

As for cross-district teacher cooperation, unfortunately there was no cross-District coordination of common curricular sequences. Each teacher paced their subject matter however they wished and varied topical emphases according to their own personal preference.

It took DCPS’s Chief Academic Officer, Carey Wright, and her chief of staff, Dan Gordon, less than a minute to reject the suggestion to standardize topical sequencing across schools so that teachers could consult with one another in real time. Tallying up the votes: several hundred school-level District educators favored the proposal, two of Rhee’s trusted lieutenants opposed it. It lost.

6. Offer and require a keyboarding course in the early grades. DCPS was planning to convert all its testing from paper-and-pencil mode to computer delivery within a few years. Yet, keyboarding courses were rare in the early grades. Obviously, without systemwide keyboarding training in computer use some students would be at a disadvantage in computer testing.

Suggestion rejected.

In all, I had polled over 500 DCPS school staff. Not only were all of their suggestions reasonable, some were essential in order to comply with professional assessment standards and ethics. 

Nonetheless, back at DCPS’ Central Office, each suggestion was rejected without, to my observation, any serious consideration. The rejecters included Chancellor Rhee, the head of the office of Data and Accountability—the self-titled “Data Lady,” Erin McGoldrick—and the head of the curriculum and instruction division, Carey Wright, and her chief deputy, Dan Gordon. 

Four central office staff outvoted several-hundred school staff (and my recommendations as assessment director). In each case, the changes recommended would have meant some additional work on their parts, but in return for substantial improvements in the testing program. Their rhetoric was all about helping teachers and students; but the facts were that the testing program wasn’t structured to help them.

What was the purpose of my several weeks of school visits and staff polling? To solicit “buy in” from school level staff, not feedback.

Ultimately, the new testing program proposal would incorporate all the new features requested by senior Central Office staff, no matter how burdensome, and not a single feature requested by several hundred supportive school-level staff, no matter how helpful. Like many others, I had hoped that the education reform intention of the Rhee-Henderson years was genuine. DCPS could certainly have benefitted from some genuine reform. 

Alas, much of the activity labelled “reform” was just for show, and for padding resumes. Numerous central office managers would later work for the Bill and Melinda Gates Foundation. Numerous others would work for entities supported by the Gates or aligned foundations, or in jurisdictions such as Louisiana, where ed reformers held political power. Most would be well paid. 

Their genuine accomplishments, or lack thereof, while at DCPS seemed to matter little. What mattered was the appearance of accomplishment and, above all, loyalty to the group. That loyalty required going along to get along: complicity in maintaining the façade of success while withholding any public criticism of or disagreement with other in-group members.

Unfortunately, in the United States what is commonly showcased as education reform is neither a civic enterprise nor a popular movement. Neither parents, the public, nor school-level educators have any direct influence. Rather, at the national level, US education reform is an elite, private club—a small group of tightly-connected politicos and academicsa mutual admiration society dedicated to the career advancement, political influence, and financial benefit of its members, supported by a gaggle of wealthy foundations (e.g., Gates, Walton, Broad, Wallace, Hewlett, Smith-Richardson). 

For over a decade, The Ed Reform Club exploited DC for its own benefit. Local elite formed the DC Public Education Fund (DCPEF) to sponsor education projects, such as IMPACT, which they deemed worthy. In the negotiations between the Washington Teachers’ Union and DCPS concluded in 2010, DCPEF arranged a 3 year grant of $64.5M from the Arnold, Broad, Robertson and Walton Foundations to fund a 5-year retroactive teacher pay raise in return for contract language allowing teacher excessing tied to IMPACT, which Rhee promised would lead to annual student test score increases by 2012. Projected goals were not metfoundation support continued nonetheless.

Michelle Johnson (nee Rhee) now chairs the board of a charter school chain in California and occasionally collects $30,000+ in speaker fees but, otherwise, seems to have deliberately withdrawn from the limelight. Despite contributing her own additional scandalsafter she assumed the DCPS Chancellorship, Kaya Henderson ascended to great fame and glory with a “distinguished professorship” at Georgetown; honorary degrees from Georgetown and Catholic Universities; gigs with the Chan Zuckerberg Initiative, Broad Leadership Academy, and Teach for All; and board memberships with The Aspen Institute, The College Board, Robin Hood NYC, and Teach For America. Carey Wright is now state superintendent in Mississippi. Dan Gordon runs a 30-person consulting firm, Education Counsel that strategically partners with major players in US education policy. The manager of the IMPACT teacher evaluation program, Jason Kamras, now works as Superintendent of the Richmond, VA public schools. 

Arguably the person most directly responsible for the recurring assessment system fiascos of the Rhee-Henderson years, then Chief of Data and Accountability Erin McGoldrick, now specializes in “data innovation” as partner and chief operating officer at an education management consulting firm. Her firm, Kitamba, strategically partners with its own panoply of major players in US education policy. Its list of recent clients includes the DC Public Charter School Board and DCPS.

If the ambitious DC central office folk who gaudily declared themselves leading education reformers were not really, who were the genuine education reformers during the Rhee-Henderson decade of massive upheaval and per-student expenditures three times those in the state of Utah? They were the school principals and staff whose practical suggestions were ignored by central office glitterati. They were whistleblowers like history teacher Erich Martel who had documented DCPS’ student records’ manipulation and phony graduation rates years before the Washington Post’s celebrated investigation of Ballou High School, and was demoted and then “excessed” by Henderson. Or, school principal Adell Cothorne, who spilled the beans on test answer sheet “erasure parties” at Noyes Education Campus and lost her job under Rhee. 

Real reformers with “skin in the game” can’t play it safe.

The author appreciates the helpful comments of Mary Levy and Erich Martel in researching this article. 

Access this testimonial in .pdf format

Even the Chancellor Calls the Results ‘Sobering’

The Washington City Paper has an article on the PARCC results with way more graphs and charts than I do, and they quote even Chancellor Kaya Henderson as saying the results were ‘sobering’.

Please remind me why she still has a job?

She and several other speakers said that the PARCC results were more ‘honest’ than the old DC-CAS results, probably because the new ‘passing’ scores are lower than the old ones. I guess that means that it’s more ‘honest’ to say that students are doing worse than we were previously led to believe, under the current regime of all-testing-all-the-time and turn-half-the-students-over-to-unregulated charters?

 

PARCC Results Released in DC

I just got back from watching the public release of the results of the PARCC test that students in Washington DC took about 7 months ago.

(Let that sink in: it took the testing company, and their consultants, and the city’s consultants, over HALF A YEAR to massage the data into a releasable form. So much for having these tests be able to be used to ‘inform instruction’ or help teachers figure out what kind of help their students need. It’s now the last day of November, and the students have been in school since August. What kind of help is that to teachers or parents? And tho I haven’t looked at the released school scores or samples of what the teachers will see, I’m not optimistic. If the past is any guide, the scores themselves will be essentially useless as well…)

(It won’t take so long next time, we were assured…)

I got to see Mayor Bowser, Councilman Grosso, Chancellor Kaya Henderson, [powerless] Superintendent Hanseul Kang, and Deputy Mayor for Education Jenny Niles, and charter honcho Scott Pearson perform and answer some mostly-lame questions from some members of the media.

What we saw were that advanced students in DC (largely white ones) do exceedingly well on this PARCC battery of tests, and that others (blacks; hispanics; SPEDs; students on free or reduced lunch; ELLs; or Students At Risk) do much worse. Which of course is  the very same result we’ve seen on the NAEP for a couple of decades.

In fact, of all the cities and states measured on the NAEP, Washington DC has the very widest gaps in test scores between the Upper Caucasia Haves and the Have-Nots everywhere else, and those gaps are if anything getting wider.

It was interesting to hear Henderson’s defenses of the results, which still showed very low percentages of most students “passing” the PARCC. She said, among other things, that

(1) since students at the lower grades generally scored higher than those at the upper grades, that show’s we are on the right path [seems to me it shows the exact opposite; the longer that students have been exposed to “Reform”, the worse they do… and

(2) It takes a long time, you can’t just expect to turn a switch and have everything be wonderful overnight, we need lots of wrap-around services and a longer school day and school year and support for teachers.

Regarding the latter excuse: isn’t that exactly what teachers were condemned for saying under Chancellor Rhee, whose understudy was none other than Kaya Henderson? Didn’t Rhee imply that the only reason that poor students did poorly in school was that their greedy, lazy teachers, empowered by their evil union, refused to teach them anything? And that anybody who said that it’s a lot harder to teach impoverished students of color with chaotic families (if any) than it is to teach middle-class children with educated parents – why those people were just making excuses for poverty?

 

A Few PARCC Scores Have Been Released for DC Public Schools

If you would like to see how District of Columbia public high school students did on the PARCC, you can look here at a press release from DCPS administration. This test was on ELA (reading) and Geometry. The scores for grades 3-8 have not yet been released.

The disparities in ‘pass’ rates between the DCPS magnet schools (Banneker and Walls) and every other DC public high school are amazing, particularly in geometry. Notice that several schools had not a single student ‘pass’. This year’s test gives students scores from 1 to 5; only a score of 4 or 5 is considered ‘college and career ready’ — although no studies have actually been done to determine whether that statement is actually true. Banneker and Walls have the lowest rates of students labeled ‘at risk’.

Here are two graphs which I cut-and-pasted from the press release. Click on them to enlarge them.

HS-PARCC dc ELA

HS-PARCC geometry

Given what I’ve seen of the convoluted questions asked on released sample PARCC questions, it is no wonder that ‘pass’ rates dropped a lot this year, compared with previous years. The DC-CAS wasn’t a very good test, but PARCC is terrible.

Please keep in mind that public education in the District of Columbia has been under the control of DEformers like Michelle Rhee, Kaya Henderson, and the Gates and Broad foundations, for over 8 years now. The students taking this test last spring have been under their rule since they were rising third graders. Every single teacher in DCPS was either hired by Rhee or by Henderson or else passed numerous strict evaluations with flying colors, year after year, and has been teaching just as they were directed to – or else.

And this is the best that the DEformers can do?

Demonstrate at the Wilson Building Tomorrow at 9 AM to Allow the Washington Teachers’ Union Access to Important Teacher Data

ACTION ALERT!

Join us tomorrow

to demand access to information on IMPACT

  On Tuesday, June 30 at 9 am join fellow DCPS educators, parents and other WTU allies at the Wilson Building to oppose cutting off access to information about the DCPS teacher evaluation system, IMPACT.  

Tomorrow morning the City Council will vote on legislation that would cut off access to IMPACT information, which your union, researchers and others need to judge the fairness and effectiveness of the evaluation system, and to determine whether D.C. Public Schools’ policies are really helping our children succeed.

The Washington Teachers’ Union (WTU) has always stood for transparent decision-making and open government. The union and others have urged the mayor and council members to remove from the Mayor’s Budget Support Act the provision that would prevent the union, educators and others from having access to IMPACT data, and to hold hearings on the provision.   

This is an urgent matter!
Be at the Wilson Building (14th St. and Pennsylvania Ave. NW) on Tuesday morning at 9 and let the DC City Council know that you strongly oppose keeping important IMPACT evaluation data secret.

Send us an email at dialogue@wtulocal6.net

and let us know you’ll be joining us!

A Quick Look at the National Academy of Science report on Mayoral Control of Schools in Washington DC

Last week, the National Academy of Science released a very long report assessing the progress (or lack of it) of the education of young people in Washington DC under mayoral control in both the regular public schools and in the charter schools.

.

The picture isn’t pretty, as Candi Peterson has pointed out.

.

Here are my major conclusions:

.
1. Mayoral control of schools has been a spectacular failure if you care anything about reducing the gaps between achievement levels of white students and those of color, the poor, special ed students, and English language learners (i.e. immigrants). The gaps between the pass rates on the DC-CAS standardized tests of those groups under mayoral control or the PERAA (Public Education Reform Amendment Act) are enormous and have essentially remained unchanged since 2007, when the law was implemented, according to the data in this report. Note that the report combines the data for both the DC public schools and charter schools, combined, at all grade levels, in both reading and math. Here are two graphs, made by me from data in the report, which show the lack of change. I estimated the percentage of students ‘proficient’ or ‘advanced’ in each of the groups (whites, blacks, hispanics, students receiving free or reduced-price lunches, English language learners, and special education students) from graphs provided by the report, and then subtracted the pass rates from each other. HIGH NUMBERS ARE BAD because they show large gaps in proficiency rates. Low numbers are good. Notice that there has been almost no change since mayoral control; some lines go up a tiny bit, some go down a bit, others waver back and forth a bit. Not a success story.
gap[s under mayoral control, math, dc-cas, acc to NAP report on PERAAgaps under mayoral control, reading, acc to national academies press
2. Ratings for teachers remain very much dependent on what students they teach. Many millions of dollars and enormous effort has been spent to devise supposedly scientific ways of measuring teacher effectiveness — i.e. VAM and IMPACT. Every single teacher remaining in DCPS has either been hired under Their Chancellorships or has been repeatedly measured as efffective or better. Yet the ratings for teachers at schools with high poverty rates, and in wards 7 and 8, remain much lower than those at schools with low poverty rates and in ward 3. Repeat: these low-ranked teachers are not holdovers from the ‘bad old days’ – they are either brand-new hires or have been repeatedly measured as good or excellent under IMPACT. (One bit of data: at my last school, from which I retired 5 years ago and which has over 100 faculty and administrators, only about 5 or 6 teachers remain from my time there.) I copied these two tables directly from the report:
teac her ratings under IMPACT, by ward teacher ratings under impact by ses
3. Now that we have 60-odd publicly-funded local school districts in Washington DC, most of which [the charter schools] are not required to provide much of anything in the way of data, we no longer have any effective way of saying what are good practices and which are poor practices, because we have no city-wide way of describing what is going on.
 .
4. The report generally omits any data from before 2007, and in some cases before 2009, which makes it hard to compare pre-mayoral control and post-MC. The exceptions are with some NAEP data, in which it is clear that any progress post-PERAA is indistinguishable from progress before PERAA. See these four graphs, which could have been taken from my blog but are again from the National Academy of Science report (I added the stuff in red for emphasis):
pre-post mayoral control naep scores 4th grade math pre-post mayoral control naep scores 4th grade reading pre-post mayoral control naep scores 8th grade math pre-post mayoral control naep scores 8th grade reading
5. The report totally omits the contractual obligations entered into by Rhee and Henderson with the Broad, Arnold and other foundations back in 2007 when they laid out 60-some goals they said they would meet by 2014. As you may recall from looking at my blog or what Erich Martel wrote on the topic, their success rate in meeting those goals (regarding things like NAEP and CAS scores for the most part) was approximately TWO PERCENT. Not 20%. But 2%. And I was being generous.
.
6. Finally, despite all the really damning data in the report, I predict that the Washington Post and others of their mindset will proclaim that it shows that mayoral control has been a wonderful success.
PS, here is the link so that you can download your own copy of the 341-page report:
http://www.nap.edu/download.php?record_id=21743

What Exactly Were the Differences Between Cheating in Atlanta Under Beverly Hall and the Cheating in DC Under Michelle Rhee?

We all know that administrators and teachers in DC and in Atlanta cheated in order to keep their jobs and gain large cash bonuses. In one city, scores of teachers were indicted, some plea=bargained, some went to jail, and the chief died of cancer. In the other city, only a couple of whistle-blowers lost their jobs, but the chief went on to fame and fortune while all the other culpable parties kept their bonuses.

But why is it that only in Atlanta were teachers and administrators indicted and convicted, but nowhere else?

What difference was there in their actual behavior?

To me, the answer is simple: in DC, officials at every level, from the Mayor’s office up to the President of the US and the Secretary of Education, were determined to make sure that Michelle Rhee’s lying and suborning of perjury and lies would never be revealed, no matter what.

Read for yourself part of the official documents in Atlanta (I’m quoting from The Answer Sheet) and see if you can find any real differences in behavior between what happened there and what happened in DC.

“A[tlanta] P[ublic] S[schools] principals and teachers were frequently told by Beverly Hall and her subordinates that excuses for not meeting targets would not be tolerated. When principals and teachers could not reach their targets, their performance was criticized, their jobs were threatened and some were terminated. Over time, the unnreasonable pressure to meet annual APS targets led some employees to cheat on the CRCT. The refusal of Beverly Hall and her top administrators to accept anything other than satisfying targets created an environment where achieving the desired end result was more important than the students’ education.

“To satisfy annual targets and AYP, test answer sheets were altered, fabricated, and falsely certified. Test scores that were inflated as a result of cheating were purported to be the actual achievement of targets through legitimately obtained improvements in students’ performance when, in fact, the conspirators knew those results had been obtained through cheating and did not reflect students’ actual academic performance.

“As part of the conspiracy, employees of APS who failed to satisfy targets were terminated or threatened with termination, while others who achieved targets through cheating were publicly praised and financially rewarded. For example, teachers who reported other teachers who cheated were terminated, while teachers who were caught cheating were only suspended. The message from Beverly Hall was clear: there were to be no exceptions and no excuses for failure to meet targets.

“Beverly Hall placed unreasonable emphasis on achieving targets; protected and rewarded those who achieved targets through cheating; terminated principals who failed to achieve targets; and ignored suspicions CRCT score gains at schools within APS. As a result, cheating became more and more prevalent within APS, until by the time the 2009 CRCT was administered, cheating was taking place in a majority of APS’s 83 elementary and middle schools. This was substantiated by GOSA’s erasure analysis, which identified 43 APS elementary and middle schools with at least one out of four classrooms within those schools having a statistically improbable number of erasures changing wrong answers to right answers. GOSA’s erasure analysis identified an additional 9 APS elementary and middle schools as having at least one out of five classrooms with a statistically improbable number of erasures changing wrong answers to right answers. Confessions by dozens of APS employees subsequently confirmed what GOSA’s statistical analysis indicated; widespread cheating occurred on the 2009 CRCT.

“It was further a part of the conspiracy and endeavor that targets achieved through cheating were used to obtain financial and other rewards for many of the conspirators.

“It was further part of the conspiracy and endeavor that targets achieved through cheating were used by Beverly Hall to obtain substantial performance bonuses.

“It was further part of the conspiracy and endeavor that Beverly Hall and other conspirators would interfere with, suppress and obstruct investigations into cheating using various methods. Conspirators would refuse to investigate reports of cheating; suppress and deny the existence of reports of cheating; fail to act upon APS investigators’ conclusions that cheating was occurring; suppress and deny the APS investigators’ conclusions that cheating was in fact occurring; fail and refuse to provide complaints of cheating to the Governor’s Special Investigators, the Georgia Bureau of Investigation and investigators from the Fulton County District Attorney’s Office; and intimidate witnesses with the intent to hinder, delay, or prevent the communication of criminal offenses to law enforcement officers. When questioned by the Governor’s Special Investigators and law enforcement officers, many of the conspirators made false statements some under oath denying their knowledge of and participation in the cheating.”

A closer look at charter and regular public school enrollments, percentages of students at risk, and percentages of students ‘proficient’

Here is another look at the brand-new data concerning four variables in the District of Columbia schools, about which I wrote a couple of days ago. The difference here is that the dots representing the schools are more-or=less proportional to the size of the student body.

1. Is this a regular public school, or a charter school (blue or red):

2. What fraction of the kids at that school are officially considered to be At Risk? (That’s the scale along the x-axis at the bottom of the page)

3. What is the average percentage of the kids at that school are ‘proficient’ in reading and math on the DC-CAS? (That’s the scale along the y-axis at the left-hand side of the page)

4. How big is the school? (That’s the size of the dot, more or less; the legend is at the bottom left-hand corner of the graph)

Time spent looking carefully at this graph will be well-spent. If you click on it, it will expand.

It will certainly show that charter schools have not revolutionized education for the better in DC: for both types of schools, there remains a very strong, negative correlation between the percentages of kids At Risk and ‘pass’ rates on the DC-CAS.

Note that most schools have between 200 and 500 students and that most of the ones that are smaller are actually charter schools. As I wrote a couple of days ago, the schools with the largest fraction of At-Risk students (say, over 2/3 of the student body) are almost all regular DC public schools.

On the second graph, which is otherwise identical to the first, I’ve labeled some of the larger schools.

fixed bicolor, size of school and at risk vs average dc cas 2014 proficiency, both regular public and charter, dc

Here is the one with names of some of the larger schools, so you can see how individual schools fall on this graph.

(Sorry, I there was not enough room to label every single one, and my non-existent HTML skills won’t allow me to make it so that any of the dots are clickable. If any of my readers know how to do that and would like to offer to make that happen, then please let me know in the comments.)

again fixed and revised names and bicolor, size of school and at risk vs average dc cas 2014 proficiency, both regular public and charter, dc

And here is the entire data table. So you can see where every single school lies on these three dimensions.

(PS: I added a few more names of schools and corrected four other small errors, two pointed out by an alert reader.. 2/22/2015)

%d bloggers like this: