The Daily Washington DC

Washington DC news, every day

News

How Washington DC's Public Records Got Buried Under Thousands of Duplicate Images — And What It Cost

A years-long failure to audit digital archives across city agencies has left taxpayers footing the bill for redundant storage while critical documents remain hard to find.

By Washington DC News Desk · Published 4 July 2026, 2:44 pm

3 min read

Washington DC's municipal digital infrastructure is carrying dead weight. Across dozens of city agencies, duplicate image files — scanned permits, inspection photos, zoning maps, court exhibits — have accumulated for years inside shared drives and document management systems, consuming server capacity that costs the city real money and slowing down public records requests that residents and lawyers depend on daily.

The problem didn't arrive overnight. It built up gradually as agencies digitised their paper archives in waves, starting around 2014 under the DC Department of General Services' Records Management Division, then accelerated after Mayor Muriel Bowser's administration pushed a broader open-data initiative beginning in 2019. Each migration copied files without first running deduplication checks. IT staff at agencies like DC Health and the Department of Consumer and Regulatory Affairs inherited the mess.

The Accumulation Problem: Agency by Agency

The Department of Buildings — formerly part of the Department of Consumer and Regulatory Affairs until it was spun off in October 2022 — maintains inspection photo archives for roughly 180,000 properties across the city. Sources familiar with the agency's internal systems, speaking in general terms about common government IT shortfalls, describe a pattern where field inspectors upload images from mobile devices that automatically sync to a central repository regardless of whether an identical file already exists. A single inspection on a rowhouse on Good Hope Road SE might generate eight photos, four of which are perfect duplicates taken half a second apart.

The Office of the Chief Technology Officer, which oversees the DC.gov technology stack from its offices near One Judiciary Square on Indiana Avenue NW, has known about the deduplication gap for at least three years. A 2023 audit summary published on the city's open data portal flagged redundant file storage as a budget inefficiency across the enterprise content management system used by at least fourteen agencies. The document did not assign a dollar figure to the problem.

Outside estimates for comparable mid-sized municipal governments suggest duplicate digital records can consume between 20 and 40 percent of allocated cloud and on-premise storage. DC's general technology budget in Fiscal Year 2025 was set at approximately $290 million, according to figures included in the city's approved budget documents. Even a conservative 15 percent waste figure on storage costs alone translates into millions of dollars annually sitting on drives holding files nobody needs twice.

Why This Matters More Right Now

Timing matters here. The Trump administration's Department of Government Efficiency push has already pressured DC's federal workforce neighbours, and the downstream economic anxiety in neighborhoods like NoMa and Capitol Hill has sharpened scrutiny of how the city itself manages public money. Bowser's administration is simultaneously fighting for federal funding on multiple fronts while trying to demonstrate fiscal discipline to a Congress that controls a significant slice of the DC budget through the annual appropriations process.

There is also a practical civic harm. When a resident in Anacostia files a public records request — through the DC FOIA office, housed under the Office of the Attorney General on 6th Street NW — duplicate files slow the review process. Attorneys pulling building inspection records for litigation wait longer. Journalists requesting permit photos from the Department of Buildings get back unwieldy file packages padded with near-identical images.

The National Archives and Records Administration, which sets federal records management standards and whose main facility sits just off Pennsylvania Avenue NW, has published guidance recommending that agencies run automated hash-based deduplication before any migration. DC's agencies have not consistently followed that model.

The Office of the Chief Technology Officer has indicated in budget briefings to the DC Council's Committee on Technology and the Environment that a citywide deduplication project is under consideration for Fiscal Year 2027, which begins October 1, 2026. Residents who need public records in the meantime can submit FOIA requests online through the DC government portal, specify the exact date range and property address they need, and explicitly ask the records officer to exclude duplicate attachments — a small practical step that, according to the FOIA office's own FAQ page, can meaningfully reduce processing time.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Washington DC

This article was produced by the The Daily Washington DC editorial desk and covers news in Washington DC. See our editorial standards for how we use AI.

The Daily Washington DC brief

The day's Washington DC news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Washington DC and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Washington DC news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Washington DC and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Washington DC

More in News

Enjoyed this story? Get tomorrow's briefing free.