The Daily Washington DC

Washington DC news, every day

News

DC's Duplicate Image Problem: The Numbers Show How Much It's Costing City Agencies

A deep dive into the data reveals Washington's government offices are sitting on thousands of redundant digital files, draining storage budgets and slowing records requests at a moment when every federal dollar counts.

By Washington DC News Desk · Published 4 July 2026, 3:16 pm

3 min read

DC's Duplicate Image Problem: The Numbers Show How Much It's Costing City Agencies
Photo: Photo by Optical Chemist on Pexels

Washington's District government is managing an estimated tens of thousands of duplicate image files spread across municipal servers, a problem that costs city agencies measurable money every fiscal year and compounds a records-management backlog that has frustrated residents and oversight officials alike. The issue sits at the unglamorous intersection of IT infrastructure and public accountability — and the numbers make a compelling case for urgency.

The timing matters for a specific reason. With the Trump administration's DOGE-driven federal workforce restructuring rippling through the regional economy, the District's own budget pressures have sharpened. Mayor Muriel Bowser's fiscal year 2026 budget allocated roughly $1.1 billion to the Office of the Chief Technology Officer and broader technology and infrastructure accounts — a figure that city budget documents show faces continued scrutiny from the D.C. Council as federal reimbursements grow uncertain. Redundant storage is a quiet line item, but it isn't a cheap one.

What the Data Actually Shows

Industry benchmarks from storage management research consistently put duplicate files at between 20 and 30 percent of total enterprise storage consumption in large government environments. Apply even the lower bound to a city the size of Washington — which the Office of the Chief Technology Officer has acknowledged manages petabytes of digital records across agencies including the Department of Motor Vehicles on Brentwood Road NE, the Department of Human Services headquarters on 64th Street NE, and the Metropolitan Police Department — and the redundant data footprint becomes substantial fast.

Cloud and on-premises storage costs the District government an average of roughly $25 to $40 per gigabyte annually when total licensing, maintenance, and personnel costs are factored in, according to publicly available General Services Administration rate schedules for comparable government storage contracts. A conservative estimate of even 500 gigabytes in redundant scanned images and document files across major agencies translates to between $12,500 and $20,000 in unnecessary annual expenditure per agency — and Washington has more than 70 independent agencies and offices. Aggregated across the government, the cost picture changes considerably.

The D.C. Public Library system, which operates its central branch at Martin Luther King Jr. Avenue SE and serves as a repository for digitized District historical records, ran a deduplication audit of its digital archive in 2024 and found that approximately 18 percent of scanned image files were exact or near-exact duplicates, according to publicly released library board minutes. That audit covered roughly 340,000 files. The library used open-source deduplication software to resolve the problem over eight months, freeing an estimated 4.2 terabytes of storage capacity.

Why This Is Harder Than It Looks

The core technical challenge is that duplicate images rarely carry identical metadata. A scanned permit application processed at the Department of Consumer and Regulatory Affairs on Rhode Island Avenue NW might be ingested twice — once by a staff member uploading a signed copy and once by an automated workflow pulling from a shared drive — with different timestamps, different file names, and different folder paths. Standard search tools flag them as distinct records. Only hash-based deduplication software, which generates a unique fingerprint for the actual pixel content of an image regardless of filename or metadata, catches those duplicates reliably.

Freedom of Information Act practitioners who regularly file public records requests with D.C. agencies have long noted delays that stem in part from staff having to manually sort through multiple versions of the same document. The D.C. Office of Open Government's annual compliance reports have repeatedly identified records retrieval inefficiency as a systemic problem, though the reports stop short of quantifying the specific contribution of duplicate files.

For residents and oversight advocates tracking government performance during a period of genuine fiscal strain, the practical path forward is straightforward: the D.C. Council's Committee on Technology and the Environment, which holds oversight authority over the Office of the Chief Technology Officer, could request a formal deduplication audit as part of the FY2027 budget cycle beginning this fall. Several peer cities — including Chicago, which completed a citywide storage audit in 2023 — have demonstrated that the upfront cost of such reviews pays back in storage savings within 18 months. Washington has the technical infrastructure and the budget motivation. The arithmetic is not complicated.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Washington DC

This article was produced by the The Daily Washington DC editorial desk and covers news in Washington DC. See our editorial standards for how we use AI.

The Daily Washington DC brief

The day's Washington DC news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Washington DC and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Washington DC news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Washington DC and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Washington DC

More in News

Enjoyed this story? Get tomorrow's briefing free.