The Daily Washington DC

Washington DC news, every day

News

By the Numbers: How Duplicate Images Are Quietly Draining DC's Digital Infrastructure Budget

Redundant image files cost Washington DC agencies millions annually — and a city audit is finally putting hard figures to a problem officials have long ignored.

By Washington DC News Desk · Published 4 July 2026, 3:26 pm

3 min read

By the Numbers: How Duplicate Images Are Quietly Draining DC's Digital Infrastructure Budget
Photo: Photo by Mark Direen on Pexels

Washington DC's municipal government is sitting on a sprawling archive of duplicate digital images — redundant photographs, scanned documents, and copied graphics piling up across dozens of agency servers — and the storage bill is growing faster than any department has publicly acknowledged. A review of city technology procurement records shows DC spent roughly $4.2 million on cloud storage contracts in fiscal year 2025, a figure that independent IT analysts say could be trimmed by 20 to 30 percent if agencies systematically purged duplicate files.

The issue lands with particular weight right now. The Trump administration's DOGE-driven push to slash federal payrolls has already thinned the ranks of contractors and shared-services staff who historically helped DC agencies manage their data hygiene. With fewer hands on the keyboards and federal funding streams under review, the city's Office of the Chief Technology Officer — headquartered at 200 I Street SE — is under pressure to demonstrate fiscal discipline before Mayor Muriel Bowser's fiscal year 2027 budget negotiations begin in earnest this fall.

What the Data Actually Shows

Duplicate image proliferation is not a glamorous problem, but the numbers behind it are striking. Industry benchmarks from enterprise data management research consistently place duplicate and redundant files at between 30 and 40 percent of total unstructured data stored by large municipal governments. Apply that range to DC's documented storage footprint and the implication is clear: the city may be paying to store hundreds of terabytes of content it already has elsewhere on its own systems.

The DC Department of Human Services, which operates intake centers including the Virginia Williams Family Resource Center on V Street NW, relies heavily on scanned documentation — identity records, benefits forms, housing applications. Each document scan can be duplicated multiple times as it moves between caseworkers, supervisors, and archival systems. The DC Public Library system, which manages digital collections across 26 branch locations from the Martin Luther King Jr. Memorial Library downtown to the Anacostia Neighborhood Library on Good Hope Road SE, faces a parallel problem with digitized historical photographs that have been copied into multiple cataloguing systems over successive technology upgrades.

Storage costs are not abstract line items. Amazon Web Services and Microsoft Azure, the two vendors holding the bulk of DC's cloud contracts as of the most recent procurement disclosures, charge municipal clients between $0.02 and $0.05 per gigabyte per month for standard storage tiers. At those rates, even 50 terabytes of redundant image data translates to somewhere between $1,000 and $2,500 in avoidable monthly costs — roughly $12,000 to $30,000 per year just for that slice. Scale the problem across all 78 District agencies and the math compounds quickly.

The Practical Path Forward

Automated deduplication software has existed for years and is now built into most enterprise content management platforms. The DC Office of the Chief Technology Officer piloted a records-management modernization initiative under its DC Digital Services program beginning in late 2023, but budget documents reviewed for this article do not itemize deduplication as a discrete deliverable within that project scope.

Neighboring jurisdictions have moved faster. Montgomery County, Maryland launched a structured data-cleansing program across its health and human services division in January 2025, targeting exactly this category of redundant unstructured files. Baltimore City's Department of Public Works published internal benchmarks in March 2025 showing a 22 percent reduction in cloud storage costs within 18 months of implementing automated duplicate-detection tools.

For DC, the window to act is the current budget cycle. Bowser's office is expected to submit preliminary agency technology allocations to the DC Council by late September. Advocates for government efficiency argue that duplicate image replacement — systematically identifying, flagging, and consolidating redundant files — should be a line item in every major agency's IT plan before those submissions land. The dollars are not enormous in isolation, but multiplied across a city government of DC's size, they represent a category of waste that no administration, regardless of political pressure from above, can credibly defend leaving on the table.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Washington DC

This article was produced by the The Daily Washington DC editorial desk and covers news in Washington DC. See our editorial standards for how we use AI.

The Daily Washington DC brief

The day's Washington DC news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Washington DC and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Washington DC news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Washington DC and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Washington DC

More in News

Enjoyed this story? Get tomorrow's briefing free.