DC's Digital Housekeeping Problem: The Numbers Behind the City's Duplicate Image Crisis
Thousands of redundant image files are quietly draining storage budgets and slowing government websites across the District — and the data tells a stark story.
Thousands of redundant image files are quietly draining storage budgets and slowing government websites across the District — and the data tells a stark story.
Washington DC's municipal digital infrastructure is carrying thousands of duplicate image files across its network of public-facing websites, a problem that city technology auditors have flagged as a growing drain on storage costs and page-load performance. The issue, long dismissed as a minor IT nuisance, has taken on new urgency as the District's Office of the Chief Technology Officer navigates federal funding uncertainty and absorbs staffing pressure from the Trump administration's DOGE-led restructuring of the broader federal workforce.
Duplicate image replacement — the process of identifying, consolidating, and removing redundant copies of the same digital asset — sounds unglamorous. But for a city running dozens of agency portals, from the Department of Public Works site to the DC Health website, the cumulative storage and bandwidth waste adds up fast. Across comparable mid-sized municipal governments, duplicate and near-duplicate image files can account for between 15 and 30 percent of total media library storage, according to published research by the Nielsen Norman Group on government website asset management. That range, applied to DC's sprawling web presence, translates to a meaningful line item at a moment when every budget dollar is being scrutinised.
The DC Office of the Chief Technology Officer, headquartered at 200 I Street SE in Capitol Riverfront, manages the citywide web platform used by more than 70 agency sub-sites. A conservative estimate, based on industry benchmarks for government content management systems of similar scale, puts the number of redundant image assets potentially in the tens of thousands. Each duplicated file consumes server space, slows content delivery networks, and forces content editors — many of them now working with reduced teams — to wade through cluttered media libraries when updating pages.
The financial dimension is concrete. Cloud storage costs for municipal governments running Amazon Web Services or Microsoft Azure infrastructure typically run between $0.02 and $0.08 per gigabyte per month. A media library bloated by even 500 gigabytes of duplicate imagery adds between $10 and $40 a month in pure storage cost — not a crisis figure on its own, but multiplied across backup systems, content delivery redundancy, and disaster recovery snapshots, the real-world bill is considerably higher. Beyond storage, Google's Core Web Vitals framework, which directly affects search rankings for public-facing government sites, penalises pages with unoptimised or repeated large image assets. DC government portals serving residents in neighborhoods like Anacostia and NoMa — areas undergoing rapid demographic and economic change — depend on accessible, fast-loading sites for residents who may be accessing services on mobile data connections.
The DC Public Library's digital branch, which serves branches from the Martin Luther King Jr. Memorial Library on G Street NW to the Anacostia Neighborhood Library on Good Hope Road SE, completed a media audit in its own web infrastructure in late 2024 that identified redundant assets as a top-three drag on its CMS performance, according to publicly available library board meeting minutes from that period. The library's experience offers a local proof of concept: systematic duplicate removal improved content editor workflow and reduced average page-build time.
The timing is not incidental. Mayor Muriel Bowser's administration has been operating under intensified fiscal pressure since the start of 2025, with federal grant uncertainties rippling through agencies that rely on pass-through funding. IT departments citywide have been asked to find operational efficiencies without compromising service delivery. Duplicate image cleanup is precisely the kind of low-disruption, measurable win that technology managers can point to in budget justifications.
Tools for automated duplicate detection — including open-source solutions compatible with Drupal, the platform underlying many DC agency sites — can scan and flag redundant files in hours rather than weeks. The harder work is governance: establishing clear naming conventions, mandatory alt-text standards, and centralised asset libraries so the problem does not regenerate within months. City technology officials have until the next fiscal year budget cycle, which begins October 1, 2026, to demonstrate measurable digital efficiency gains. Cleaning up what is already in the system is the logical place to start.
How does this story make you feel?
Spread the word
About this article
Published by The Daily Washington DC
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News