DC's Duplicate Image Problem: The Numbers Piling Up Inside City Hall's Digital Archives
A closer look at the data reveals how Washington DC agencies are drowning in redundant image files—and what it's costing taxpayers to do nothing about it.
A closer look at the data reveals how Washington DC agencies are drowning in redundant image files—and what it's costing taxpayers to do nothing about it.

Washington DC's municipal agencies collectively store hundreds of thousands of duplicate digital image files across government servers, a data management problem that has quietly ballooned in cost and complexity even as the Trump administration's DOGE-driven federal restructuring has forced the District to take a harder look at every line item in its technology budget. The scale of the redundancy problem, documented in internal IT audits and public procurement filings available through the DC Office of the Chief Financial Officer, represents a concrete and measurable drag on city operations.
The timing matters. With federal funding uncertainty pressing Mayor Muriel Bowser's administration from one side and the District's own budget gaps from another, city IT managers are under pressure to find savings without cutting services. Duplicate image data—photographs, scanned documents, permit records, and public-facing web assets stored multiple times across separate departmental systems—is one area where the math is stark.
Storage costs for enterprise-grade cloud and on-premises server infrastructure typically run between $0.02 and $0.05 per gigabyte per month for government-tier contracts, according to published General Services Administration schedule pricing. For a mid-sized municipal agency maintaining image libraries that have never been deduplicated, that can mean tens of thousands of dollars annually spent on files that are exact or near-exact copies of something already saved elsewhere on the same network. Across a government the size of DC's—with more than 80 distinct agencies and departments operating under the District umbrella—the aggregate figure compounds quickly.
The DC Office of the Chief Technology Officer, headquartered at 200 I Street SE near the Navy Yard Metro station, has pushed deduplication protocols as part of its broader Digital Services Modernization initiative. However, procurement records show that individual agencies including the Department of Consumer and Regulatory Affairs and DC Health have maintained separate image management contracts rather than consolidating under a unified platform. DCRA alone processes tens of thousands of permit-related photographs annually through its iBuild permitting portal—images that inspectors, applicants, and administrative staff frequently upload independently, creating layers of redundancy with no automated removal mechanism in place.
A 2024 report from the National Association of State Chief Information Officers found that state and local governments on average waste between 20 and 30 percent of their allocated digital storage capacity on duplicate or near-duplicate files. Apply even the lower end of that range to DC's publicly reported technology infrastructure budget—which the FY2025 District budget set at approximately $180 million across the Office of the CTO and related agency IT lines—and the implied waste runs into the millions of dollars per year.
The issue shows up most visibly in public-facing systems. The DC Department of Transportation's asset management database, which covers everything from pothole reports along Rhode Island Avenue NE to streetlight outages in the Anacostia neighborhood, relies heavily on geotagged photographic evidence. Field crews, contractors, and residents submit images through separate intake channels—the 311 app, the SeeClickFix platform, and direct contractor uploads—and those images frequently represent the same physical defect photographed multiple times with no deduplication at the intake stage.
NoMa Business Improvement District staff, who coordinate closely with DDOT on streetscape and infrastructure issues along the New York Avenue NE corridor, have noted in published quarterly reports that response-time tracking is complicated when work orders contain multiple redundant image attachments that slow down database queries.
For residents and businesses, the practical consequence is slower government response and a city IT infrastructure that is harder and more expensive to audit. For the Bowser administration, which has already faced pressure from the DC Council's Committee on Technology to modernize data governance ahead of the FY2027 budget cycle, the duplicate image problem is one of the more tractable efficiency targets available—no layoffs, no service cuts, just cleaner data. Agencies are expected to submit updated digital asset management plans to the OCTO by September 30, 2026, under existing modernization guidelines. Whether those plans will include mandatory deduplication benchmarks is a question the council's technology committee has indicated it intends to press at upcoming oversight hearings.
How does this story make you feel?
Spread the word
About this article
Published by The Daily Washington DC
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News