The Daily Washington DC

Washington DC news, every day

News

DC's Digital Archives Are Full of Duplicate Images. Experts Say the City Is Paying Twice for Storage It Doesn't Need.

As federal funding uncertainty squeezes the District's budget, officials and technologists are calling out a quiet problem bloating city databases and costing taxpayers real money.

By Washington DC News Desk · Published 4 July 2026, 2:44 pm

4 min read

Washington DC's government databases are riddled with duplicate digital images — redundant photographs, scanned documents and copied files sitting in multiple storage systems simultaneously — and the price tag for that inefficiency is climbing. Technology officials and archivists who work with District agencies say the problem has grown worse as city departments expanded remote work infrastructure after 2020, creating overlapping cloud and on-premises storage environments that were never properly reconciled.

The issue matters right now for a specific reason: the District is entering a period of unusual fiscal pressure. With the Trump administration's Department of Government Efficiency restructuring pushing federal workers out of office buildings along Pennsylvania Avenue and K Street, the local economy is absorbing real shocks. Mayor Muriel Bowser's administration is simultaneously trying to demonstrate that DC's own government runs lean. Wasteful digital storage — invisible to the public but visible on IT procurement invoices — is the kind of line item that draws scrutiny when budgets tighten.

What the Specialists Are Saying

Digital archivists at institutions including the DC Public Library's Washingtoniana Division and technology staff at the Office of the Chief Technology Officer have long flagged redundant data as a structural problem rather than an occasional mistake. The core issue is that when agencies migrate from one platform to another — say, from a legacy server in a downtown data center to a Microsoft Azure environment — files frequently copy across rather than move, leaving the original untouched. Multiply that across dozens of city agencies and hundreds of thousands of image files, and the redundancy becomes significant.

Technology policy researchers who study municipal government IT spending point to deduplication software as the standard fix. Enterprise-grade deduplication tools typically reduce storage footprints by 30 to 50 percent in environments with high volumes of scanned documents and photographs, according to widely cited figures from the storage industry. For a mid-sized city government running multiple cloud contracts, that translates to tens of thousands of dollars annually in avoided storage fees — not transformative, but not trivial when every department is being asked to justify expenditures.

The DC Office of the Chief Technology Officer, which coordinates technology policy across District agencies, has not published a specific public report on duplicate image volumes as of this writing. But the office has previously identified data governance as a priority area in its multi-year technology roadmap, and archival staff at institutions including the Historical Society of Washington DC, headquartered on Mount Vernon Square, have described the broader digital stewardship environment as uneven across agencies.

Local Systems, Concrete Stakes

Two city systems illustrate the stakes most clearly. The DC Department of Buildings, which absorbed the former Department of Consumer and Regulatory Affairs' permitting functions, maintains large archives of property inspection photographs. The DC Office of Planning holds extensive image records tied to Historic Preservation Review Board cases — many of them covering neighborhoods like Anacostia and NoMa, where development pressure has made historic documentation both more voluminous and more contested. In both cases, images generated at multiple stages of a review process can end up stored in separate folders within the same system, or across two systems that were never integrated.

The practical consequence is not just cost. Duplicate images complicate public records requests. When a resident or attorney files a Freedom of Information Act request with a DC agency, staff must search multiple locations to ensure completeness. If duplicates are inconsistently named or tagged, retrieval takes longer and errors become more likely.

The fix is not technically complicated, which is part of what frustrates specialists in this space. Deduplication runs as a background process; most modern cloud storage platforms include it as a native feature that simply needs to be enabled and configured. The barrier is almost always administrative — agencies operating in silos, IT staff stretched across competing priorities, and procurement cycles that don't reward tidiness in existing systems.

For DC residents and businesses that interact regularly with city digital systems, the practical advice from technology specialists is straightforward: when submitting documents or photographs to any District agency portal, avoid re-uploading the same file under a different name, and use the agency's official file-naming conventions where published. That won't solve the structural problem, but it reduces the chance your records become part of it. The larger fix remains a policy decision, and officials at the Wilson Building will need to make it deliberately rather than by default.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Washington DC

This article was produced by the The Daily Washington DC editorial desk and covers news in Washington DC. See our editorial standards for how we use AI.

The Daily Washington DC brief

The day's Washington DC news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Washington DC and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Washington DC news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Washington DC and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Washington DC

More in News

Enjoyed this story? Get tomorrow's briefing free.