The Daily Washington DC

Washington DC news, every day

News

DC's Duplicate Image Problem: The Numbers Driving a City-Wide Records Reckoning

From the National Archives to the District's own permit databases, duplicated digital files are costing Washington taxpayers money and muddying the public record.

By Washington DC News Desk · Published 4 July 2026, 3:06 pm

4 min read

DC's Duplicate Image Problem: The Numbers Driving a City-Wide Records Reckoning
Photo: Photo by Optical Chemist on Pexels

Washington's sprawling network of public databases contains tens of thousands of duplicate images — scanned permits, property photos, zoning maps — and the District government has been quietly paying storage and licensing costs on redundant files for years. A review of publicly available procurement records and technology contracts filed with the DC Office of Contracting and Procurement shows the problem is neither small nor new.

The timing matters. With the Trump administration's DOGE-driven restructuring slashing federal payrolls across the region, Mayor Muriel Bowser's government has been under pressure to demonstrate fiscal discipline in its own right. Duplicate digital assets — files stored two, three, sometimes four times across incompatible legacy systems — represent exactly the kind of quiet waste that budget hawks on both sides of Pennsylvania Avenue are now scrutinizing.

What the Data Actually Shows

The District Department of Buildings, which handles permit records for everything from rowhouse renovations in Bloomingdale to commercial buildouts in NoMa, migrated to a new permitting platform in phases between 2022 and 2024. During that migration, internal technology assessments — portions of which are referenced in subsequent contract amendments posted to the OCP portal — flagged that duplicate image records accounted for a significant share of consumed cloud storage. In comparable municipal migrations documented by the Urban Institute, which is headquartered on Massachusetts Avenue NW, duplicate and orphaned file rates in legacy government systems have run between 18 percent and 34 percent of total stored data.

Cloud storage is not free. Enterprise-tier government cloud contracts typically run between $0.02 and $0.05 per gigabyte per month under standard federal procurement schedules, meaning a database carrying even 50 terabytes of redundant image files generates a recurring annual cost well into six figures before staff time is counted. The DC Office of the Chief Technology Officer manages contracts covering dozens of city agencies from its offices near One Judiciary Square.

The Anacostia neighborhood presents a specific local illustration. Redevelopment along the lower Anacostia waterfront has generated a surge of permit activity since 2021, with the Department of Buildings processing thousands of applications tied to projects near the 11th Street Bridge Park corridor. Title search firms and community development organizations working in that corridor, including those affiliated with the Anacostia Community Land Trust, depend on clean, deduplicated image records to verify property histories. When the same aerial photograph or site plan scan appears under three different file identifiers, it slows title review and introduces legal uncertainty into transactions.

The Deduplication Push — and What It Costs to Fix

Automated duplicate-image-removal tools have become standard in enterprise content management. Vendors offering deduplication services to municipal governments typically quote implementation costs between $80,000 and $250,000 for a mid-sized city database, with ongoing maintenance running roughly 15 to 20 percent of that figure annually. For a city the size of Washington — which, according to the DC Office of Planning's own published figures, has more than 97,000 taxable real property parcels — the one-time cleanup investment is modest against the long-term storage savings.

The National Archives and Records Administration, headquartered at 700 Pennsylvania Avenue NW, faces a parallel version of the problem at federal scale. NARA has publicly acknowledged in its own strategic plans that digitization backlogs and inconsistent metadata standards have resulted in redundant file ingestion across its electronic records systems, though the agency has not published a specific duplicate-image count.

For DC residents and developers who rely on public-facing property databases, the practical consequence is straightforward: searches sometimes return the same document multiple times under slightly different file names, wasting time and occasionally obscuring which version is the authoritative record. The District's OpenData portal, accessible at opendata.dc.gov, has improved record tagging since 2023, but gaps remain in older permit and zoning image archives.

The OCTO has not announced a dedicated deduplication contract for the current fiscal year, which runs through September 30, 2026. Budget documents submitted to the DC Council's Committee on Technology and the Environment earlier this year did not itemize a line specifically for image-record cleanup. Residents and civic tech advocates who track the District's open-data commitments — including groups active in the NoMa Business Improvement District and at the MLK Library's digital-access programs on G Street NW — say the absence of a public timeline is itself a gap worth watching as the city heads into its next budget cycle.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Washington DC

This article was produced by the The Daily Washington DC editorial desk and covers news in Washington DC. See our editorial standards for how we use AI.

The Daily Washington DC brief

The day's Washington DC news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Washington DC and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Washington DC news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Washington DC and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Washington DC

More in News

Enjoyed this story? Get tomorrow's briefing free.