Washington's government agencies and cultural institutions are sitting on a sprawling, largely unaudited mess of duplicate digital images — scanned documents, photographs, and records that exist in multiple copies across servers, cloud storage accounts, and legacy hard drives. The problem has been building for years. What's new is the pressure to fix it.
The combination of ongoing federal workforce restructuring under the Trump administration, budget uncertainty flowing from DOGE-driven efficiency reviews, and the District's own fiscal planning cycle has pushed the question of digital asset management from the back office to city hall. Administrators, archivists, and technology procurement officers are now confronting a question they've avoided for years: what does it actually cost to identify, consolidate, and replace redundant image records at scale?
The Scale of the Problem in DC
The DC Public Library system, headquartered at the Martin Luther King Jr. Memorial Library on G Street NW, completed a digitization expansion in 2023 that added tens of thousands of images to its online collections. Librarians and digital archivists working on that project have since acknowledged, in public forums and professional conferences, that deduplication — the process of identifying and removing or replacing identical or near-identical image files — was not systematically built into the original workflow. The library's digital collections now span multiple platforms, including its partnership with the Digital Public Library of America.
The DC Office of the Chief Technology Officer, which coordinates digital infrastructure across District agencies, has been fielding questions from agency heads about exactly this issue since at least late 2025. The OCTO has pointed agencies toward federally aligned data governance frameworks, but those frameworks don't specify a standard for image deduplication or mandate a timeline for compliance. That gap is where the current debate is centered.
At the Anacostia Community Museum on Fort Place SE, curatorial staff have flagged the issue in internal planning documents as a resource drain: staff time spent managing redundant files is time not spent on new acquisitions or public programming. The museum, part of the Smithsonian Institution, operates under a separate federal funding stream that is itself subject to congressional appropriations uncertainty in the current fiscal environment.
What Experts Are Recommending
Digital asset management specialists working with District government contractors have outlined a tiered approach: first, automated scanning to flag files with identical or near-identical hash values; second, human review of flagged items to confirm true duplicates versus intentional variations; third, replacement of outdated low-resolution images with authoritative high-resolution masters stored in a single designated repository. The process, for a mid-sized municipal archive, typically runs between $80,000 and $250,000 depending on collection size and whether the work is done in-house or outsourced.
The DC Office of Contracting and Procurement posted a request for information in March 2026 seeking vendor capabilities in digital records management, which specialists in the field read as a signal that a formal solicitation may follow before the end of fiscal year 2026, which closes September 30. No contract has been awarded as of this date.
The timing is complicated by the federal dimension. A significant share of the digital image records held by District-adjacent institutions — including collections managed by contractors based in the NoMa neighborhood along New York Avenue NE — are technically federal property or are governed by federal data-sharing agreements. Determining who has authority to authorize deletion or replacement of a duplicate image record sometimes requires clearance from a federal agency that has itself seen staffing reductions in the past 18 months.
For District residents and researchers who rely on digital archives — whether tracing property records near the H Street NE corridor or accessing historical photographs of neighborhoods like Deanwood and Brightwood — the practical effect of the current gridlock is slower search results, mislabeled files, and occasional broken links to records that exist in multiple places but are properly accessible from none of them. Officials from Mayor Muriel Bowser's office have not announced a dedicated initiative to address the backlog, but the OCTO is expected to include digital asset governance benchmarks in its next annual technology strategic plan, due out in the fall.