Washington's public agencies are sitting on a growing digital mess. Duplicate images — identical or near-identical photographs, scans, and graphics stored multiple times across separate servers — have quietly become one of the most stubborn inefficiencies in municipal and federal record-keeping, and the people responsible for fixing it are increasingly vocal about what it will take.
The problem is sharpening now for a specific reason: the Trump administration's push through the Department of Government Efficiency to slash operational costs has put every line item under scrutiny, including cloud storage contracts that ballooned during the pandemic-era shift to remote work. For the District government, which operates its own sprawling digital infrastructure separately from federal systems, the audit pressure is compounding an existing backlog.
Where the Redundancy Lives
The DC Office of the Chief Technology Officer, headquartered on 441 4th Street NW, manages data systems across roughly 80 city agencies. Digital asset management — the formal term covering image libraries, document scans, and multimedia files — falls partly under its purview. The DC Public Library system, which runs 26 branches including the MLK Jr. Memorial Library on G Street NW, maintains separate digital collections that archivists have flagged internally as containing substantial duplication from years of ad hoc scanning projects.
The Martin Luther King Jr. Memorial Library alone digitized tens of thousands of historical photographs during a major renovation project that wrapped in 2020, and library technology specialists have noted that cross-referencing those files against the existing Washingtoniana collection revealed significant overlap. No official deduplication audit has been completed and published as of this writing.
The DC Office of Budget and Finance reported in its fiscal year 2025 technology spend overview that cloud storage costs for District agencies rose substantially over the prior three-year period, though the office has not broken out what share of that increase is attributable to redundant file storage specifically. Technology consultants who work with mid-sized municipal governments estimate that duplicate digital assets can account for between 20 and 30 percent of total storage consumption in organizations that lack automated deduplication protocols — a range that, applied to DC's scale, would represent a meaningful budget line.
What the Experts Are Recommending
Digital archivists at institutions like the Smithsonian Institution — which operates its primary administrative offices near the Mall and manages one of the largest open-access image repositories in the world — have advocated publicly for hash-based deduplication tools that can identify byte-for-byte identical files automatically. The approach is standard in private-sector data management but adoption across public agencies remains inconsistent.
Urban technology policy researchers affiliated with American University's School of Public Affairs in Tenleytown have argued that the real cost is not just storage fees. When an agency's image library contains dozens of copies of the same photograph filed under different names or dates, it degrades the reliability of public records requests, slows Freedom of Information Act responses, and creates legal exposure if a record is produced in litigation without certainty that it is the definitive version.
The tension between Mayor Muriel Bowser's administration and the federal government over funding streams adds another layer of urgency. Several District technology modernization grants that had been routed through federal agencies are under review or have been paused under the current administration's restructuring priorities. That leaves the OCTO with less external funding to invest in the kind of enterprise content management overhaul that would address duplication at scale.
For residents and organizations dealing with DC government digital systems — particularly nonprofits in neighborhoods like Anacostia and NoMa that rely on publicly accessible city data for grant applications and planning work — the practical advice from digital governance advocates is straightforward: when requesting images or records from city agencies, ask explicitly whether the file provided is the canonical version held in the primary records system, and note the file metadata including creation date and last-modified timestamp. That documentation matters if the record is ever disputed. Meanwhile, the agencies themselves face a harder task — and a ticking clock on storage bills that are unlikely to get smaller on their own.