Zurich's public institutions are sitting on a problem that has been building quietly for years: tens of thousands of duplicate digital images spread across municipal servers, archive databases and public-facing portals, with no unified policy governing how outdated or replicated files get identified, retired or replaced. The pressure to act has sharpened considerably in 2026, as the city's digital transformation office moves toward a consolidated data governance framework with an internal review deadline set for September 30.
The issue matters now for a specific reason. Zurich Stadtarchiv, housed on Neumarkt in the Altstadt, is midway through a digitisation programme that has already processed more than 1.2 million historical documents and photographs. When duplicate or low-resolution images go untagged, they pollute search results, inflate storage costs and — critically — risk becoming the canonical record that researchers and journalists rely on, displacing higher-quality originals. The same problem affects Stadt Zürich's open data portal, which publishes geodata, planning maps and photographic records drawn from multiple city departments that do not share a common metadata standard.
What the Institutions Are Wrestling With
At ETH Zürich, the university library on Rämistrasse has been developing an automated duplicate-detection pipeline using perceptual hashing — a technique that compares images at the pixel-cluster level rather than by file name or size. The approach can flag near-identical images even when they have been resized, recoloured or lightly edited. ETH library staff have been piloting the tool on a subset of the university's scientific image repository since January 2026, with early results suggesting that somewhere between 8 and 14 percent of stored images in any large institutional archive are functionally redundant. That range, drawn from comparable projects in European university libraries, gives Zurich's municipal planners a working benchmark even before city-specific numbers are compiled.
The Zentralbibliothek Zürich on Zähringerplatz faces a related but distinct version of the challenge. Its digital collections include newspaper photo archives acquired from multiple sources over two decades, many ingested without rigorous deduplication at the point of entry. The library is a partner in the Swiss national programme Swisscollections, which aggregates cultural heritage metadata across 900 institutions, meaning a duplicate in Zurich can propagate errors across the entire national index.
Cost is a concrete pressure. Commercial cloud storage billed to Swiss public bodies has risen sharply since 2023. Retaining redundant image files is no longer a trivial overhead. Municipal IT departments are under instruction from the Stadtrat to identify storage efficiencies as part of the broader fiscal consolidation agenda ahead of the 2027 budget cycle.
The Decisions That Cannot Be Delayed
Three choices are coming to a head before year's end. First, whether to mandate a single metadata standard — likely Dublin Core extended with Swiss-specific fields — across all city-affiliated archives, or to allow institutions to maintain separate schemas linked by a crosswalk. The first option is cleaner but requires retagging legacy collections at significant staff cost. The second preserves institutional autonomy but perpetuates the fragmentation that created the duplication problem.
Second, city planners must decide what happens to a replaced image's URL. Broken links in public planning documents, court records and news archives are a real and lasting harm. A redirect protocol — keeping old addresses alive and pointing to the canonical replacement — adds infrastructure cost but protects the integrity of historical references embedded in PDFs and official correspondence going back to the early 2000s.
Third, and most politically loaded, is the question of who decides which version of a duplicated image is authoritative. For historical photographs, that judgment carries genuine cultural weight. Stadtarchiv has proposed a curatorial review panel that would include a representative from the Heimatschutz Zürich preservation organisation, but the proposal has yet to be formally adopted.
The September 30 review date is not a final deadline — no binding ordinance is yet attached to it. But it is the point at which the digital transformation office will table its recommendation to the Stadtrat. Institutions that have not submitted their duplicate-detection audit results by then risk being excluded from the first round of replacement funding earmarked in the 2026 supplementary budget. For archives that have been waiting years for resources, that is a meaningful incentive to act before the summer is out.