The Daily Zurich

Zurich news, every day

News

Zurich's Digital Archive Reckoning: What Officials, Experts and Key Figures Are Saying About the Duplicate Image Crisis

From Stadtarchiv Zurich to ETH Zürich's image labs, the push to purge duplicate photos from public databases is drawing sharp responses across the city's cultural and tech institutions.

By Zurich News Desk · Published 4 July 2026, 9:23 pm

3 min read

Zurich's Digital Archive Reckoning: What Officials, Experts and Key Figures Are Saying About the Duplicate Image Crisis
Photo: Photo by Fran Zaina on Pexels

Zurich's public institutions are sitting on a problem that has quietly compounded for more than a decade: tens of thousands of duplicate images clogging digital archives, degrading search results, and consuming server capacity that costs real money. Stadtarchiv Zürich, the city's official historical records body headquartered near Neumarkt, confirmed this spring that an internal audit flagged significant redundancy across its digitised photo collections — though the archive has not yet published final figures from that review.

The issue matters now because the city is mid-way through a broader digitisation push tied to Zürich's 2025–2030 smart-city strategy, which allocates funding across municipal departments for data infrastructure upgrades. As collections grow, unresolved duplication does not merely waste storage — it undermines the accuracy of public-facing tools, from heritage portals used by researchers at ETH Zürich to neighbourhood planning databases consulted by residents navigating the city's entrenched Wohnungsnot housing crisis.

Where the Pressure Is Coming From

ETH Zürich's Computer Vision Lab, based on the Hönggerberg campus, has spent several years developing automated deduplication tools designed for large institutional image sets. Researchers there have presented findings at international conferences arguing that hash-based matching alone — the most common method used by Swiss public archives — catches only the most obvious copies and misses near-duplicates created by re-scanning, colour correction, or format conversion. Their position, communicated through published academic work rather than public statements, is that perceptual hashing combined with machine-learning classifiers represents the current standard.

Kunsthaus Zürich, which completed a major expansion into the Chipperfield wing in 2021, has also faced the duplication question internally as it merged legacy image databases with new digital acquisition records. The institution has not publicly described the scale of the problem in its own holdings, but archivists in the broader Swiss museum sector have noted in professional forums that post-merger database consolidation — familiar language to anyone who followed the UBS-Credit Suisse integration — almost inevitably surfaces redundant image records.

At Stadt Zürich's Amt für Städtebau, planners working with historical aerial photography along the Limmat corridor have flagged to council committees that duplicated image records slow down geodata workflows. The directorate handles spatial planning for districts including Aussersihl and Wiedikon, where redevelopment pressure is highest. Processing delays, even minor ones, carry real administrative cost in a city where office rents on Bahnhofstrasse rank among Europe's highest and staff time is priced accordingly.

What the Experts Recommend

Professionals in the Swiss digital preservation sector broadly agree on a three-step response: audit first with automated tooling, set institutional deduplication policy before the next major ingestion cycle, and allocate a dedicated remediation budget rather than treating cleanup as a background task. The challenge is that Zurich's public institutions operate under cantonal procurement rules that can make rapid software adoption slow — a tension that came up repeatedly during discussions around the city's 2024 digitisation procurement review.

The financial dimension is not trivial. Cloud storage pricing for cultural institutions in Switzerland, where data residency requirements push organisations toward Swiss-hosted infrastructure providers, runs considerably higher than equivalent European Union options. Estimates circulating in Swiss library and archive professional networks suggest institutional image archives in the mid-size range — holding between 500,000 and two million assets — can reduce annual storage costs by 15 to 30 percent through systematic deduplication, though those figures depend heavily on the format and compression of the original files.

For institutions that have not yet begun the process, archivists advise running an initial automated scan before the summer recess ends in August, when IT teams return to full strength and budget planning for 2027 begins in earnest at cantonal level. Stadtarchiv Zürich's audit timeline points to a public progress report later this year. How detailed that report turns out to be — and whether it prompts a coordinated response across Zürich's broader network of cultural institutions — will determine whether this remains a technical footnote or becomes a line item in the city's next digital infrastructure debate.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Zurich

This article was produced by the The Daily Zurich editorial desk and covers news in Zurich. See our editorial standards for how we use AI.

The Daily Zurich brief

The day's Zurich news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Zurich and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Zurich news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Zurich and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Zurich

More in News

Enjoyed this story? Get tomorrow's briefing free.