The Daily Zurich

Zurich news, every day

News

Zurich Leads on Duplicate Image Replacement in Public Archives — But Other Cities Are Closing the Gap

As digital catalogues balloon and AI scanning tools mature, how Zurich's institutions are tackling the problem of redundant imagery compared to Amsterdam, Vienna and Singapore.

By Zurich News Desk · Published 4 July 2026, 8:45 pm

3 min read

Zurich Leads on Duplicate Image Replacement in Public Archives — But Other Cities Are Closing the Gap
Photo: Photo by Natalia Sevruk on Pexels

Zurich's city archive, the Stadtarchiv Zürich on Alfred-Escher-Strasse, completed the first phase of its duplicate image replacement programme in June 2026, clearing more than 14,000 redundant scans from its publicly accessible digital catalogue — a figure the archive confirmed in its mid-year operational report. The clean-up, running since January, used automated perceptual hashing tools cross-referenced with manual curatorial review to flag identical or near-identical image records before replacing low-resolution duplicates with a single high-resolution master file.

The timing matters. Zurich's cultural institutions have spent the past three years accelerating mass-digitisation after a federal push to make Swiss heritage collections fully searchable by 2028. That sprint produced results — and clutter. The Zentralbibliothek Zürich on Zähringerplatz, which holds one of the largest newspaper photograph collections in the German-speaking world, acknowledged publicly in its 2025 annual report that duplicate entries had inflated its publicly listed image inventory by an estimated 18 percent. Weeding that out is not cosmetic housekeeping; duplicates generate broken metadata chains, confuse rights-clearance workflows and cost storage budget that institutions in a city with Zurich's cost structure feel acutely.

How Zurich Compares With Amsterdam and Vienna

The approach differs sharply from what comparable European institutions are doing. Amsterdam's Stadsarchief — the city archive that holds roughly 750,000 publicly accessible photographs — opted two years ago for a crowd-sourced tagging model, inviting registered users to flag suspected duplicates before professional staff review them. The model is slower but has surfaced contextual errors that purely algorithmic tools miss: two images that look identical but were taken minutes apart during the same event, for instance, can carry different documentary value.

Vienna's Wienbibliothek im Rathaus took a different route again, contracting with a specialist digital preservation firm in 2024 to run a one-time bulk deduplication before migrating to a new content management system. The advantage was speed. The risk, which Viennese archivists have discussed at conferences including the European Conference on Digital Preservation held in Edinburgh last October, is that a one-time purge does not build institutional capacity for ongoing maintenance.

Singapore's National Archives ran its own duplicate-reduction programme across its visual collections between 2023 and 2025, applying AI-assisted clustering to a holdings set exceeding two million items. The programme reduced active catalogue entries by approximately 11 percent, according to figures the National Archives of Singapore published in its 2025 corporate report. Crucially, Singapore paired the technical work with updated acquisition guidelines to prevent duplication at the point of ingest — something Zurich's institutions have not yet formalised at city level.

What Zurich's Institutions Are Building Next

The Stadtarchiv's next phase, scheduled to begin in September 2026, will extend the deduplication review to its architectural photography holdings — a collection documenting Zurich's building stock from the Kreis 4 and Kreis 5 neighbourhoods, areas that underwent intensive redevelopment documentation between 1980 and 2010. Those records are among the most heavily duplicated because multiple departments commissioned photography of the same sites during the same periods.

ETH Zürich's computer science department has a working relationship with the Stadtarchiv on image-recognition research, and doctoral work published through the university's e-collection repository has fed directly into the hashing methodology the archive is now using. That proximity to a globally ranked research institution gives Zurich an advantage that cities without an equivalent technical university partnership — such as many mid-sized European capitals — have to buy in from the private sector at considerably higher cost.

For residents and researchers who use Zurich's public digital collections, the practical effect will be felt in the search interface on the Stadtarchiv's online portal, where duplicate results have historically pushed relevant unique images down the results page. The archive has indicated the portal will reflect the cleaned catalogue by the end of the third quarter of 2026. Anyone relying on those collections for property research, journalism, or academic work should cross-check any image records accessed before that date against the post-cleaning catalogue once it goes live.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Zurich

This article was produced by the The Daily Zurich editorial desk and covers news in Zurich. See our editorial standards for how we use AI.

The Daily Zurich brief

The day's Zurich news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Zurich and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Zurich news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Zurich and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Zurich

More in News

Enjoyed this story? Get tomorrow's briefing free.