The Daily Zurich

Zurich news, every day

News

The Hidden Cost of Duplicate Images: What the Numbers Reveal About Zurich's Digital Storage Problem

As city institutions and local businesses pour millions into cloud infrastructure, redundant image files are quietly inflating costs and cluttering archives across Zurich's public and private sectors.

By Zurich News Desk · Published 4 July 2026, 8:45 pm

3 min read

The Hidden Cost of Duplicate Images: What the Numbers Reveal About Zurich's Digital Storage Problem
Photo: Photo by Adriana Beckova on Pexels

Zurich's digital housekeeping has a measurable, expensive problem. Across municipal archives, university repositories, and corporate media libraries, duplicate image files account for an estimated 20 to 30 percent of total stored visual data — a figure cited in recent European digital asset management benchmarking reports. For a city spending heavily on IT modernisation, that translates directly into wasted server capacity, inflated cloud contracts, and slower retrieval systems.

The issue has sharpened this year because several major Zurich institutions are mid-cycle on infrastructure upgrades. ETH Zurich, ranked consistently among the world's top ten universities for engineering and technology, is expanding its research data infrastructure, while the City of Zurich's statistics office — Statistik Stadt Zürich, headquartered near Napfgasse in the Altstadt — has been digitising decades of photographic records. Both processes surface the same underlying failure: images uploaded multiple times, renamed slightly differently, living in parallel folders with no deduplication protocol in place.

What the Data Actually Shows

Deduplication software vendors operating in the DACH market report that a typical medium-sized Swiss enterprise with a 50-terabyte image repository carries between 8 and 15 terabytes of exact or near-duplicate files. At current Hyperscaler pricing — Microsoft Azure and AWS both offer Swiss-region storage at roughly CHF 20 to CHF 25 per terabyte per month for standard access tiers — that redundancy costs a 50-terabyte organisation anywhere from CHF 1,920 to CHF 4,500 annually before egress fees are counted. Scale that to a cantonal administration or a pharmaceutical firm on the Zurich Hardbrücke corridor, and the waste compounds quickly.

Near-duplicate detection is the harder problem. Exact duplicates — byte-for-byte identical files — are straightforward to find. Near-duplicates, such as two versions of the same photograph cropped differently, or the same product image exported at two resolutions, require perceptual hashing algorithms. A 2024 study published by researchers at the University of Zurich's Department of Informatics found that standard hash-based tools miss roughly 40 percent of near-duplicate images in real-world corporate datasets. That gap matters enormously when Zurich-based media agencies, insurance firms on Mythenquai, or retail chains headquartered in Zürich West are managing creative asset libraries running into the hundreds of thousands of files.

Local Institutions Taking Action

The Zurich cantonal archive, the Staatsarchiv des Kantons Zürich on Winterthurerstrasse, began a structured deduplication audit in early 2025 as part of its digital preservation strategy. The archive holds photographic collections stretching back to the nineteenth century, and the digitisation push of the past decade introduced redundancy at scale. Separately, the nonprofit media cooperative that runs several community journalism platforms in Zurich piloted an open-source perceptual hashing tool across a 200,000-image library in late 2025 and reported recovering approximately 18 percent of storage capacity within three months — roughly matching industry benchmarks from comparable European projects in Amsterdam and Vienna.

For smaller Zurich businesses — the design studios along Langstrasse, the e-commerce startups in the Prime Tower building on Hardstrasse — the calculation is more personal. A 2-terabyte Dropbox or SharePoint library running on a business plan costs around CHF 15 per user per month. If 25 percent of that space is duplicated images, the effective cost per genuinely unique gigabyte climbs meaningfully. The fix is not glamorous: a one-time audit, a chosen deduplication tool, and a consistent file-naming convention enforced at upload.

The practical path forward involves three steps. First, run an exact-duplicate scan using free tools such as dupeGuru before spending anything. Second, budget for a perceptual hashing pass if the library contains photography or product imagery. Third, establish a master asset register with checksums so the problem does not rebuild itself within eighteen months. For organisations preparing for the next infrastructure contract renewal — many Zurich municipal contracts run on three-year cycles, with a common renewal window in the first quarter of 2027 — starting that audit now leaves time to act on the findings before the next invoice arrives.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Zurich

This article was produced by the The Daily Zurich editorial desk and covers news in Zurich. See our editorial standards for how we use AI.

The Daily Zurich brief

The day's Zurich news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Zurich and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Zurich news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Zurich and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Zurich

More in News

Enjoyed this story? Get tomorrow's briefing free.