major_clanger: Clangers (Royal Mail stamp) (Default)
[personal profile] major_clanger
Does your work (or hobby) involve archiving data, or looking at information that other people have archived? If so, you may be interested in the Digital Curation Blog, run by some people I know at Edinburgh University. Their latest post has some interesting snippets from a workshop on digital archiving, including this thought-provoking comment on storage formats:

...a half percent error rate in a BMP file shows a smattering of black pixels, whereas in a GIF file there were serious artefacts and visible damage introduced. Same error rate on a WAV file produces a barely audible rustle effect, while on a MP3 files sound is seriously distorted/. Same error rate on a DOC or PDF file, and you get “File damaged, cannot open”. Be very afraid!

Having done error-correction theory in my MSc (which included some painfully weird binary mathematics, some of which I understood for just long enough to pass the exam) it occurs to me that the overhead of adding error-correction coding to highly-compressed storage formats would be far less than the storage space saved my moving to them. An MP3 file is around a ten times smaller than the WAV file it is created from, so even going from 8 bits to 8+3 bits would still give a compression of seven times, with much improved resistance to bit errors. Anyone know anything about the use of ECC in highly-compressed lossy file formats?

Date: 2008-08-05 11:31 am (UTC)
From: [identity profile] marypcb.livejournal.com
this is why there's a move to XML formats for documents, away from binary, and why I'm so frustrated that people complain about Office 2007 - the British Library folk are very cogent on this

Profile

major_clanger: Clangers (Royal Mail stamp) (Default)
Simon Bradshaw

January 2022

S M T W T F S
      1
23 45678
9101112131415
16171819202122
23242526272829
3031     

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Feb. 8th, 2026 12:19 pm
Powered by Dreamwidth Studios