Checking data integrity

Checksums and CRC can be very quick, but have significant drawbacks. SHA-256 hashes are best, although there are also error-correcting codes used in optical disks.

Explainer: Error-correcting files

How to design a method which not only detects errors in file data, but enables them to be corrected? A tale of Hamming, Reed, Solomon and CDs.

Efficient resilient storage

Storage has to be reliable, efficient and resilient. However, efficiency and resilience oppose one another. What’s the best solution? New file formats, CRC in the file system, or what?

File Integrity 12 : Error correction for large files

Can you use error-correcting code to repair very large files, for example of around 20 GB or more?

File Integrity 10 : Effects of length of corruption on images and ECC recovery

What difference does it make when a file has a block of 512 bytes corrupted instead of just a single byte? Results from image formats and recovery using ECC.

File Integrity 9 : How error-correcting codes work

How can you squeeze recovery data into smaller storage space than you’d need for a second copy of a file? Using codes, explained here.

Last Week on My Mac: Why file integrity is important

For all their compactness and ease of access, are our files going to prove less durable than a clay tablet recording a commercial transaction over 4,000 years ago?

File Integrity 8 : Compression, encryption and disk images

Tests bring some surprises, with encrypted sparse bundles looking resilient to small amounts of corruption.

File Integrity 7 : Which other file formats are resilient?

Looks at plain text, CSV, XML, JSON, RTF, RTFD, .docx, .xlsx, and PDF. Which should you trust with your important documents in archives?

File Integrity 6 : Which image format is most resilient?

Which format – alongside Camera Raw – should you store archived images in: JPEG, PNG, TIFF or Apple’s new HEIC?

The Eclectic Light Company

error correcting