How Apple intends to check images for CSAM

Last week there was a surprise storm when Apple announced its plans to better protect our children. While I have my views on its proposals, which are almost certainly different from yours, what has become apparent is that few who have passed comment understand Apple’s proposals. Before deciding whether they’re a big step forward or a gross infringement of privacy, I believe it’s vital to understand just what Apple is intending to introduce.

For a start, there isn’t just one change but three. One is part of Parental Controls, and another consists of some fairly uncontentious improvements to Siri and Search. By far the most controversial is the third, which will apply to everyone who stores images in iCloud using iPhoto or Photos (which Apple somewhat anachronistically refers to as iCloud Photos): those images will be checked for CSAM (Child Sexual Abuse Material) before being uploaded. This raises many questions, in particular how images can be checked reliably, which is what I’m going to try to explain here.

Apple’s detailed announcement is essential reading, but its re-use of terms such as hashes can confuse, and it’s also easy to gain the impression that image matching is all performed by “an AI”. There’s much more to it, and while Machine Learning (ML) is involved, I don’t think those working in AI would consider this an AI system as such.

Image classification

We’re most used to ML being used to classify images, a function well-supported by Apple’s operating systems and used by both Apple’s and third-party apps. It’s now fairly straightforward to produce an app which can tell the difference between different painting styles, and hazard a good guess at who appears in your family photos. We’re also aware of the comical misclassifications which are quite common, and make for amusing tweets. That’s why classification alone would be useless for detecting CSAM.
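
For a flavour of how readily available this sort of classification is, here’s a minimal sketch using the Vision framework’s built-in classifier on macOS. The file path is hypothetical, and a real app would of course do far more.

```swift
import AppKit
import Vision

// A minimal sketch of on-device image classification with Apple’s Vision
// framework, of the kind a third-party app could run once granted access to
// your photos. The file path is hypothetical.
let url = URL(fileURLWithPath: "/tmp/example.jpg")
guard let image = NSImage(contentsOf: url),
      let cgImage = image.cgImage(forProposedRect: nil, context: nil, hints: nil) else {
    fatalError("Could not load the image")
}

let request = VNClassifyImageRequest()
let handler = VNImageRequestHandler(cgImage: cgImage, options: [:])
do {
    try handler.perform([request])
} catch {
    fatalError("Classification failed: \(error)")
}

// Print the top few labels and their confidence; the comical misclassifications
// mentioned above show up here as confidently wrong labels.
let observations = request.results as? [VNClassificationObservation] ?? []
for observation in observations.prefix(5) {
    print(observation.identifier, observation.confidence)
}
```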

While we’re thinking about others analysing your photos, though, remember that when you give an app full access to your photos in Privacy controls, you’re allowing it to rifle through them and run whatever classification methods it chooses. If you’re concerned that others might have an unhealthy interest in interpreting your Photos library, the first thing you need to do is keep a close watch on anything gaining access. Privacy dialogs may be a pain, but they have an important purpose too.

The perfect match

At the other end of the scale, file integrity checking apps use cryptographic hashes to check that files are identical down to the very last bit. To do this, they calculate a hash, a large binary number, and compare that with what it should be. Using this technique to match images guarantees that any matches are perfect, but it also fails to match images which we can see are identical or very close, yet differ slightly in their data. Maybe they’re JPEGs which use slightly different compression, one is a crop of the other, or the colour has been adjusted.
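
As a minimal sketch of that kind of exact matching, using Apple’s CryptoKit (the file path is hypothetical):

```swift
import CryptoKit
import Foundation

// A minimal sketch of file integrity checking with a cryptographic hash:
// change a single bit of the file and the SHA-256 digest changes completely.
// The file path is hypothetical.
let url = URL(fileURLWithPath: "/tmp/photo.jpg")
guard let data = try? Data(contentsOf: url) else {
    fatalError("Could not read the file")
}

let digest = SHA256.hash(data: data)
let hex = digest.map { String(format: "%02x", $0) }.joined()

// An exact comparison against a stored reference hash: any change at all breaks it.
print(hex)
```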

Trying to detect CSAM using cryptographic hashes would therefore be largely fruitless, and easily circumvented.

Matching colours

Allow me a slight diversion here for the sake of explanation. Let’s match colours instead of detailed images. Judging by eye, we’d take two swatches, view them under the same light, and compare them. You can do the same numerically using a colour-measurement device, which provides their colour co-ordinates.

How given levels of brightness (or lightness, when fixed to the brightest white), hue, and chroma combine to define a green.

There are several different systems for obtaining co-ordinates: above is the HSB method using Hue, Saturation and Brightness.

A CIE 1931 colour space chromaticity diagram using xyz co-ordinates, with a device gamut shown by the triangle.

Colour space diagrams like this using xyz co-ordinates are a bit more abstract, perhaps, but work similarly.

An app can determine whether two colour measurements match within a defined tolerance, which allows for subtle differences in the surfaces being compared, and to a degree in lighting.
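
As a toy sketch of that sort of tolerance test (the co-ordinates, distance measure and tolerance here are purely illustrative, not any standard colour-difference formula):

```swift
import Foundation

// A toy tolerance test for colour matching: treat each measurement as a point
// in a colour space and accept a match when the points are close enough.
struct ColourMeasurement {
    let x: Double, y: Double, z: Double
}

func matches(_ a: ColourMeasurement, _ b: ColourMeasurement, tolerance: Double) -> Bool {
    let distance = sqrt(pow(a.x - b.x, 2) + pow(a.y - b.y, 2) + pow(a.z - b.z, 2))
    return distance <= tolerance
}

let swatch1 = ColourMeasurement(x: 0.30, y: 0.60, z: 0.10)
let swatch2 = ColourMeasurement(x: 0.31, y: 0.59, z: 0.10)
print(matches(swatch1, swatch2, tolerance: 0.02))   // true: within tolerance
```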

PhotoDNA

Imagine now analysing an image by converting it to monochrome or even black and white, dividing it up into squares, and using those to produce a set of many co-ordinates which characterise that image. That’s essentially how PhotoDNA, developed by Hany Farid and Microsoft Research, works. Each image that you upload to social media, including Twitter and Facebook, is analysed by cloud servers to produce a ‘hash’, which is compared with a huge database of hashes of known CSAM.
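
To make the idea concrete, here’s a deliberately crude block-based perceptual hash. It is not PhotoDNA, whose details aren’t public, just a sketch in the same spirit: a grid of grey levels stands in for a downsampled image, and hashes are compared by Hamming distance (the number of differing bits).

```swift
import Foundation

// A deliberately crude block-based perceptual hash, in the same spirit as (but
// emphatically not the same as) PhotoDNA. Reduce the image to grey levels on a
// coarse grid, then record one bit per cell depending on whether that cell is
// brighter than the average.
func blockHash(greyGrid: [[Double]]) -> UInt64 {
    let values = greyGrid.flatMap { $0 }
    let mean = values.reduce(0, +) / Double(values.count)
    var hash: UInt64 = 0
    for (i, value) in values.prefix(64).enumerated() where value > mean {
        hash |= (1 as UInt64) << i
    }
    return hash
}

// Hashes of near-identical images should differ in only a few bits, so they are
// compared by Hamming distance rather than for exact equality.
func hammingDistance(_ a: UInt64, _ b: UInt64) -> Int {
    (a ^ b).nonzeroBitCount
}

// A toy 8×8 grid of grey levels stands in for a real downsampled image.
let grid = (0..<8).map { row in (0..<8).map { col in Double((row * 8 + col) % 7) / 6 } }
print(String(blockHash(greyGrid: grid), radix: 2))
```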

There’s a delicate balance here between sensitivity and specificity. If each hash excludes too many near-identical images, then it will be highly specific but not sensitive enough; if it includes any dissimilar images, then it will be too sensitive and not specific enough. The method used to generate the hashes is thus critical to the success of PhotoDNA.

Currently, PhotoDNA only runs on servers, and no one seems to know whether it would ever be suitable for popular devices like iPhones. Although it’s being extended to cover video as well, the computational effort involved may well be beyond what is reasonable for a mobile phone: it handles video by extracting key frames and performing similar hash-matching on those. For multi-shot videos lasting several minutes or more, that could well be demanding even for servers to perform in anything like real time.

NeuralHash

Apple has adopted a similar approach, but with some important differences.

The first step is to develop a system of numeric image descriptors, like colour co-ordinates, which characterise each image. This involves Machine Learning, to discover which co-ordinates are best suited to the task, learning on huge collections – hundreds of thousands – of CSAM images which have been gathered by organisations like NCMEC. What this produces is a set of (real) numbers for each image and its variants, which are then combined into a unique hash for that group of images. The first of those steps uses a convolutional neural network, hence the name NeuralHash.

The goal here is to ensure that an iPhone can compute a NeuralHash for an image in real time, and quickly compare that against a local database of NeuralHash values for known CSAM. Each NeuralHash has to have high sensitivity and specificity, so that the chances of false positives and negatives are extremely low.

Training the convolutional neural network involves presenting it with pairs of images. Some consist of an original CSAM image and a modified version of it which is perceived to be the same; other pairs consist of the original image and an image which isn’t seen as being the same at all. Only when the neural network handles both kinds of pair correctly is it ready for testing.
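
That style of training is commonly expressed as a pairwise (contrastive) objective. The sketch below shows the general shape of such a loss on toy descriptor vectors; the margin and the numbers are illustrative, and this isn’t Apple’s training code.

```swift
import Foundation

// The general shape of a pairwise (contrastive) objective: descriptors of an
// original and a perceptually identical variant are pulled together, while
// descriptors of dissimilar images are pushed at least `margin` apart. The
// network producing the descriptors is omitted.
func contrastiveLoss(_ a: [Double], _ b: [Double], samePair: Bool, margin: Double = 1.0) -> Double {
    let squared = zip(a, b).map { pair in (pair.0 - pair.1) * (pair.0 - pair.1) }.reduce(0, +)
    let distance = sqrt(squared)
    return samePair ? squared : pow(max(0, margin - distance), 2)
}

// An original and a lightly modified copy should contribute a small loss…
print(contrastiveLoss([0.2, 0.8, 0.1], [0.21, 0.79, 0.1], samePair: true))
// …while a dissimilar image only contributes loss if it strays inside the margin.
print(contrastiveLoss([0.2, 0.8, 0.1], [0.9, 0.1, 0.7], samePair: false))
```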

Matching NeuralHashes

The set of real numbers for each image cluster is extremely large, and not amenable to direct comparison. The integer NeuralHash is therefore generated by reducing each of the measured co-ordinates to a single bit, according to which side of a dividing hyperplane it falls, a process known as Hyperplane Locality Sensitive Hashing.
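
Here’s a sketch of that hashing step, using random hyperplanes and illustrative dimensions rather than anything learned:

```swift
import Foundation

// A sketch of hyperplane Locality Sensitive Hashing: each hyperplane contributes
// one bit, set according to which side of the plane the image descriptor falls.
// Similar descriptors land on the same side of almost every plane, so they share
// almost all of their bits. Dimensions and planes here are illustrative.
func hyperplaneLSH(descriptor: [Double], hyperplanes: [[Double]]) -> UInt64 {
    var hash: UInt64 = 0
    for (i, plane) in hyperplanes.enumerated() {
        let dot = zip(descriptor, plane).map { pair in pair.0 * pair.1 }.reduce(0, +)
        if dot >= 0 { hash |= (1 as UInt64) << i }
    }
    return hash
}

// 64 random hyperplanes for a toy 8-dimensional descriptor.
let planes = (0..<64).map { _ in (0..<8).map { _ in Double.random(in: -1...1) } }
let descriptor = [0.3, -0.7, 0.2, 0.9, -0.1, 0.4, -0.5, 0.6]
let nearCopy   = [0.31, -0.69, 0.2, 0.9, -0.1, 0.41, -0.5, 0.6]
let h1 = hyperplaneLSH(descriptor: descriptor, hyperplanes: planes)
let h2 = hyperplaneLSH(descriptor: nearCopy, hyperplanes: planes)
print((h1 ^ h2).nonzeroBitCount)   // usually zero, occasionally one differing bit
```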

Apple then uses an extended version of a Private Set Intersection (PSI) protocol to match the NeuralHashes of images to be transferred to iCloud Photos against those of known CSAM, which are supplied by organisations like NCMEC. This has to be performed so that NeuralHashes of ‘innocent’ images (which don’t match known CSAM) aren’t released from the device to Apple, and so that Apple’s server only learns about matches once their number exceeds a set threshold. If the threshold is set at five, for instance, the server learns nothing until there are more than five matches. That is achieved using Threshold Secret Sharing. Additional protection for the user is provided by the use of a Safety Voucher. Apple describes each of these quite elaborate protections in its documentation, available from here.
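
The threshold idea itself can be illustrated with a toy version of threshold secret sharing (Shamir’s scheme) over a small prime field: fewer shares than the threshold reveal nothing about the secret. This is only a sketch of that one idea; Apple’s actual construction, combined with PSI and Safety Vouchers, is far more elaborate, and all the parameters below are made up.

```swift
import Foundation

// Toy Shamir threshold secret sharing over a small prime field.
let prime = 257   // small prime field, enough for a one-byte ‘secret’

func modPow(_ base: Int, _ exponent: Int, _ modulus: Int) -> Int {
    var result = 1, b = base % modulus, e = exponent
    while e > 0 {
        if e & 1 == 1 { result = result * b % modulus }
        b = b * b % modulus
        e >>= 1
    }
    return result
}

func modInverse(_ a: Int, _ modulus: Int) -> Int {
    // Fermat’s little theorem, valid because the modulus is prime.
    modPow((a % modulus + modulus) % modulus, modulus - 2, modulus)
}

/// Split `secret` into `n` shares, any `threshold` of which reconstruct it.
func makeShares(secret: Int, threshold: Int, n: Int) -> [(x: Int, y: Int)] {
    // A random polynomial of degree threshold-1 whose constant term is the secret.
    let coefficients = [secret] + (1..<threshold).map { _ in Int.random(in: 0..<prime) }
    return (1...n).map { x -> (x: Int, y: Int) in
        var y = 0
        for (power, c) in coefficients.enumerated() {
            y = (y + c * modPow(x, power, prime)) % prime
        }
        return (x: x, y: y)
    }
}

/// Lagrange interpolation at x = 0 recovers the secret from `threshold` shares.
func reconstruct(from shares: [(x: Int, y: Int)]) -> Int {
    var secret = 0
    for (i, si) in shares.enumerated() {
        var numerator = 1, denominator = 1
        for (j, sj) in shares.enumerated() where j != i {
            numerator = numerator * (prime - sj.x) % prime
            denominator = denominator * (((si.x - sj.x) % prime + prime) % prime) % prime
        }
        secret = (secret + si.y * numerator % prime * modInverse(denominator, prime)) % prime
    }
    return secret
}

let shares = makeShares(secret: 42, threshold: 5, n: 12)
print(reconstruct(from: Array(shares.prefix(5))))   // 42: the threshold is met
print(reconstruct(from: Array(shares.prefix(4))))   // almost certainly not 42: too few shares
```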

The threshold serves another purpose: controlling the rate of false positives. There is a very small risk that any image matching could produce a positive result when the images are different and shouldn’t have been matched. Apple should have a good idea of that risk from testing of the method used to generate NeuralHashes. To further reduce the risk of such errors occurring, the system requires more than one positive match to occur before Apple is notified of the likelihood of CSAM content in the images to be transferred to iCloud Photos. And tweaking the threshold is a simple way to adjust the sensitivity of the whole matching system.
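
To see why the threshold matters for false positives, here’s a back-of-envelope calculation. The per-image false-positive rate and library size are entirely hypothetical, chosen only to show the shape of the arithmetic, not figures published by Apple.

```swift
import Foundation

// How the threshold controls the chance of an accidental report.
func choose(_ n: Int, _ k: Int) -> Double {
    // Binomial coefficient as a Double, adequate for small k.
    var result = 1.0
    for i in 0..<k {
        result *= Double(n - i) / Double(k - i)
    }
    return result
}

/// Probability that at least `threshold` of `n` independent image comparisons
/// produce a false positive, given a per-image false-positive rate `p`.
func probabilityOfReachingThreshold(n: Int, p: Double, threshold: Int) -> Double {
    var below = 0.0
    for k in 0..<threshold {
        below += choose(n, k) * pow(p, Double(k)) * pow(1 - p, Double(n - k))
    }
    return 1 - below
}

// A hypothetical library of 10,000 photos and a one-in-a-million per-image rate:
// requiring several matches makes an accidental report vanishingly unlikely.
print(probabilityOfReachingThreshold(n: 10_000, p: 1e-6, threshold: 1))
print(probabilityOfReachingThreshold(n: 10_000, p: 1e-6, threshold: 5))
```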

Extension

PhotoDNA was extended in 2016 to cover not only CSAM but images and video with extremist and terrorist content. As it performs matching on cloud servers, such extensions can be accommodated relatively easily.

Apple’s matching relies completely on local computation of the NeuralHash for each item to be checked. If it were to adopt the key frame approach to matching video too, each video to be checked would need to be decoded, its key frames identified and extracted, then a NeuralHash computed on every one. Mobile devices with limited memory and constraints on multitasking are far from ideal platforms for such tasks.

There are also issues with extending the types of images matched. Operating the same system to match images out of the domain on which the convolutional neural network was trained and tested carries significant risk of loss of sensitivity and specificity. Extending matches to different types of image isn’t just a matter of adding more NeuralHashes to the list, and only Apple can tell how domain-specific its current solution is. In some domains, matching performance could be so poor as to be as comical as flawed attempts at image classification.

Does it need to work well?

There’s an argument, with support from Game Theory, that Apple only needs to set a high threshold for the number of matches and detect and report a few cases of CSAM. Indeed, even that may prove unnecessary: the mere existence of the scheme may be enough to drive anyone currently sharing CSAM to abandon the use of iCloud Photos altogether.

The worst outcome would be a high rate of false positives, which would only strengthen arguments against these measures. Whether any outcome would result in more prosecutions or any reduction in CSAM is a matter of speculation, as are the motives of those involved. What is abundantly clear, though, is that Apple has already devoted extensive engineering and development effort to this, and is unlikely to abandon the scheme in a hurry.