hoakley January 18, 2023 Macs, Technology

Is Apple checking images we view in the Finder?

Ever since Apple was unwise enough to suggest that it might check certain images to see whether they were Child Sexual Abuse Material (CSAM), rumours have been rife that it has pressed ahead and now does that. I gather a new claim is being pushed out that this is performed in Ventura 13.1, so this article is an attempt to determine whether there’s any truth in that.

This claim boils down to Apple automatically being sent identifiers of images that a user has simply ‘browsed in the Finder’ without that user’s consent or awareness. I should make it clear that this hasn’t been demonstrated: as far as I’m aware the only evidence provided is that a Mac on which images were being ‘browsed in the Finder’ tried to make an outgoing connection from mediaanalysisd to an Apple server at that time, as revealed by the software firewall Little Snitch.

When Apple was intending to check for CSAM, it kindly explained how it aimed to do that, by generating identifiers, known as neural hashes, for images. A moment’s thought should indicate that uploading every image is neither sensible nor practical; instead, some form of concise identifier is essential. Unlike normal hashes, which are intended to amplify the smallest change in the source file, neural hashes are intended to distinguish images according to their content and characteristics.

Not only did Apple explain the principles of its intended detection system, but it gave us a free demonstration of those in macOS Monterey, with Visual Look Up (VLU). That enables your Mac, with a little help from Apple’s servers to match neural hashes, to identify paintings, breeds of dog and cat, and sundry other subjects in images.

I have taken a deep look inside the processes involved in VLU, to the point where one of my free apps, Mints, can easily obtain a full account of them from the Unified log. I therefore performed a series of tests in a macOS 13.1 virtual machine running in Viable on my Mac Studio M1 Max, to discover what might explain the observation reported, and whether that supported the claim being made.

Gallery browsing

vlu01

To get an idea of whether mediaanalysisd or any other component involved in image analysis or neural hash generation was active when looking through images in a Gallery window in the Finder, I loaded 18 assorted images in different formats into the ~/Documents folder of my VM, opened a gallery view of them, and looked through them for a period of one minute. I then captured all log entries for that period, a total of more than 40,000, and saved that excerpt to a file, using Ulbow. Not only was there no evidence of any image analysis taking place, but in that period there were no log entries from mediaanalysisd at all. Not one.

I repeated this over a period of 30 seconds, this time using Mints to display all log entries associated with VLU and Live Text. There were none at all in that period.

Visual Look Up

Although I studied VLU and Live Text in detail in Monterey, before going any further I wanted to confirm that they behave similarly in macOS 13.1, and write similar sequences of messages in the log. I therefore obtained a log extract using Mints for single image VLU using Preview. This confirmed that messages and processes appear very similar to those I had analysed before. These are summarised in the following diagram.

VisualLookUp1

Note that mediaanalysisd doesn’t contact Apple’s servers until late in the process, to perform matching of the neural hashes generated by the preceding image analysis. The response from those servers then enables VLU results to be displayed in a window over the image.

QuickLook Preview

Although the original description given was ‘Finder browsing’, for some that might include the display of images as QuickLook Previews, by selecting the image and pressing the Spacebar. In my previous examination of VLU and Live Text, this wasn’t a feature that I had investigated. I therefore obtained log excerpts for two images being opened in QuickLook Preview. One of those images contained some handwritten text, the other did not.

vlu02

For both images, VisionKit initiated image analysis when the image was being opened in its preview window. For the image which didn’t contain text, this completed in a total processing time of 615 ms, failed to recover any text from that image, and attempted no remote connections. The image containing text took longer, 881 ms, and returned text of length 65 ‘DD’ (as given in the log) after a considerably more elaborate series of processes, including one outgoing secure TCP or Quic connection by mediaanalysisd lasting 58 ms, before the completion of Visual Search Gating.

This is consistent with the briefer task used in Live Text, and quite different from VLU. There is thus no evidence of the generation of neural hashes or any search query by PegasusKit typical of the later stages of VLU.

Conclusions

There is no evidence that local images on a Mac have identifiers computed and uploaded to Apple’s servers when viewed in Finder windows.
Local images that are viewed in QuickLook Preview undergo normal analysis for Live Text, and text recognition where possible, but that doesn’t generate identifiers that could be uploaded to Apple’s servers.
Images viewed in apps supporting VLU have neural hashes computed, and those are uploaded to Apple’s servers to perform look up and return its results to the user, as previously detailed.
VLU can be disabled by disabling Siri Suggestions in System Settings > Siri & Spotlight, as previously explained.
Users who want to block all such external mediaanalysisd look-ups can do so using a software firewall to block outgoing connections to Apple’s servers by that process through port 443. That may well disable other macOS features.
Trying to harvest VLU neural hashes to detect CSAM would be doomed to failure for many reasons, most of which were raised with Apple at the time of its original proposals, and remain valid today.
Alleging that a user’s actions result in controversial effects requires demonstration of the full chain of causation. Basing claims on the inference that two events might be connected, without understanding the nature of either, is reckless if not malicious.

If you doubt the accuracy or veracity of anything I have written above, then all the tools that I used are free, available from the links I’ve provided, and I look forward to reading your results.

51Comments

Add yours

1

Brian on January 18, 2023 at 12:55 pm

Am I reading it correctly that there were 40,000 log entries after looking at your gallery for 1 minute? Wow.

LikeLiked by 1 person
- 2
  
  hoakley on January 18, 2023 at 8:59 pm
  
  Yup. Not uncommon too. And that’s a VM, which lacks a lot of the chatter from Wi-Fi etc.
  Howard.
  
  LikeLike
3

CV on January 18, 2023 at 6:06 pm

As usual, great analysis and write-up by Howard – thank you sir. My view on this whole thing is this – if Apple really wants to monitor both client-side images and what users view, then they will just do it and mask the process beyond the point of detection. For example, they wouldn’t send out inquiries or any metadata at the time of viewing but rather at some other time when the system is especially busy with network traffic so as to obscure such communications. I would also predict that Apple would not have the watchful processes log anything whatsoever. I mean really, why would they?

If one wants to know for sure whether or not Apple is doing something this nefarious, well you will need access to the source code and then the ability to tap into all encrypted network traffic from the client machine back to the mother ship. Absent that, your best bet is to not use macOS at all and instead go with a more open OS.

All my $0.02…

LikeLiked by 1 person
- 4
  
  hoakley on January 18, 2023 at 9:07 pm
  
  Thank you.
  The strange thing here is that Apple would find it incredibly expensive to be nefarious. To do so would require writing a great deal of custom support services, for instance to handle TLS connections and for the ML work. This is because those subsystems are so garrulous in the Unified log. While it’s not at all easy to work out where a TLS connection is going, or what’s being transferred by it, each is accompanied by a torrent of log entries, for instance by “boringssl”. And not even Apple can just turn those off, as they’re written into its code.
  So yes, Apple could, but to do so without log evidence would require heavy investment in software engineering to avoid leaving a discoverable trail.
  That’s why I use the log so much for this type of investigation.
  Howard.
  
  LikeLike
5

Ross Fisher on January 18, 2023 at 11:22 pm

While I appreciate your analysis, I think the video from Louis Rossman summarizes it well: [link removed by hoakley: see comments]

I sold my Apple products due to an unrelated issue earlier this week. Totally not interested in whatever network activity is happening here.

LikeLiked by 1 person
- 6
  
  hoakley on January 18, 2023 at 11:24 pm
  
  Thank you.
  So what facts am I missing?
  Howard
  
  LikeLike
- 7
  
  hoakley on January 19, 2023 at 6:23 am
  
  I have now had a chance to review that video, and as a consequence remove the link you gave, on the grounds that it is a breach of copyright, and consists of disinformation. The video consists of someone reading a blog article written by someone else from start to finish, adding misleading comments about John Deere tractors (really). That article is riddled with factual errors and histrionics, which the reader repeats completely uncritically.
  If that’s your level of misunderstanding, then you’re doomed.
  Howard.
  
  LikeLiked by 1 person
  - 8
    
    Nathan on January 25, 2023 at 5:49 pm
    
    Well, actually I was on your side, but to me you lost a bit of credibility when you typed “remove the link you gave, on the grounds that it is a breach of copyright” lol. You should maybe research the concept of Fair Use on Wikipedia.
    
    1) “the purpose and character of the use, including whether such use is of a commercial nature or is for nonprofit educational purposes” — “criticism, comment, news reporting, … scholarship, or research, is not an infringement of copyright”
    2) “the nature of the copyrighted work”
    3) “the amount and substantiality of the portion used in relation to the copyrighted work as a whole” — note that: “using most or all of a work does not [necessarily] bar a finding of fair use” (wP)
    4) “the effect of the use upon the potential market for or value of the copyrighted work” — Rosman cited the blog post in the description of the video. I clicked the link and read it. So, it seems to me, the effect of Rosman’s video was to drive traffic to the person’s website.
    
    LikeLiked by 1 person
    - 9
      
      hoakley on January 25, 2023 at 7:02 pm
      
      Thank you for being so patronising. I’m well aware of the concept. And I’m well aware that sitting and reading every word of an article (i.e. 100% of it) in front of a camera, and adding next to nothing apart from some irrelevant jibes about tractors, and then making money from that is at best dishonest and plagiaristic, and almost certainly a breach of copyright under US or German law. As someone who has to go to great lengths to respect copyright, I won’t disseminate what I consider to be breaches of it.
      In any case, this isn’t about losing or gaining credibility. It’s about whether you can assess facts against wild unfounded speculation. My challenge remains as stated at the end of this article: put up or shut up. I think that’s fair.
      Howard.
      
      LikeLike
- 10
  
  no big deal on January 25, 2023 at 4:21 pm
  
  honestly I don’t see the big deal just block and move on and from what I know if you search “mediaanalysisd” on google/duck it has existed since Sierra/iOS 9 and provided the same function and is used in conjunction with photoanalysisd (for on-device face rec on the “photos” app on iOS/macOS) (on both the mobile/desktop OS) I don’t why know this a problem now if it wasn’t back in 2016-2017? who knows apple might’ve scanned you images for the CSAM back then. but who knows? If I worried about stuff like this I would drive myself crazy I watch louis rossman, mental outlaw (and other similar youtubers) too and i don’t take many of the videos very seriously, I watch YouTube for what it is… entertainment, not for factual info, louis is a cool dude and has made several “apology” videos on topics he admitted to be wrong on and I hope he sees this article. but before you make assumptions and believe everything you hear from some random dudes on YouTube, I suggest you find out for yourself like the author did in this article do google searches on dameon before you decide to burn your Mac. oh and if you really concerned about “privacy” on apple platforms, if you have an apple ID that you used for awhile you should download your data from apple your eyes will be open, anyway….. thats a WHOLE differnet topic for a WHOLE different blog :)
  
  LikeLiked by 1 person
11

Milo on January 19, 2023 at 12:56 pm

Thank you for this analysis.

Visual Lookup is a nice feature. But I don’t want this analysis to be performed on every image I look at using QuickLook. I would use it on a case by case basis. But that is not something Apple has thought of, unfortunately. I have therefore disabled Siri Suggestions in Spotlight settings.

Can you confirm, that mediaanalysisd will stop talking to Apple servers when the ‘Siri Suggestions’ setting is turned off? You mention that OCR might trigger some uploads to Apple servers as well?

I can’t check myself right now, because I don’t have a firewall like Little Snitch installed. Though I should maybe, considering the recent revelations.

LikeLiked by 1 person
- 12
  
  hoakley on January 19, 2023 at 1:06 pm
  
  “that is not something Apple has thought of”
  Apple has. The local analysis is performed when triggered – and that depends on how you are viewing the image – but no look-up is performed until you initiate it.
  It’s easy to tell whether you have turned this off. Try using it: if it doesn’t work, then you have disabled the whole of Visual Look Up.
  Live Text doesn’t appear to upload anything to Apple. As the connection is performed before any image analysis has been performed, and before recognising any text, it couldn’t upload anything about the image, could it? I believe from this and other situations that the connection is made to download any linguistics updates, which it then uses for text recognition.
  “considering the recent revelations” What revelations? This is all paranoia, conspiracy theory, a great way for the ignorant to earn money on YouTube, and from the gullible. But total bullshit, every last word of it.
  Howard.
  
  LikeLike
  - 13
    
    Milo on January 19, 2023 at 1:30 pm
    
    I am somewhat paranoid acutally :P . About revelations. It’s Apple that announced local scanning of photos. And apparently all the technical pieces were alrady there baked into the OS before they decided to postpone it.
    
    I might be now confused more than before. Does that mean that mediaanalysisd will connect to Apples servers every time QuickLook is used? That is certainly suspicious.
    
    LikeLiked by 1 person
    - 14
      
      hoakley on January 19, 2023 at 1:43 pm
      
      “Apple that announced local scanning of photos” No it didn’t, not in the least. It put forward a proposal for consideration, and the reaction was an overwhelming no. It then said it would go off and reconsider what might be acceptable and worthwhile. There was a long silence, and more recently Apple has made a clear statement that it has decided not to pursue CSAM image checking at all, but has introduced some other changes to help parents and children with safe Messaging – fairly unrelated.
      “all the technical pieces were alrady there baked into the OS” No they weren’t and no they aren’t. Tell me what you think is.
      “they decided to postpone it” They haven’t. They’ve cancelled it altogether.
      “that mean that mediaanalysisd will connect to Apples servers every time QuickLook is used?” Try reading the article again. No of course it doesn’t. Besides, your Mac connects to Apple’s servers often and for all sorts of perfectly innocent reasons that are nothing to do with this. There is nothing suspicious about this at all, and if you don’t like your Mac connecting to Apple’s servers, then I suggest that you try using Windows instead, and seeing where that gets you.
      By the way, are you aware that most non-Apple cloud services that support sharing of images have been scanning their content for years? No one seems to care about that, but it’s well known and has resulted in several erroneous court cases that have been reported widely.
      Howard.
      
      LikeLike
    - 15
      
      Milo on January 19, 2023 at 2:20 pm
      
      Of course they annouced it. Here is the reporting from macrumors, which is certainly not know inflamatory reporting about Apple:
      
      https://www.macrumors.com/2021/08/05/apple-new-child-safety-features/
      
      “Apple today previewed new child safety features that **will be coming to its platforms with software updates later this year**. The company said the features will be available in the U.S. only at launch and will be expanded to other regions over time.”
      
      I did read your explanation, multiple times acutally. What got me thinking was the following paragraph from above:
      
      “The image containing text took longer, 881 ms, and returned text of length 65 ‘DD’ (as given in the log) after a considerably more elaborate series of processes, **including one outgoing secure TCP or Quic connection by mediaanalysisd lasting 58 ms**, before the completion of Visual Search Gating.”
      
      I’m just trying to understand what is actually happening here. Your analysis is very helpful for this, altough not complete, since we can’t peak easily into the data packets sent. I am grateful for the work you put into this.
      
      LikeLiked by 1 person
    - 16
      
      hoakley on January 19, 2023 at 3:30 pm
      
      Regarding announcements and proposals, please read my original article, the first link in the article here.
      When Apple announces a new product or feature, it’s going ahead, and it gives the expected sales patter, and it goes ahead with what it announces. In this case, Apple announced a proposal rather than a product. It was also at an odd time – in early August, well after WWDC and details of the next versions of its operating systems.
      When Apple announces products and features, it doesn’t produce long papers explaining how they work, quite the opposite. Yet in this case, Apple produced (if I recall correctly) two or more papers, one going into depth about the technical details of what it was proposing. About the only thing it seemed definite about was that these features would initially only apply in the USA, because of local laws in the rest of the world. So there wasn’t even clarity as to whether this would ever happen where you live. And in some legislations, it probably wouldn’t have been possible at all.
      There’s also no doubt that Apple didn’t proceed with those proposals, and has recently confirmed what many of us suspected, that it now has no intention to do anything of the kind.
      Regarding Live Text in QuickLook Previews, I think it’s obvious that no transfer of information about images is taking place. Refer to my diagram, and you’ll see Visual Search Gating, well above MAD Visual Search. All analysis is local here, the sort that goes on in the background with images in Photos. This does coarse classification (is it a painting? a human face?) and object recognition (is there recognisable text?). At that stage, there is nothing distinctive about an image that even exists on the Mac. Text recognition occurs after the connection is complete. So without neural hashes or text content, what is there about the image that could be sent to Apple?
      It’s perhaps worth pointing out here that VLU is nothing like Apple’s proposals for CSAM recognition. The neural hashes used to look up images in VLU are ‘low resolution’. They’re good for distinguishing specific images of paintings, breeds of dog and cat, and so on. But – as you can read in my detailed account of the proposed method – they fall far short of what would have been required to detect CSAM. Another big difference is that initial CSAM detection was intended to be performed locally, using a database of ‘positive’ neural hashes stored locally. Apple recognised that looking them up online wasn’t feasible, but that’s exactly what VLU does, because it works differently, with a much larger database and forming less robust and more inaccurate matches. So VLU ≠ CSAM detection, not by a long way.
      And Live Text is different again, and doesn’t require any look up at all, like other methods of OCR.
      As a matter of interest, have you checked whether any app you might use for OCR phones home? While we’re being paranoid, that seems an important consideration.
      Howard.
      
      LikeLike
    - 17
      
      Milo on January 19, 2023 at 4:08 pm
      
      You say, Apple just “put forward a proposal for consideration”. But how can this be possible, if they annouced originally to roll it out within four months in an end of year dot release? For a company of the size of Apple this would only be possible if the feature was already finished, with some parts likely already present in the released os.
      
      LikeLiked by 1 person
    - 18
      
      hoakley on January 19, 2023 at 4:54 pm
      
      You are making a supposition. Tell me which part of Ventura is doing all this CSAM checking then. Where are the neural hashes? Where’s the local database of ‘bad’ hashes? Where are the people who have been convicted of CSAM offences as a result? It’s all fiction, isn’t it?
      If you’ve already convinced yourself that Apple is checking your images, then stop using Apple products. It’s as simple as that: if you don’t trust Apple, then buy other computers and devices instead.
      I have provided you with facts, not suppositions, rumours or histrionics. If you prefer those, that’s fine by me. It seems to be all the rage now.
      Howard.
      
      LikeLiked by 1 person
    - 19
      
      hoakley on January 19, 2023 at 5:03 pm
      
      See Wired’s quote from Apple late last year:
      “We have further decided to not move forward with our previously proposed CSAM detection tool for iCloud Photos. Children can be protected without companies combing through personal data, and we will continue working with governments, child advocates, and other companies to help protect young people, preserve their right to privacy, and make the internet a safer place for children and for us all.”
      Howard.
      
      LikeLike
    - 20
      
      Milo on January 19, 2023 at 4:15 pm
      
      “As a matter of interest, have you checked whether any app you might use for OCR phones home? While we’re being paranoid, that seems an important consideration.”
      
      When I use a dedicated software to OCR some text, I’m well aware that some, hopefully anonymous, artifacts might be processed by some server online. When I use QuickLook to check the contents of a file locally many times per day, my expectations are quite different though. By default I expect such a basic and essential piece of sotware to respect my privacy. It’s one of the reasons I choose Apple hardware and software.
      
      LikeLiked by 1 person
    - 21
      
      hoakley on January 19, 2023 at 4:57 pm
      
      There isn’t a shred of evidence that QuickLook doesn’t respect your privacy, is there?
      Can you really imagine Apple sat there getting hundreds of millions of QuickLook-ed images each day? What about all the bandwidth in transferring them? And mobile data from iPhones and iPads? Are you serious?
      Howard.
      
      LikeLike
    - 22
      
      Milo on January 19, 2023 at 7:18 pm
      
      I have never said, that Apple is today violating user’s privacy by uploading content without consent and I’m actually relieved when investigations like yours show that nothing like this is happening. I choose to use Macs and iPhones exactly for this reason.
      
      I do think though, that Apple should be more transparent when some of the features like OCR, Visual Lookup, Speech Recongnition only work with support from online services. There should be easy to understand switches in the OS where you can explicitly disable such behaviour. Hiding the off switches behind meaningless terms like “Siri suggestions” is very confusing and somewhat misleading as well.
      
      You seem to be taking my critique very personally. I hope you understand, that I’m not atacking you personally. I am sorry if I misused this forum to vent some of my frustrations with Apple.
      
      LikeLiked by 1 person
    - 23
      
      hoakley on January 19, 2023 at 9:50 pm
      
      You haven’t misused this forum at all. It’s not actually a forum as such – once upon a time these were comments! – and if I didn’t want to respond, then I wouldn’t.
      But having had to watch a 12 minute video before 6 this morning, of a jerk reading out a blog article that I was already only too familiar with, got the day off to a bad start. Since then I have once again learned the lesson that trying to tackle important and technical subjects with objectivity is a dumb move. I’ve had a load of comments and emails that demonstrate that many people aren’t interested in facts or objective assessments, they’re just victims of clickbait and wild rumour.
      That’s not you, of course, but the steady influx that I’ve had through a day when I’ve been trying to get on with researching and writing other articles.
      Perhaps after eight years of giving myself up to be grilled by anyone passing, it’s time to hang up my hat and do something I enjoy again.
      Oh, and Apple surely doesn’t need to spell out that Visual Look Up looks up some form of identifier derived from images. Does it really? And Live Text doesn’t look up your image or text at all, as I’ve been desperately trying to explain. AFAIK it obtains the latest linguistic data from Apple, and doesn’t send Apple anything derived from the image. But by now I’m past caring.
      Howard.
      
      LikeLike
24

Robert on January 19, 2023 at 1:51 pm

Great write up. Thank you for the details information. I think this is one of those times were people are slightly overly paranoid, but for good reason. Although the only thing I do on my MacBook is work, the last thing I need is some glitch somewhere else to release confidential files, or medical records to the public. You can never be too careful. I wonder why companies even try these types of things when they almost always come back to bite them.

LikeLiked by 1 person
- 25
  
  hoakley on January 19, 2023 at 3:06 pm
  
  Thank you.
  Howard.
  
  LikeLike
26

Myob on January 21, 2023 at 9:27 pm

This was a smart move––objectively investigate a circumstance. Someone else *might’ve* done it, still might do using your investigation as a source, maybe even as inspiration to better use their ability to think about other matters as well. Those who don’t take advantage of this opportunity through example cost nothing.

Thank you for the article. Happy computing, painting, reading, writing and living to you and friends.

LikeLiked by 1 person
- 27
  
  hoakley on January 21, 2023 at 9:44 pm
  
  Thank you.
  Howard.
  
  LikeLike
28

T on January 22, 2023 at 12:36 am

>Since then I have once again learned the lesson that trying to tackle important and technical subjects with objectivity is a dumb move.

Perhaps, but I believe most of us are here because you “tackle important and technical subjects with objectivity.” Tremendously grateful for your efforts and your insights.

LikeLiked by 1 person
- 29
  
  hoakley on January 22, 2023 at 7:50 am
  
  Thank you.
  Howard.
  
  LikeLike
30

L on January 22, 2023 at 12:20 pm

Excellent investigation and writeup. Please don’t forget that for every commenter there are many silent readers grateful for your efforts.

Also, I think no YouTube clickbait before 6am should be a rule, like not feeding gremlins after midnight.

LikeLiked by 1 person
- 31
  
  hoakley on January 22, 2023 at 4:39 pm
  
  Thank you. Sadly, I had no choice: I like to check all links in comments, and that was the only time that I had to do that.
  Howard.
  
  LikeLike
32

Lolz on January 22, 2023 at 4:41 pm

So the conclusion is no, I mean, yes they are, we can see the network traffic sending data to Apple’s servers, but we’re going to answer no anyway.

Are you a retard?

LikeLike
- 33
  
  hoakley on January 22, 2023 at 4:48 pm
  
  No, you didn’t read the article, did you? Or even its summary?
  Please don’t use ad hominem insults here. If you have a counter-argument, show us your facts before being gratuitously insulting.
  Howard.
  
  LikeLike
34

Ivo on January 23, 2023 at 12:10 am

Great and detailed article, as usual, Howard.
One thing that stood out when I was testing is that the mediaanalysisd connection was not only activated when I opened pictures containing text. Even pictures that had absolutely no text whatsoever triggered the connection. Not really sure what this means, but I don’t think it is malicious. Still, I am probably missing something.

Your articles are invaluable and highly appreciated!

LikeLiked by 1 person
- 35
  
  hoakley on January 23, 2023 at 7:47 am
  
  Thank you.
  Do you have log extracts for those? As I have stated above, that wasn’t the case in my tests, as ascertained in the log.
  Howard.
  
  LikeLike
36

Lunging Lounger on January 23, 2023 at 4:47 am

> When Apple was intending to check for CSAM, it **kindly** explained how

Corporations pay psychopaths hundreds of thousands of dollars per year to speak “kindly” on all sorts of topics, including the rape of the Earth, the mass murder of animals, and especially right now a war in Ukraine manufactured and propped up by Western powers. To say Apple is explaining “kindly” reveals you either don’t understand how PR works, or you are working wittingly or unwittingly for Apple PR. Either way, it puts your analysis under a cloud of suspicion.

VLU, far from being a “kindly” look behind the veil is a highly orchestrated attempt to psychologically manage a radical transformation in the concept of privacy. The purpose of the VLU demo is to normalize and defang the technology, just so that its sharp teeth can be put back in behind the scenes, if not by Apple, then by intelligence services. But you don’t see that. You see kindness.

> When Apple was intending to check for CSAM, it kindly explained how it aimed to do that, by generating identifiers, known as neural hashes, for images. A moment’s thought should indicate that uploading every image is neither sensible nor practical; instead, some form of concise identifier is essential.

This is a straw man argument on your part. The claim made in the blog accusing Apple of scanning local files didn’t claim the images were being sent to Apple. You made it up so you can sound smarter than they are.

> I repeated this over a period of 30 seconds

Anyone who knows how behind the scenes filesystem chron jobs work knows that you can just look at 30 seconds. You have to watch over a longer period of time. For all you know, the suspect activity took place 30 seconds after you stopped looking.

> The image containing text took longer, 881 ms, and returned text of length 65 ‘DD’ (as given in the log) after a considerably more elaborate series of processes, including one outgoing secure TCP or Quic connection by mediaanalysisd lasting 58 ms, before the completion of Visual Search Gating.

In the linked article you say:

>> Live Text uses a different mechanism to recognise text in images.

This issue takes us away from the topic at hand, but it appears that you are in contradiction with yourself. In this article, you claim the Live Text process involved mediaanalysisid and a TCP connection. In the linked article you claim “This doesn’t rely on any information being sent from your Mac anywhere else”. Well, which is it?

It also reveals that you are perfectly happy with Apple making TCP connections via medianalysisid to use Live Text, which is something many many many many other people are radically uncomfortable with.

In other words, you appear to be someone entirely happy to live in the womb of the Apple ecosystem and it’s decisions about that is safe and what counts as security. Many many many many other people do not feel this way. Instead of viewing Live Text’s TCP activity as a completely harmless event, we view it as just one more brick in the wall building an infrastructure of total surveillance on Apple devices. In other words, your evidence works against you in this line of reasoning. In other words, you simply do not get what people are concerned about. You are therefore not someone we can trust to perform the analysis, or to report on it its significance.

> VLU can be disabled by disabling Siri Suggestions in System Settings > Siri & Spotlight, as previously explained.

So you admit that VLU is on by default? Being an expert you also know that end users rarely change the default setting, which is why nefarious features are on by default, the switch primarily useful for big tech monopolies to defend themselves with BS arguments in court (“But they could have turned it off…”). Normalizing VLU leads, ultimately, directly to the very thing we are talking about.

> Trying to harvest VLU neural hashes to detect CSAM would be doomed to failure for many reasons, most of which were raised with Apple at the time of its original proposals, and remain valid today.

This is just false. You should realize it’s false since this is precisely the technology Apple intended to deploy on its iPhone platform. Perhaps you didn’t understand how it was supposed to work. Let me explain it to you. Apple scanned content on the iPhone. If it reached a critical threshold for “child” and “sex”, various data including thumbnails would be sent to Apple for manual analysis. If the manual analysis proved the content was CP, they would send it to the NCMEC for further processing. This method uses neural hashes, whether or not it specifically uses VLU.

Furthermore, even the use of traditional hash scanning wouldn’t be doomed to failure, since it could catch many people hosting old and well known illicit content on their devices. This in fact already takes place, on a vast scale.

Finally, and this is really the point, the danger of this infrastructure is in its far reaching consequences. Allowing any of this infrastructure, any tiny part of it, including the fun and “kindly” VLU and the fun and “kindly” Live Text function means opening the door to technologies that auto-scan files on our system for ANYTHING the state doesn’t like. Live in Germany? No swastikas for you, even in private. But I guess being a normie you’re OK with Nazi dabblers facing prison time for downloading sh-t posts on 4chan. How about Hong Kong residents dabbling in resistance to the Chinese government by scanning for illicit pictures of Winnie the Pooh, or pictures of sheets of paper, or umbrellas?

If you can systematically and practically invisibly scan for and report “illicit” contents of any kind, it is an inevitability that this technology will be used for something else. What “something else” that may be will depend upon the government in question. Right now the BBC documentary into anti Muslim riots has been banned by the Indian government. With the tools that you feel so unconcerned with already deployed on everyone’s devices (at first experimentally, then forced onto devices by law), the Indian government could not only block the documentary, know who had it and either prosecute them or quietly persecute them.

You seem completely unaware of these dangers. I suggest you wake up and realize that you simply lack the awareness or levels of concern necessary to perform an adequate analysis of the problem.

Stop hand waving away our very legitimate concerns.

In conclusion, your analysis has failed to prove that what was reported as taking place is not taking place. That doesn’t mean it is taking place. But it does mean that you have failed.

LikeLiked by 1 person
- 37
  
  hoakley on January 23, 2023 at 7:50 am
  
  Thank you.
  So, not one new fact, just more paranoia and histrionics.
  To me, that’s just total bullshit.
  Howard.
  
  LikeLike
38

Mac & Cheese on January 25, 2023 at 5:23 pm

I really hope Apple scratched that plan entirely, it’s really scary. But Rossmann reported the opposite:
[link removed by hoakley]

LikeLiked by 1 person
- 39
  
  hoakley on January 25, 2023 at 6:55 pm
  
  Well, late last year Apple made a statement to the press in which it made clear that it had no intention of pursuing it at all. I’ve cited the link in another reply. But some people carefully ignore reality, and tell lies.
  Howard.
  
  LikeLike
  - 40
    
    Ross Fisher on January 25, 2023 at 7:11 pm
    
    At least here, Louis Rossman is the prime leader of the Right to Repair movement in the US. I don’t take any information I read or hear about online at face value, of course.
    
    I have sold all of my Apple Products and have moved to Linux and GrapheneOS. I’m not comfortable with my local files and network activity around such being sent back to Apple, regardless of reason or the specifics when not signed into an iCloud account. It’s a step too far that I’m just not interested in.
    
    Google has already done this and the poor lad had to fight a year with Law Enforcement: https://www.nytimes.com/2022/08/21/technology/google-surveillance-toddler-photo.html
    
    It sounds like you may be in Germany, but here in the US, even just being falsely accused of any sort of child abuse will basically end your life. You are likely to be killed in prison, even if just held overnight, let alone unemployable. Suicide rates are around 50% for those falsely accused. You can understand when any breath around CSAM and device scanning is mentioned, us Americans want no part of it.
    
    LikeLike
    - 41
      
      hoakley on January 25, 2023 at 7:22 pm
      
      “I have sold all of my Apple Products and have moved to Linux and GrapheneOS”
      I’m sorry, I don’t cover those at all. Do you come here for the art, then?
      “my local files and network activity around such being sent back to Apple”
      What evidence do you have of what local files and network activity are being sent back to Apple, then? As I wrote, put up or shut up.
      “It sounds like you may be in Germany”
      Wrong again.
      So you do believe all this scurrilous and unsubstantiated gossip about Apple and CSAM? I’m so sorry that you have been hoodwinked by charlatans who can’t even be bothered to check their claims against fact. Neither he nor the original article mentioned Apple’s clear statement made late last year that it had no intention to proceed with CSAM detection. But let’s just lie, and pretend that never happened.
      Howard.
      
      LikeLike
    - 42
      
      Ross Fisher on January 25, 2023 at 7:35 pm
      
      I think you are hoodwinking yourself.
      
      The direct quote from Apple is:
      “Based on feedback from customers, advocacy groups, researchers
      and others, we have decided to take additional time over the coming months to collect input and make improvements before
      releasing these critically important child safety features.”
      
      At no point did Apple say that they were not proceeding with the CSAM detection.
      
      “make improvements before releasing these critically important child safety features”
      
      “before releasing these critically important”
      
      Please share where Apple stated that they were not proceeding with the CSAM features.
      
      LikeLiked by 1 person
    - 43
      
      hoakley on January 25, 2023 at 7:38 pm
      
      Ah. You see, being patronising kicks back. If you had taken the trouble, then you would have come across this article in Wired, where Apple updated its position.
      More facts, aren’t they inconvenient.
      Howard.
      
      LikeLike
    - 44
      
      hoakley on January 25, 2023 at 7:41 pm
      
      In case you’re having problems with that link, here’s an unequivocal answer:
      “We have further decided to not move forward with our previously proposed CSAM detection tool for iCloud Photos. Children can be protected without companies combing through personal data, and we will continue working with governments, child advocates, and other companies to help protect young people, preserve their right to privacy, and make the internet a safer place for children and for us all.”
      OK, now you won’t believe it, or claim it doesn’t really say what it so manifestly does.
      Howard.
      
      LikeLike
45

Michael Tsai - Blog - Network Connections From mediaanalysisd on January 25, 2023 at 9:31 pm

[…] Howard Oakley: […]

LikeLike
46

John B on January 25, 2023 at 11:16 pm

Thankyou for putting in A LOT of time & effort into something that ultimately was a waste of time because of a YouTuber jumping the gun and assuming something without waiting for someone like yourself to look deeper into the issue.

Your article is so well thought out and logical that its hard to think that anyone would not be completely swayed by it.

I am so sorry that you feel like hanging up your hat due to the completely understandable position of being sick of being “grilled by anyone passing” but I would urge you to reconsider.

You are a wonderful talent and my profession is made that much brighter by having you in it. Consider me the silent majority.

“I love those who can smile in trouble, who can gather strength from distress, and grow brave by reflection. ‘Tis the business of little minds to shrink, but they whose heart is firm, and whose conscience approves their conduct, will pursue their principles unto death.”

LikeLiked by 1 person
- 47
  
  hoakley on January 26, 2023 at 6:26 am
  
  Thank you.
  The hat is still on, for the moment at least!
  Howard.
  
  LikeLike
48

Ocaml on January 26, 2023 at 4:04 pm

Did you seriously came to that conclusion after only:

1. Looking at pictures for a minute
2. Analysing logs that were generated by the thing you’re analysing (lmao)
3. Encrypted network activity

HELLO?????

How can you be sure that the daemon isn’t collecting data locally just to send it later? Or maybe it will only send them if you’re viewing pictures on a sunny monday at 1 PM?

There are just so many variables left out. This proves absolutely nothing.

Perhaps start by disassembling the binary next time?

Regards,

LikeLiked by 1 person
- 49
  
  hoakley on January 26, 2023 at 5:57 pm
  
  Thank you.
  I’m sorry that you were unable to read the second paragraph above, because you are making completely different allegations.
  
  The claim was that, *while* a user was browsing images in the Finder, an outgoing connection was made by mediaanalysisd to an Apple server, and that was interpreted as sending Apple image identifiers to be used in checking the browsed image against CSAM data.
  
  So no one (at least, no one in their right mind) has here suggested that image identifiers are being secreted away and uploaded to Apple on a later occasion. If you have any evidence to support that claim, I suggest that you either give it, or retract your allegation as being without any basis in reality.
  
  So what I have done here does address the claim for which evidence was provided.
  
  I’m also sorry that you know so little about the Unified log in macOS. It may come as a surprise, but processes that carry out network communications like BoringSSL make copious log entries which can’t be turned off unless the kernel itself imposes a complete blackout on writing to the log. Of course Apple could have a complete set of ‘dark’ versions of all network communications processes that don’t write to the log, but if you believe that, I think you may need professional help.
  
  This also applies to encrypted network activity (what do you think TLS is?). That isn’t silent in the log, and its entries are copious and detailed, although obviously some of the message content is encrypted or censored.
  
  You might find it helpful to read some of my articles introducing the Unified log before voicing opinions on its usefulness or content for investigating such matters. A few years of experience might also help.
  
  While I’m very grateful to you for suggesting that I should start reversing code in macOS, which of the many dozens of processes involved here would you suggest that I start with? How would I then know whether any particular code was actually used at the time of image analysis, without stepping through in a debugger, a task that I think would take several man-years of effort to even scratch the surface? Given the reliance on neural networks here, how would you tackle that in disassembly? Have you ever disassembled ANE code, for example? What tool did you use?
  
  I also note that, despite my invitation to present facts, your comment is completely fact-free. I invite you to put your money where your mouth is and investigate these claims before rejecting solid evidence. It’s simple: put up, or shut up.
  
  Howard.
  
  LikeLike
50

johndo on May 1, 2023 at 11:44 pm

Can someone please explain why every iso or dmg file gets mounted during an osx update.It really peaves me off. An update shouldn’t require mounting of DMG files or iso’s.

LikeLiked by 1 person
- 51
  
  hoakley on May 2, 2023 at 7:20 am
  
  I’m sorry, I don’t understand what you’re experiencing. Could you please explain in more detail?
  Howard.
  
  LikeLike