hoakley June 13, 2024 Macs, Technology

What’s going on with AI in Sequoia?

If you only read the headlines, you’d now presume that ChatGPT was the major new feature in macOS Sequoia and its sister operating systems due for release in about three months. This article explains how that’s not correct, and what is really going on with AI and ML on the Mac. Because it’s clearest to understand, I’ll focus here on changes coming in how macOS works with text, features that Apple calls Writing Tools. To understand where we are now, I’ll start forty years ago.

Spell-check

Back in the early 1980s when personal computers including the Mac were starting off, one big breakthrough was being able to check the spelling of the words you typed into word processors and later in laid-out pages when the Mac brought the Desktop Publishing revolution.

spellcheck1

Although this started simply, checking spelling turned out to be more complex, as it had to cope with language variations, and even then wasn’t as clever as a human. How could it tell whether that word should be there, their or they’re, for example? As we became increasingly unimpressed, so checking spelling extended into grammatical context, and came to examine our grammar too.

spellcheck2

Word completion

On devices where typing is performed by tapping on tiny keyboards or on-screen, predictive text became universal. Using a system of simple rules, it enables you to enter the most likely words with the fewest keystrokes. Since its introduction this has steadily become more sophisticated, so that it learns which words a user is most likely to type, and in which contexts. This has now extended to word completion for Mac text entry, with considerable ability to learn what to suggest next. That machine learning is performed on-device, and doesn’t require any assistance from external resources.

spellcheck3

Optical character recognition

Over the same period, computers and devices have become able to recognise text in images. At first this was the preserve of dedicated Optical Character Recognition (OCR) software used to convert scanned pages into text, and sometimes was performed off-device. It was built into macOS Monterey as Live Text, alongside Visual Look Up.

Live Text doesn’t transfer any data off device, although it’s dependent on linguistic support data that may have to be downloaded. Much of the work performed in Visual Look Up also remains on-device, although image recognition may call on servers containing data for specific types of image, such as paintings.

One popular technique is for the local calculation of a form of hash that is distinctive to a part or the whole of an image, and for the remote system to match that against hashes for known objects, and propose the closest match as its identification of the image. As the hash function used is one-way (and normally derived from a neural network), there’s no way to reconstruct the original image from that hash. That contrasts with online systems that require the image to be uploaded for remote analysis and identification, so putting your privacy at risk.

Writing Tools

These are extensions of existing text analysis that look beyond a word and its immediate context, to analyse paragraphs or whole documents. To do this requires more substantial models, extending up to Large Language Models (LLMs) that have become so famous in popular AI like ChatGPT.

Apple has developed an LLM that is powerful enough to provide Writing Tools, but is small enough to run on a device, or an Apple silicon Mac. This has been fine-tuned into a task-specific version for performing text-based functions such as summarising and proofreading text. It can turn your text into a summary of key points, or generate lists or tables from it. These work both with editable text in most editors and word processors, and even with non-editable text from other sources.

Writing Tools don’t themselves generate new content in text, but use the original text to produce derivatives. I’m particularly looking forward to using its proofreading feature, which can suggest improvements that I can choose to ignore, or adapt to my own style, as I wish.

Private Cloud Compute

Some of the more demanding tasks in Apple Intelligence can’t be run on-device at present, although as techniques and hardware improve that may become possible in a year or two. For those challenges that need more computing power, Apple has devised what it terms Private Cloud Compute, using servers with Apple silicon chips designed to preserve privacy at all times, and overseen by independent experts to verify privacy measures. Apple has just published an article explaining how that will work.

Until there’s more experience of which tasks can be run on-device, it’s not clear which will benefit from Apple’s servers. Given the capable hardware in an Apple silicon Mac, it currently looks unlikely that any Writing Tools will be required to be run remotely.

Generative AI

Writing Tools don’t create new content, they use your original to generate derivatives, much in the way that a sub-editor might proofread and create a summary for an author. Generative AI uses LLMs and other methods to create new content, perhaps bringing together text from a range of other sources to write you an essay about something you want to know about. While some of us (me included) have no interest in using ChatGPT or its competitors, Apple recognises that many who use Macs and devices do want easy access to those services, and has promised to integrate access to them into Sequoia and its sisters. However, their use is entirely optional, and at present there are no plans to somehow incorporate third-party LLMs into everyday macOS features.

Summary

Starting this autumn, rather than having to write my own summary of an article like this, I will be able to use the Writing Tools in Apple Intelligence to do it for me. This will take place within the confines of my Mac, where it remains private. I retain complete control, and can reject the summary provided, or modify it to my own taste. For some more demanding tasks, macOS may decide to enlist the help of Apple’s Private Cloud Compute, which will also respect my privacy and not retain any of my text once the job is complete. And no, I’m not going to use ChatGPT, thank you, but if that’s what you like, it will be there and free to use.

22Comments

Add yours

1

EcleX on June 13, 2024 at 7:22 am
Reply

Thanks for the interesting article and summary. BTW, do you think that given the advances of AI it could be possible to automatically score-count the nodes of mind maps (available as PDF, or images like JPG, PNT, etc). I mean, a standalone tool to do it, not requiring the user to develop such tool. That would be great to evaluate mind maps at University, for instance. Now it can be done manually, which is tedious, time-consuming and prone to error. Imagine manually scoring 100 or more mind maps with 500 or more nodes each.

LikeLiked by 2 people
- 2
  
  hoakley on June 13, 2024 at 11:53 am
  Reply
  
  Thank you.
  I thought that we had already discussed your score-count problem. I’m sure that it would be amenable to automation, but that isn’t something likely to come in the general tools that are coming in Sequoia. It would therefore need to be written for the purpose, using ML and AI features available in macOS.
  Howard.
  
  LikeLike
3

fujimidai1 on June 13, 2024 at 7:29 am
Reply

Howard, like you I am not really interested in generative AI and LLM but there is one area of Mac usage where Apple could put AI to work and that is Dictation. Not only is keyboard input so 19th century but it is also a literal pain for people with RSI in the wrist and arms (like me) and other motor disabilities. I’d have guessed that since Apple professes to be accessibility focussed this app would be the be all and end all of accuracy but it still basically sux. I use both an M1 MBA and an M2 MM with Dictation to make a living from home but Dictation still throws me stupid errors in almost every sentence. Here’s an (annotated) example where it’s lack of intelligence is manifest.

Dictation Example
Surely Mr. musk is the worlds brightest most experienced and intelligent engineer as he claimed recently and can’t possibly be wrong!
(Everything in this example including this annotation was dictated using dictation in Sequoia without manual correction by me, and you can see that it still doesn’t understand capitalization of a name after a period or use of the apostrophe mark. I had to say apostrophe mark twice to stop it from inserting an’ symbol, like it has just done without a character space. I’m not sure that it’s using very much Apple intelligence at the moment. Interestingly, it did misspell Manuel (there it has done the same same thing again) but later connected it to MANUAL, which I just dictated letter by letter but has now come out in uppercase.

LikeLiked by 2 people
- 4
  
  Enzo Vincenzo on June 13, 2024 at 8:07 am
  Reply
  
  Maybe “musk” is intended 😉 and your Mac (rightly…) disagrees with this idea of you… 🙂
  You can try these other examples of dictation:
  “Surely Mr. Steve Jobs was the most brilliant, experienced and intelligent founder in the world!” or “Surely Mr. Einstein was the most brilliant, experienced and intelligent physicist in the world!” and you will see that Sequoia is not wrong 🙂
  
  P.S. Just kidding (or not?…) 😉
  
  LikeLiked by 1 person
- 5
  
  hoakley on June 13, 2024 at 11:55 am
  Reply
  
  Thank you.
  One immediate suggestion that comes to mind for some of those errors would be to use the proofreading feature in Writing Tools. However, I suspect that ML is already at work in Dictation – it’s just that current results aren’t as good as you’d like. I’m sure that they will improve.
  Howard.
  
  LikeLike
6

fujimidai1 on June 13, 2024 at 7:30 am
Reply

And it misused it’s for its too in the opening.

LikeLiked by 2 people
7

Florian on June 13, 2024 at 8:52 am
Reply

Thank you very much to put into a bigger perspective AI as it has been used for a long time.

This development has been ongoing for decades, beginning with companies in Africa having hundreds of employees identifying objects in millions of photos, so a model can be had then we can search cat or whatever and find it on our iPhone

LikeLiked by 1 person
- 8
  
  hoakley on June 13, 2024 at 11:56 am
  Reply
  
  Thank you.
  Howard.
  
  LikeLike
9

iain henderson on June 13, 2024 at 10:29 am
Reply

Howard, I think you missed some AI in there. Unless I miss my grammar detection/correction was AI powered, that seems like something you’d use a https://en.wikipedia.org/wiki/Markov_chain for. Later stage spell check might be AI too.

Remember, during the https://en.wikipedia.org/wiki/AI_winter anything utilizing AI would have that removed from the description (unlike like the current times wehre AI is slapped on everything).

LikeLiked by 1 person
- 10
  
  hoakley on June 13, 2024 at 11:58 am
  Reply
  
  Thank you.
  One of the points that I’m trying to make is that this is a continuum. Where you place ML and AI along that is open to discussion and dispute. For most of us, I think contextual spell- and grammar- checking is well within ML (as are Markov chains), whereas summarising and proofreading are squarely AI. But the key word in AI is ‘intelligence’, which clearly goes well beyond ML.
  Howard.
  
  LikeLike
11

Duncan on June 13, 2024 at 2:53 pm
Reply

Thank you for that logical build-up from early spell-checkers to today’s capabilities. One aspect not mentioned, however, are the storage requirements for on-device libraries that provide the contexts for process analysis.

As an example, GPS applications used to rely entirely on external map sources due to device’s then-limited storage capacity. Now it is common to download and cache much of that data for use away from network connectivity. I imagine within many of our lifetimes we could see the entire contents of Wikipedia, for example, downloaded to our phones, which brings a vast source of information to bear on ‘AI’ processing. (The data will need to be continually updated, of course, but any given snapshot will still be useful.)

So it appears to me that while Apple’s work on AI-specific hardware and software is useful, their progress (indeed the entire industry’s that wishes to process on-device) will be constrained by storage limits.

I hope that we see an order-of-magnitude breakthrough on storage capacity soon to enable better on-device results, otherwise we will be reliant on remote servers and their concomitant energy footprint to support this industry.

LikeLiked by 1 person
- 12
  
  hoakley on June 13, 2024 at 3:48 pm
  Reply
  
  Thank you.
  Yes, I think this is well-recognised, and something specifically addressed by Apple in the LLM being built into Sequoia. The expectation is that going off-device should be infrequent in the features of Writing Tools. Those who want to play with generative AI like ChatGPT will of course have to accept that most will then take place remotely.
  Howard.
  
  LikeLike
13

prehensileblog on June 13, 2024 at 7:16 pm
Reply

The current proliferation of the term “A.I.” (Artificial Intelligence, Apple or otherwise) seems most problematic to me because of its second definition (according to Onuora Amobi: the ability of computer-controlled robots of digital computers to perform tasks more commonly performed by intelligent beings), which is to a great degree contradictory to the first (the study and design of intelligent agents). Its denotation is very similar to that of the word “nonplussed”. “Nonplussed” has only two meanings, the second being almost precisely the opposite of the first. In other words, the term A.I. can mean almost anything one wants it to.

LikeLiked by 1 person
- 14
  
  hoakley on June 13, 2024 at 9:13 pm
  Reply
  
  I’m sorry, I don’t rate Onuora Amobi as an expert in AI or ML, and find his definitions severely lacking.
  As I wrote, the key word in AI is intelligence. While it has long been a disputed subject in neurosciences, there are some fairly clear distinctions that can be drawn. A lot of optimisation methods aren’t in the least bit intelligent, and most of those that adjust weights or similar factors do so according to relatively simple rules that have nothing whatsoever to do with any quality that might be construed as intelligence. If something doesn’t even pass that first test, then the best you can describe it as is Machine Learning, which is still perfectly respectable and actually has a far better track record. Instead, so many overstate because they think AI is sexy. All they are doing is hastening the next collapse of AI, and their own loss of funding.
  You might question their intelligence, perhaps.
  Howard.
  
  LikeLiked by 2 people
  - 15
    
    prehensileblog on June 13, 2024 at 11:40 pm
    Reply
    
    I guess I should have compared the buzz word appeal of the term “AI” to that of “atomic” that occurred around the middle of the last century. When it was first introduced, that term was a lot more controversial than “AI”, of course, but I think you would agree that there isn’t much chance of seeing “atomic” in the name of a product or service nowadays, even if that product is from a small company. My point is that the overuse of the term “AI” is far likelier to eventually result in a negative connotation for anything so named, whether deserved or not.
    
    Speaking to the question of actual intelligence (whether artificial or conventional), I typed the following questions and answers from my ChatGPT-enabled ATA application (installed by Homebrew) a few minutes ago.
    
    Question: What’s a conservative estimate of the average amount of fossil fuel that is burned per day in the world?
    
    Answer: A conservative estimate of the amount of fossil fuel burned per day in the world is around 90 million barrels of oil equivalent (BOE) per day.
    
    Maybe I entered my follow up question incorrectly, but when I asked “At a continued rate of use of 90 million barrels of oil equivalent (BOE) per day, how soon will all potential supplies of fossil fuel in the world be depleted?”, I received an error which included the message, “You exceeded your current quota, please check your plan and billing details.” Personally, I consider the answers to such questions more important than whether their solutions are credited to artificial or conventional intelligence.
    
    LikeLiked by 1 person
16

fazalmajid on June 13, 2024 at 9:29 pm
Reply

I am not a cryptographer, but there seems to be a critical flaw in Private Cloud Compute, in its use of OHTTP (oblivious HTTP), as also used in Apple Private Relay. In OHTTP, there is an intermediate gateway, operated by CloudFlare. You connect to Apple’s servers and send them a payload encrypted with CloudFlare’s private key. Apple sees your IP but not the payload. CloudFlare sees the payload but not the IP. The critical part is, the protocol is not end-to-end encrypted (e2ee) like HTTPS, and the input payload sent to the AI cloud probably contains identifying information that CloudFlare can now see.

LikeLiked by 1 person
- 17
  
  fazalmajid on June 13, 2024 at 9:58 pm
  Reply
  
  Hmmm. The text is a bit vague on this, but it looks like the payload sent through OHTTP is encrypted with the PCC node’s key, so CloudFlare will.not be able to see it.
  
  LikeLiked by 1 person
18

John Gilbert on June 13, 2024 at 11:22 pm
Reply

For me, ELIZA was the start of AI. It was the first chat bot. For those that can’t remember it https://en.wikipedia.org/wiki/ELIZA

LikeLiked by 1 person
19

ConfuSomu on June 19, 2024 at 7:09 pm
Reply

Thank you for this blog post which provides a new perspective on LLMs for writing as an evolution of the existing writing help that was introduced over time on computers. Nevertheless, I feel that LLMs and other generative AI tools change fundamentally the way that we create things and remove the element that makes art art. It removes the human touch from the final work, which makes me quite sad. I noticed that it shares a few similarities, at a higher level, with modern smartphone photography: there are now an impressive amount of models processing your photo thought your original photographic subject has not been completely transformed, unlike what you would get with generative AI tools. Though, Writing Tools creates derivatives of your existing text and can improve your text similarity to “filters” when editing your photos, which makes it closer to the models used in smartphone photography. Yet, Writing Tools would change the fundamental structure and flow of the text, even if it contains the same ideas, so it might not be as easily compared to modern smartphone photography. But could even a parallel be drawn between the two?

Furthermore, humanity, and nature, cannot afford the energy and water consumption required by (continuously, as people expect them to become better) training these large models. We must reduce our energy and resource consumption as it is currently unsustainable. We already passed this year’s Earth Overshoot Day (which I know is not a perfect calculation).

On the other side, AI or ML tools do bring positive developments to accessibility, such as Sound Identification or better speech-to-text on Apple’s platforms and Android, or Firefox new feature that allows generating alt text for images that do not have any. Thus, everything is not entirely gloomy.

Sorry for my unconstructive tirade the other day, even if I just reiterated a few elements. I shall stop now 🙂

LikeLiked by 1 person
- 20
  
  hoakley on June 19, 2024 at 8:04 pm
  Reply
  
  No problems.
  While I do have concerns over the use of generative AI, I think Writing Tools is fundamentally different.
  Each article that I write for this blog is researched and written without the use of any form of AI. I then refine and revise it until I think it’s right, and read it aloud to my editor-in-chief (wife) for final amendment and approval. What I’d like to do is get Writing Tools to read it through before that final step, as a proofreader. Furthermore, I don’t want to accept its improvements automatically, but use them to rework text so that it reads how I want it to be. Although this writing doesn’t aspire to be art in any way, I don’t see that proofreading as degrading wordcraft, any more than spell-checking does.
  Interestingly my commercial publisher has warned that any contributor using AI to write their articles will lose all future commissions, and I’m delighted that they’re standing firm on that.
  Howard.
  
  LikeLike
  - 21
    
    ConfuSomu on June 19, 2024 at 8:23 pm
    Reply
    
    I understand and agree with your stance on proofreading. More proofreading is always a good thing, and computers can serve as a good tool in that regard. This reminds me of the Antidote tool which does grammar correction, before this “AI” hype, and is quite useful (it has a good handy dictionary and guides). They now also have an English version.
    
    Once Sequoia releases, I will try out Writing Tools to see at least what it provides, even if I don’t intend to use it.
    
    LikeLiked by 1 person
    - 22
      
      hoakley on June 20, 2024 at 5:37 am
      
      Thank you. Yes, I’m looking forward to trying them out to see what they can do for me.
      Howard.
      
      LikeLike