Is Apple lagging in AI?

As we’ve become increasingly besotted by the achievements and failings of Large Language Models (LLMs) like GPT-4 in ChatGPT, it has become fashionable to speculate that Apple has been left behind by these recent developments in AI and Machine Learning (ML), and to wonder when and how it might catch up. This article considers what Apple might have up its sleeve for macOS in the coming year or so.

Apple has declared that its greatest interest is in on-device ML and AI, citing its overriding concern with privacy. While this is heartening, there are other good reasons for Apple not to commit to large-scale off-device AI, which would only compete for bandwidth and servers with iCloud and its other profitable services.

Apple’s products are already well-equipped to perform sophisticated ML tasks, unlike the hardware of most of its competitors. For instance, all iPhone models introduced since the iPhone 8 in September 2017, all iPads introduced since March 2020, all Apple silicon Macs, and the Apple Vision Pro have a neural engine (ANE) in their chip. Poorest-equipped are Intel Macs, although Apple’s engineers have surprised us with the amount of ML they have been able to run with only limited hardware support.

The last few years have seen the use of ML in a wide range of tasks across macOS, iPadOS and iOS. Some of the more familiar examples include:

  • Visual Look Up, local image analysis and recognition, with off-device reference data;
  • Live Text, on-device text recognition in images (sketched in code after this list);
  • machine translation, on- or off-device;
  • natural language processing, on-device.
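
For developers, the same on-device recognition that powers Live Text is exposed through the Vision framework. The following is a minimal sketch in Swift, assuming a hypothetical image path; it isn’t how Apple implements Live Text itself, but it exercises the same on-device models:

```swift
import Foundation
import Vision

// Recognise text in an image entirely on-device using the Vision framework.
// The file path below is hypothetical.
func recogniseText(in url: URL) throws -> [String] {
    let request = VNRecognizeTextRequest()
    request.recognitionLevel = .accurate      // favour accuracy over speed
    request.usesLanguageCorrection = true     // apply the on-device language model

    let handler = VNImageRequestHandler(url: url, options: [:])
    try handler.perform([request])

    // Each observation offers ranked candidates; keep the best string from each.
    return (request.results ?? []).compactMap { $0.topCandidates(1).first?.string }
}

do {
    let lines = try recogniseText(in: URL(fileURLWithPath: "/path/to/menu.png"))
    lines.forEach { print($0) }
} catch {
    print("Recognition failed: \(error)")
}
```

Because the recognition runs entirely locally, this works without any network connection, which is exactly Apple’s stated preference.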

One simple illustration of these is in natural language processing, as shown in my app Nalaprop.

[Screenshot: Nalaprop parsing a document containing French and Spanish text]

This recognises and analyses multiple languages in the same text document, here with French (upper paragraph) and Spanish (lower). Words are parsed into different parts of speech, and can be reduced to lemmas, word roots such as the verb be for is, are, was, etc. Currently this supports half a dozen languages in full, and recognises most others. Nalaprop doesn’t have to work any magic with neural networks or ML to deliver this, as it’s all built into an accessible API.
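
That API is presumably the NaturalLanguage framework (or its NSLinguisticTagger predecessor). Here is a minimal sketch of the kind of tagging involved, illustrative only and not Nalaprop’s actual code: NLTagger assigns each word a part of speech and, where it can, a lemma, all on-device.

```swift
import NaturalLanguage

// A sketch of on-device part-of-speech and lemma tagging with NLTagger.
let text = "Les chats dorment. Los perros corren."

let tagger = NLTagger(tagSchemes: [.lexicalClass, .lemma])
tagger.string = text

tagger.enumerateTags(in: text.startIndex..<text.endIndex,
                     unit: .word,
                     scheme: .lexicalClass,
                     options: [.omitPunctuation, .omitWhitespace]) { tag, tokenRange in
    let word = String(text[tokenRange])
    let partOfSpeech = tag?.rawValue ?? "unknown"
    // Ask for the lemma of the same token, where the model can supply one.
    let lemma = tagger.tag(at: tokenRange.lowerBound, unit: .word, scheme: .lemma).0?.rawValue ?? word
    print("\(word): \(partOfSpeech), lemma \(lemma)")
    return true   // continue through the rest of the text
}
```

Handling a multilingual document like the one shown above takes a little more work, for instance detecting the language of each paragraph with NLLanguageRecognizer and setting the tagger’s orthography accordingly before tagging it.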

Although off-device translation is widely available, Apple’s operating systems also offer on-device translation, to ensure that no user content leaves that Mac or device.

[Screenshot: on-device translation in macOS]

Other examples of ML in use on Apple devices include motion activity models, gait analysis, hand gestures for the Apple Vision Pro, and further improvements in word and sentence completion.
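
Most of those models are used internally by the system, but some of their results surface through public APIs. As one illustration, Core Motion exposes the output of the motion activity model on iPhone and Apple Watch; this sketch shows only the public interface, as Apple’s gait and gesture models aren’t directly accessible to third-party code.

```swift
import CoreMotion

// Query the on-device motion activity classifier via Core Motion (iPhone/Apple Watch).
// Illustrative only: requires the user's Motion & Fitness permission.
let manager = CMMotionActivityManager()

if CMMotionActivityManager.isActivityAvailable() {
    manager.startActivityUpdates(to: .main) { activity in
        guard let activity = activity else { return }
        if activity.walking    { print("walking, confidence \(activity.confidence.rawValue)") }
        if activity.running    { print("running") }
        if activity.automotive { print("in a vehicle") }
    }
}
```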

Commentators appear to have been surprised by a recent paper published on arXiv, although it is but one of many that give signs of Apple’s interests, and of the productivity of its AI/ML teams. Recent major papers include:

  • arXiv:2310.07704v1 October 2023, describing the Ferret multimodal large language model (MLLM), which understands spatial references in images and their verbal descriptions;
  • arXiv:2309.17102v2 February 2024, describing instruction-based image editing enhanced by an MLLM;
  • arXiv:2306.07952v3 March 2024, introducing MOFI, a vision foundation model that learns image representations from noisy annotated images, with a constructed dataset of 1.1 billion images covering 2.1 million entities, based on a web corpus of 8.4 billion image-text pairs;
  • arXiv:2403.09611v1 March 2024, detailing the design and implementation of MM1, a family of MLLMs for multi-image reasoning, demonstrated using captioning and visual question-answering tasks. These can, for example, count the number of beers on a table, read the prices from an image of the menu, and calculate their total cost.

The emphasis in this recently published work is on the integration of images with the text of LLMs to form multimodal large language models (MLLMs) that can be used to reason about the content of images. Although this research uses very large corpora and datasets in training, it appears aimed at delivery for on-device use.

While LLMs like GPT-4 in ChatGPT have been grabbing the limelight, researchers working in and with Apple appear to have been pursuing goals of greater relevance to Apple’s computers and devices, including its Vision Pro. Hopefully, WWDC this coming June will bring more detailed announcements about the fruits of all this research.