Explainer: Machine learning

Machine learning (ML), and artificial intelligence as a whole, have become a vogue and, for some, an alarming trend in computing. This is strange, as one of the earliest major goals in computing was to improve predictions, something for which we’re accustomed to using statistics and conventional modelling. The distinction between those and machine learning is blurred, yet no one seems worried about the impact on society of improving weather models and their forecasts.

What’s most distinctive about machine learning is that, instead of humans laying down exactly how predictions should be made using equations and rules, ML provides the computer with the tools to develop and build its own predictive techniques.

Take weather forecasting as an example. Over the last couple of centuries, meteorologists and atmospheric physicists have learned a lot about how our atmosphere and weather work. They’ve progressively built more sophisticated mathematical models into which they load measurements taken around the world, and which then forecast changes over the coming hours and days.

If that hadn’t worked as well as it has, an ML approach might have been to design a large neural network, load it with several years of observational data, and see what predictions it came up with. The neural network isn’t anything like the forecasting models developed by humans, but provides the tools for the computer to develop its own form of model.

Given the success of numerical weather forecasting, ML would be up against stiff competition there. In other fields, such as human vision, a vast amount of research has brought little progress on what appear to be relatively everyday problems, ones that human brains solve in a fraction of a second. An example is object recognition in images, where vision scientists and computer programmers have worked for many years without producing a good conventional algorithm to recognise a human face against a background. Yet, thanks to ML using neural networks, even a humble iPhone can now do that quite reliably, and macOS Ventura extends the range of objects recognised, identifying many of them for you.
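To give a sense of how little code that now takes, here’s a minimal Swift sketch using Apple’s Vision framework to detect faces in a still image. The image path is hypothetical, and it assumes macOS 12 or later, where the request’s results are typed.

```swift
import Foundation
import Vision

// A minimal sketch of face detection using Apple's Vision framework.
// The image path below is hypothetical; substitute one of your own.
let imageURL = URL(fileURLWithPath: "/path/to/photo.jpg")

// The request wraps the trained model that finds face bounding boxes.
let request = VNDetectFaceRectanglesRequest()

// The handler runs the request against a single image.
let handler = VNImageRequestHandler(url: imageURL, options: [:])

do {
    try handler.perform([request])
    let faces = request.results ?? []
    print("Found \(faces.count) face(s)")
    for face in faces {
        // boundingBox is in normalised coordinates (0–1), origin at bottom left.
        print("Face at \(face.boundingBox)")
    }
} catch {
    print("Face detection failed: \(error)")
}
```

All of the learning has already been done by Apple; your Mac or iPhone only has to run the resulting model, which is quick enough to happen as you browse your photos.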

Over the last few decades, different designs of ML have been developed. Perhaps the simplest to understand is generally known as supervised learning. In this, the ML system is provided with data from which it’s to learn, together with the correct answers. You could use this technique to forecast stock price movements, by giving a neural network a large set of historical prices and improving its performance in predicting how they change over time. One catch commonly encountered here is the tendency to validate the model against the same data used to train it. Not only is that ‘cheating’, but it can lead to overfitting, where the model becomes highly accurate at forecasting the data used to train it, but fails to predict well with previously unseen data.
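As a toy illustration of supervised learning and the value of a held-out validation set (not any real stock-forecasting model), here’s a short Swift sketch that fits a straight line to synthetic data by gradient descent, then judges it on data it never saw during training.

```swift
import Foundation

// A toy supervised learning sketch: fit y ≈ a*x + b by gradient descent,
// keeping some data aside for validation rather than testing on the training set.
// All data here is synthetic, purely for illustration.

// Generate synthetic observations of y = 2x + 1 with a little noise.
let samples: [(x: Double, y: Double)] = (0..<100).map { i in
    let x = Double(i) / 10.0
    return (x: x, y: 2.0 * x + 1.0 + Double.random(in: -0.5...0.5))
}

// Split: the first 80 samples train the model, the last 20 are held back.
let training = Array(samples[..<80])
let validation = Array(samples[80...])

var a = 0.0, b = 0.0          // model parameters to be learned
let learningRate = 0.001

// Gradient descent on mean squared error over the training set only.
for _ in 0..<20_000 {
    var gradA = 0.0, gradB = 0.0
    for (x, y) in training {
        let error = (a * x + b) - y
        gradA += 2 * error * x
        gradB += 2 * error
    }
    a -= learningRate * gradA / Double(training.count)
    b -= learningRate * gradB / Double(training.count)
}

// Judge the model on data it has never seen.
func meanSquaredError(_ data: [(x: Double, y: Double)]) -> Double {
    data.map { pow((a * $0.x + b) - $0.y, 2) }.reduce(0, +) / Double(data.count)
}

print("Learned a = \(a), b = \(b)")
print("Training MSE:   \(meanSquaredError(training))")
print("Validation MSE: \(meanSquaredError(validation))")
```

If the training error is low but the validation error is much higher, that’s the warning sign of overfitting.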

Unsupervised learning may sound of more limited value, but for some problems it’s the only potential solution. Analysing data to discover clusters can prove formidably difficult using conventional statistical techniques, but may be amenable to ML using unsupervised learning. Most of us can recognise clusters in simple X-Y scatterplots, but when each datapoint has five variable measurements, we struggle to visualise them. ML can perform very well in such situations, and can suggest one or more ways in which data can be grouped into clusters, which we may or may not find meaningful. That’s another important lesson learned from ML: just because a neural network can ‘see’ something in the data doesn’t mean that effect is meaningful, so we always need to view its results in the context of reality.
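For a concrete example of unsupervised learning, here’s a sketch of k-means, one of the simplest clustering techniques, applied to synthetic two-dimensional points. No labels are supplied: the algorithm proposes its own grouping, which we then have to judge for meaningfulness.

```swift
import Foundation

// A sketch of one simple unsupervised technique, k-means clustering,
// applied to synthetic 2-D points. No labels are given: the algorithm
// proposes groupings which we then have to judge for meaningfulness.

typealias Point = (x: Double, y: Double)

func distance(_ a: Point, _ b: Point) -> Double {
    sqrt(pow(a.x - b.x, 2) + pow(a.y - b.y, 2))
}

// Two synthetic clusters of points, centred on (1, 1) and (5, 5).
var points: [Point] = []
for _ in 0..<50 {
    points.append((x: 1 + Double.random(in: -0.8...0.8), y: 1 + Double.random(in: -0.8...0.8)))
    points.append((x: 5 + Double.random(in: -0.8...0.8), y: 5 + Double.random(in: -0.8...0.8)))
}

let k = 2
// Start with k randomly chosen points as the initial cluster centres.
var centres = Array(points.shuffled().prefix(k))
var assignments = [Int](repeating: 0, count: points.count)

for _ in 0..<20 {
    // Assignment step: attach each point to its nearest centre.
    for (i, p) in points.enumerated() {
        assignments[i] = (0..<k).min { distance(p, centres[$0]) < distance(p, centres[$1]) }!
    }
    // Update step: move each centre to the mean of its assigned points.
    for c in 0..<k {
        let members = points.enumerated().filter { assignments[$0.offset] == c }.map { $0.element }
        guard !members.isEmpty else { continue }
        centres[c] = (x: members.map { $0.x }.reduce(0, +) / Double(members.count),
                      y: members.map { $0.y }.reduce(0, +) / Double(members.count))
    }
}

for (c, centre) in centres.enumerated() {
    let count = assignments.filter { $0 == c }.count
    print("Cluster \(c): centre ≈ (\(centre.x), \(centre.y)), \(count) points")
}
```

With only two dimensions we could have spotted those clusters by eye; the point is that the same procedure works just as well when each datapoint has five or fifty measurements.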

There are several Big Problems that ML has been used to tackle, among the most notorious being machine translation of human language. A little experience using translation features in macOS and Google Translate shows that, while machine translation can work quite well on straightforward passages and with certain languages, it still has a long way to go before it can match the skills of a human interpreter. Knowing the limits of ML and not overhyping it is essential.

Apple makes an important distinction between two implementations of ML: on-device and off-device learning. ML performed on-device remains private, in that images and other personal data aren’t sent elsewhere for the learning to be performed. To facilitate that, Apple’s more recent chips include a neural engine, which can perform the calculations used for on-device ML at high speed, without burning up CPU cycles. However, for many problems, off-device learning is required, as the data or computation is too big even for your Mac. For example, considerable off-device learning has been performed to support Visual Look Up’s recognition of paintings, but the information transferred from your Mac is designed so that it doesn’t reveal anything personal, such as which paintings you’re most interested in.
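As an illustration of keeping inference on-device, here’s how an app might ask Core ML to prefer the neural engine when loading a model. The model file named here is hypothetical, and the .cpuAndNeuralEngine option requires macOS 13 or later.

```swift
import Foundation
import CoreML

// A sketch of how an app might run an ML model entirely on-device with Core ML.
// "FlowerClassifier.mlmodelc" is a hypothetical compiled model; a real app
// would bundle its own.

let config = MLModelConfiguration()
// Ask Core ML to prefer the Apple Neural Engine, falling back to the CPU,
// so the work stays on this Mac without burning up CPU cycles.
config.computeUnits = .cpuAndNeuralEngine

do {
    let modelURL = URL(fileURLWithPath: "/path/to/FlowerClassifier.mlmodelc")
    let model = try MLModel(contentsOf: modelURL, configuration: config)
    // The model's inputs and outputs depend entirely on how it was trained;
    // inspect its description before constructing feature providers for prediction.
    print(model.modelDescription)
} catch {
    print("Could not load the model: \(error)")
}
```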

Over the last couple of decades, neural networks have flourished as if they’re the only form of ML. That’s untrue, as there are many other techniques that have been used with varying degrees of success. In the late 1990s, a time when neural networks were in disgrace because their simple perceptron model was proving inadequate, I did some research into genetic programming, which proved successful in developing superior algorithms for some purposes. That’s one of a number of techniques which are based on biological and physical parallels: genetic programming uses the principles of evolution to select the best solution to a problem, but isn’t suitable for real-time applications. In those days, I used to run optimisations on multiple Macs for several days, and even now they’d take minutes or hours.
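To give a flavour of the evolutionary approach, here’s a minimal sketch of a genetic algorithm, a simpler relative of genetic programming; it’s purely illustrative, not the code used in that research. It evolves a population of bit-strings towards a target by selection, crossover and mutation.

```swift
import Foundation

// A minimal genetic algorithm sketch, purely for illustration. It evolves a
// population of bit-strings towards a target by selection, crossover and mutation.

let target = [Bool](repeating: true, count: 32)   // the "ideal" individual
let populationSize = 60
let mutationRate = 0.02

// Fitness: how many bits match the target.
func fitness(_ individual: [Bool]) -> Int {
    zip(individual, target).filter { $0.0 == $0.1 }.count
}

func randomIndividual() -> [Bool] {
    (0..<target.count).map { _ in Bool.random() }
}

// Single-point crossover of two parents.
func crossover(_ a: [Bool], _ b: [Bool]) -> [Bool] {
    let cut = Int.random(in: 1..<a.count)
    return Array(a[..<cut] + b[cut...])
}

// Flip each bit with a small probability.
func mutate(_ individual: [Bool]) -> [Bool] {
    individual.map { Double.random(in: 0...1) < mutationRate ? !$0 : $0 }
}

var population = (0..<populationSize).map { _ in randomIndividual() }

for generation in 0..<200 {
    // Rank by fitness and keep the better half as parents.
    population.sort { fitness($0) > fitness($1) }
    let best = population[0]
    if fitness(best) == target.count {
        print("Perfect solution found at generation \(generation)")
        break
    }
    let parents = Array(population.prefix(populationSize / 2))
    // Breed a new generation from randomly paired parents.
    population = (0..<populationSize).map { _ in
        mutate(crossover(parents.randomElement()!, parents.randomElement()!))
    }
    // Elitism: carry the best individual forward unchanged.
    population[0] = best
}

print("Best fitness achieved: \(fitness(population[0])) out of \(target.count)")
```

Real genetic programming evolves whole programs or expressions rather than fixed-length bit-strings, but the cycle of evaluation, selection and variation is the same, and it’s easy to see why it can take days rather than milliseconds.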

Since then, neural networks have received a lot of attention and a great deal of research effort. The calculations needed to use them in real time have been progressively accelerated, and now have hardware support in Apple silicon Macs, in the ANE (Apple Neural Engine), the GPU, and the CPU cores themselves. Our Macs are using ML more, to our advantage, whether we’re adjusting the colour in our photographs or just trying to discover the identity of a flower. There’s even more to come, and none of it is in the least bit spooky or alarming. Isn’t that why we use computers in the first place?