hoakley January 4, 2021 Macs, Technology, Updates

Disk read and write tests go random in Stibium 1.0b6

At the end of last year I promised to concentrate my attention on analysing disk performance data collected by my free benchmarking app Stibium. I’m delighted to say that my new version of that app has moved on a great deal, and now produces results which you can use without lengthy analysis in other apps such as Numbers. I have also improved randomisation of write tests, and added true random write tests.

stibium10b60

Testing

Early versions of Stibium ran write tests in a fixed order. This can have great effect on the results, and when repeated can add order effects to every test cycle. This new version now randomises the order of each group of write tests within a test cycle, and should thus be considered as a random write test regime. So if you set the repeats box to 10, each of those repeated tests will be conducted in a different order.

Select the new Random Sizes checkbox and Stibium will write files of random sizes between 2 MB and 2 GB, the range which previous testing has shown is most reliable and relevant to real-world disk use. The length box sets the number of random file sizes in each write cycle, and is limited to the range 5-100. If you want to write more random sizes beyond 100, use the repeats setting in the line above as a multiplier: for example, to write 210 test files of random sizes, set length to 70 and repeats to 3.

Read tests are effectively performed in random order in any case, as that’s determined by the file system, which varies each time the files in a folder are read. Checking the Random Sizes checkbox has no effect on read tests.

In that way, Stibium now offers random-order fixed-size write tests, completely random write tests, and random read tests.

One earlier problem with read tests has been that small hidden files in a folder could get included in test results. Although Stibium still reads files of all sizes, those smaller than 10 KB aren’t passed for analysis, and their results are discarded.

Analysis

Stibium now performs two additional types of analysis on all the results it collects during each read or write test.

The first is to gather all results for each file size, and calculate the median of each group. This has little value when the Random Sizes checkbox is ticked, as it’s unlikely that any two test files will have the same size. Neither does it help when repeats is set to 1 for write testing, or when reading just one cycle of write tests. This is primarily intended when running tests with multiple write repeats.

When you set repeats to a number greater than 1 and then run a write test, Stibium writes one file of each of the fixed sizes (2 MB to 2 GB) in random order, then repeats the same group of write sizes as many times as you have specified repeats. Similarly, when Stibium reads multiple files of the same size, or repeats its tests more than once, it gets more than one measurement for each file size.

In this median analysis, Stibium now gives the median transfer rate for each file size it has tested. For a number of statistical reasons, medians are preferable to averages in this situation.

Stibium continues to provide overall average transfer rates, the overall median and their range. However, these need careful interpretation, as the distribution of file sizes between 2 MB and 2 GB isn’t even when using fixed file sizes, and will only become even for random writes when a large number is used in the length box. This is one of the reasons why you’ll see differences between the overall figures given for transfer rate. It can also be a confounding factor in other benchmarking methods, something to bear in mind when looking at their results.

StibAshurTimes

Because of that, Stibium now performs linear regression of test times against file size to calculate the ‘regressed’ transfer rate, as I performed manually using data from previous tests such as that above. At present, while I continue my research into more robust methods, this uses standard non-robust linear regression. It’s therefore susceptible to influence by outliers and other distorting effects. I will replace the current method with something more robust as soon as I’m happy that I’ve identified an appropriate method.

Results

Although Stibium still doesn’t have a built-in feature for charting results, the report which it writes now contains a lot more information. This includes:

all individual test results in CSV format, when the Verbose option is enabled;
average, median and range of transfer rates for the whole test;
a table of median transfer rates grouped by file size, in CSV format;
the overall transfer rate calculated using linear regression;
the overall average transfer rate for the whole group of tests;
details of the folder path used for tests, which also gives the volume name;
details of the Mac used and its physical memory;
macOS version, and that of Stibium;
start and end date and time of the tests.

More will come in the future, including the FileVault/encryption status of the volume tested, I hope.

As before, you can select and copy any of those results from that scrolling view. You can also now export the whole contents in a text file, using the Save Report… command in the File menu.

To make results more comprehensible, transfer rates are now given in standard units such as MB/s and GB/s.

Finally, when each test is completed, the regressed transfer rate, as best estimator of overall performance, is now shown in a box. That is given separately for write and read tests, so if you run each test, the two boxes will display the most recent of those results.

Next steps

There’s now a lot to Stibium and its tests, and my priority in addition to fixing bugs is to document it thoroughly, as well as improving on the current method of linear regression. I expect to release that next version, proably as a first full release. I will then add charting for the second release. As Stibium also now takes part in my auto-update system, you should be able to download future releases more easily.

If there are any features which you want me to include in the next version, please put your case now.

Stibium version 1.0b6 is available from here: stibium10b6

Enjoy!

20Comments

Add yours

1

Rocky on January 4, 2021 at 8:30 am

A few thoughts based on my rapidly fading training in statistics (i.e. enough to get me into trouble most of the time):

– Any way to add a confidence interval to Stibium headline values? Ranges are nice, but can be misleading if outliers or clusters (randomly) happen. Maybe show CIs only if tests are repeated “enough” times?

– What happens to headline value repeatability when you add randomness, and how important is that? For example, if you use Length 70 and Repeats 3 on Monday, and I use 35 and 6 on Tuesday, and someone else uses 42 and 5 on Wednesday, how close will our numbers be? Perhaps another reason to include at least rough CIs.

– Not only is testing hardware + OS becoming less deterministic over time (e.g. skipping a range of file sizes), testing must add randomness, too? The world gets weirder and less intuitive.

– Linear regression seems like that old joke: the worst way to compute things, except for all the others. Good luck with your search!

LikeLiked by 1 person
- 2
  
  hoakley on January 4, 2021 at 9:30 am
  
  Thank you.
  Further analysis is on pause while I get my robust methods honed. For medians, the measure of dispersion I’ve always used is the quartile range, but here you tend to get just one or two outliers, which may not be apparent without looking at the range.
  I haven’t yet looked at the performance of different estimators in different test regimes: that’s something the app needs to be ready for, and I’ve only been testing this version for a couple of hours, which have mainly be devoted to looking for bugs.
  Howard.
  
  LikeLike
3

EcleX on January 4, 2021 at 9:09 am

Great! It would be useful also to have input/output (I/O) operations per second (IOPS).

LikeLiked by 1 person
- 4
  
  hoakley on January 4, 2021 at 9:33 am
  
  Thank you.
  I’ve looked at that, and you can’t get IOPS from real-world tests like these. It’s perhaps worth reviewing the accounts on Wikipedia and in this excellent overview to realise why, and to appreciate that, if they’re ever relevant to the real world, it’s probably only on hard disks.
  Howard.
  
  LikeLike
- 5
  
  hoakley on January 4, 2021 at 9:35 am
  
  I’d also be very interested to see some results from different SSDs in Sierra, please. As far as I’m aware, no one has run any tests with this app on Sierra.
  Howard.
  
  LikeLike
  - 6
    
    EcleX on January 4, 2021 at 11:36 am
    
    In can do such tests in macOS 10.12.6 (16G2136) Sierra. Which configuration should I use?
    
    LikeLiked by 1 person
    - 7
      
      hoakley on January 4, 2021 at 11:46 am
      
      Thank you.
      At this stage, exploring the different options can be very interesting. My ‘gold standard’ sequence which I think returns the most reliable results is:
      Set the top popup to 0x41 with No Cache √ and repeats at 10, Random Sizes unticked. Then quit the app, restart, let the Mac settle for a minute or two while you create a folder to contain the test files. Click on the middle row Write… button and leave the trackpad/mouse alone until the results are written out. Save them to a report file, and set repeats to 1, then quit the app.
      Restart, allow the Mac to settle again, open Stibium, check it’s set to a repeats of 1, then run the Read… test (middle row) on the folder of test files you just wrote. Save the results to a report file.
      You should then have a good set of write and read results for that disk.
      You can then compare those with other test settings. In many cases, you can get away with just running one repeat of the write tests then doing the read without restarting – that depends on how much caching is taking place.
      Enjoy exploring!
      Howard.
      
      LikeLike
  - 8
    
    EcleX on January 4, 2021 at 12:24 pm
    
    Thanks for the information. I have tried, but either I did something terribly wrong, or there is a bug. What I got is indicated below. Besides, when selecting all Stibium text to copy and paste, the Mac only pasted the portions indicated between the “–” lines. So, I had to copy-paste five times in total:
    
    ——————————————————————————————-
    type,size,time,rate
    w,1000,0.000183427,5451760
    
    Test path /Users/EcleX/Desktop/Write 0x41 No Cache 10 repeats.tiff
    Mac model = iMac18,3
    —
    Machine name = iMac
    Version = 1.0
    —
    Hardware UUID = XXX
    CPU = Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz, 4 cores
    Logic board ID = Mac-XXX
    —
    Physical memory = 40 GB
    Running macOS version Version 10.12.6 (Build 16G2136)
    Stibium version 1.0b6
    Started at 2021-01-04 12:09:10 +0000
    Ended at 2021-01-04 12:09:10 +0000
    ——————————————————————————————-
    
    As can be seen, the test was instantaneous; starting and ending at the same time (!). Besides, the generated “Write 0x41 No Cache 10 repeats.tiff” file shows with 1 kb in Finder and with 1.000 bytes (4 KB on disk) using “File – Get Info”. When trying to open with a graphics application, it reports that the file is broken and cannot be opened. How to fix that? Thanks again.
    
    LikeLiked by 1 person
    - 9
      
      hoakley on January 4, 2021 at 12:27 pm
      
      Thank you.
      You’re using the wrong Write… and Read… buttons: those are for single files. Use the ones in the row below for Series tests.
      No, you won’t be able to open the file. It’s a test file containing all identical bytes, not a real TIFF.
      Howard.
      
      LikeLike
  - 10
    
    EcleX on January 4, 2021 at 12:50 pm
    
    Sorry about that; my fault. Wow! this time worked as you said:
    
    ————————————————————————————–
    Write = 160
    Average = 1,8 GB/s
    median = 1,94 GB/second (1,06 GB/s – 2,14 GB/s)
    
    size_write,rate_write
    2000000,1,55 GB/s
    4000000,1,49 GB/s
    6000000,1,39 GB/s
    8000000,1,43 GB/s
    10000000,1,47 GB/s
    20000000,1,7 GB/s
    40000000,1,85 GB/s
    60000000,1,97 GB/s
    80000000,1,95 GB/s
    100000000,1,97 GB/s
    200000000,1,98 GB/s
    400000000,2,01 GB/s
    600000000,2,02 GB/s
    800000000,2,03 GB/s
    1000000000,2,01 GB/s
    2000000000,2,04 GB/s
    
    Write n = 160
    Regressed rate = 2,03 GB/s
    
    Overall = 160
    Average = 1,8 GB/s
    
    Test path /Users/EcleX/Desktop
    Mac model = iMac18,3
    —
    Machine name = iMac
    Version = 1.0
    —
    Hardware UUID = XXX
    CPU = Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz, 4 cores
    Logic board ID = Mac-XXX
    —
    Physical memory = 40 GB
    Running macOS version Version 10.12.6 (Build 16G2136)
    Stibium version 1.0b6
    Started at 2021-01-04 12:34:31 +0000
    Ended at 2021-01-04 12:35:20 +0000
    ————————————————————————————–
    
    Albeit, as shown above, I had to copy-paste four times when doing it from the Stibium window. It also generated “Stibium_2021-01-04T12_34_31Z.text”, which contained the same information above. And also generated 160 “.tiff” files, starting and ending as shown below:
    
    1-0.tiff
    1-1.tiff
    1-2.tiff
    …/…
    10-13.tiff
    10-14.tiff
    10-15.tiff
    
    Can I delete such TIFF files, or could they be useful later on?
    
    Please, let me know If such procedure is OK, to continue with the benchmarking (read, other disks, etc). Thanks for all.
    
    LikeLiked by 1 person
    - 11
      
      hoakley on January 4, 2021 at 1:07 pm
      
      Excellent, thank you.
      If you’re going to copy and paste, the easy thing to do is click on the scrolling text view to make it active, Cmd-A to select all, Cmd-C to copy, then you can paste the entire contents in one go. It’s easier still now to save as a text file instead.
      Once you’ve written that folder of test files, you can use the Read… button in the middle row to read them all and get the read result. This time, ensure that repeats is set to 1, or it will read those 160 files many times. Once you’ve finished with those test files and don’t want to read them again, just trash the whole folder.
      To test other disks, simply create your folder to contain test files anywhere on that disk, and select that when you choose where to use for the Write… test, and subsequent Read… tests.
      Howard.
      
      LikeLike
  - 12
    
    EcleX on January 4, 2021 at 8:10 pm
    
    Thanks. These are the read results from the same internal booting disk:
    
    Read (noncache) = 160
    Average = 1,98 GB/s
    median = 2,25 GB/second (254,4 MB/s – 2,46 GB/s)
    
    size_read,rate_read
    2000000,1,17 GB/s
    4000000,1,6 GB/s
    6000000,1,44 GB/s
    8000000,1,91 GB/s
    10000000,2,02 GB/s
    20000000,1,99 GB/s
    40000000,2,34 GB/s
    60000000,2,27 GB/s
    80000000,2,3 GB/s
    100000000,2,31 GB/s
    200000000,2,31 GB/s
    400000000,2,28 GB/s
    600000000,2,29 GB/s
    800000000,2,31 GB/s
    1000000000,2,28 GB/s
    2000000000,2,31 GB/s
    
    Read (noncache) n = 160
    Regressed rate = 2,3 GB/s
    
    Overall = 160
    Average = 1,98 GB/s
    
    Next I will boot from an external clone and report.
    
    LikeLiked by 1 person
    - 13
      
      hoakley on January 4, 2021 at 8:12 pm
      
      Thank you – they’re very interesting and useful.
      Howard
      
      LikeLike
  - 14
    
    EcleX on January 4, 2021 at 11:00 pm
    
    Thank you for making Stibium. Here is a summary of the tests that I have done:
    
    1. iMac 27-inch Retina 5K BOOTING from its internal 2TB disk:
    
    1.1. On such internal disk: 2.3 GB/s read & 2.03 GB/s write.
    Repeating such test some hours later (as the ones below): 2.32 GB/s read & 2.193 GB/s write.
    
    1.2. On Samsung Portable SSD T5 2TB SuperDuper clone: 513 MB/s read & 509.9 MB/s write.
    
    2. iMac 27-inch Retina 5K BOOTING from Samsung Portable SSD T5 2TB SuperDuper clone:
    
    2.1. On such T5 disk: 519 MB/s read & 508.6 MB/s write.
    
    2.2. On iMac 27-inch Retina 5K internal 2TB disk: 2.52 GB/s read & 2.07 GB/s write.
    
    So, the results are quite consistent, independently of the booting internal or external disk. If you want the more complete report, just let me know and I will post it.
    
    Notes:
    
    – Stibium created 160 files for benchmarking (2 MB to 2 GB) = 53’3 GB.
    
    – When booting from the external clone, the Mac took 25 minutes until it showed no disk or CPU activity, as checked with MenuMeters. When booting from the internal disk, it slowed down in seconds.
    
    – After booting from the external clone and then from the internal disk, Time Machine started a new backup from scratch on other external disk. I tried to prevent it to no avail using the inherit Terminal command sudo tmutil inheritbackup. So, I guess that I have lost such backups and must start over from scratch.
    
    – It would be good if Stibium had a progress bar or animation to show that it is working. Also the possibility yo cancel a particular test. I say that because on one instance, when selecting the Stibium files, I did not click a folder in the path, but double clicked it, and that triggered Stibium to do the test with the contents of such folder, so I forced quit Stibium and started over.
    
    LikeLiked by 1 person
    - 15
      
      hoakley on January 4, 2021 at 11:25 pm
      
      Than you so much – those look excellent, and repeatable.
      I’m publishing a full tutorial here on Wednesday morning to help others run their tests.
      Regarding progress bars/animations I very wary of those, as they can affect the performance on some systems. I intend doing further testing once the first version is released.
      Howard.
      
      LikeLike
  - 16
    
    EcleX on January 5, 2021 at 8:17 am
    
    Thanks. Perhaps then, instead of an animation, just a legend saying “Please, wait. Stibium is benchmarking” or similar could be useful, together with a cancel button. Just suggestions for your consideration.
    
    LikeLiked by 1 person
    - 17
      
      hoakley on January 5, 2021 at 12:34 pm
      
      Thank you.
      For the moment you won’t get a cancel button, because Stibium won’t read it until it has completed the tests. I’m going to try putting the tests into a background task later, as I think it will only increase the variance in results and give inaccurate numbers.
      Howard.
      
      LikeLike
18

EcleX on January 4, 2021 at 12:59 pm

BTW, that was for the internal booting disk? How to select other disk?

LikeLiked by 1 person
- 19
  
  jeremyc99 on January 4, 2021 at 7:05 pm
  
  Use any of the read or write buttons, select Mac name and choose a drive then folder.
  
  LikeLiked by 1 person
  - 20
    
    EcleX on January 4, 2021 at 11:00 pm
    
    Thanks!
    
    LikeLiked by 1 person

·Comments are closed.

Share this:

Related