System Performance: Miscellaneous Workloads

Standardized benchmarks such as UL's PCMark 10 and BAPCo's SYSmark take a holistic view of the system and process a wide range of workloads to arrive at a single score. Some systems are required to excel at specific tasks - so it is often helpful to see how a computer performs in specific scenarios such as rendering, transcoding, JavaScript execution (web browsing), etc. This section presents focused benchmark numbers for specific application scenarios.

3D Rendering - CINEBENCH R23

We use CINEBENCH R23 for 3D rendering evaluation. R23 provides two benchmark modes - single threaded and multi-threaded. Evaluation of different PC configurations in both supported modes provided us the following results.

3D Rendering - CINEBENCH R23 - Single Thread

3D Rendering - CINEBENCH R23 - Multiple Threads

In the single-threaded case, Tiger Lake's Core i7 with a 38W PL1 takes the lead, as expected. The Core i5 NUC configurations come right behind, hobbled a bit by their 28W PL1 setting. In the multi-threaded case, the sheer number of cores help the octa-core Renoir mini-PCs emerge on top, followed by the 35W Comet Lake-equipped OnLogic HX500 well behind. Again, the important takeaway here is that the relative difference between the active and passively-cooled builds is minuscule.

Transcoding: Handbrake 1.5.1

Handbrake is one of the most user-friendly open source transcoding front-ends in the market. It allows users to opt for either software-based higher quality processing or hardware-based fast processing in their transcoding jobs. Our new test suite uses the 'Tears of Steel' 4K AVC video as input and transcodes it with a quality setting of 19 to create a 720p AVC stream and a 1080p HEVC stream.

Transcoding - x264

Transcoding - x265_10bit

Software transcoding rates heavily depend on the number of cores, followed by the performance potential of each core. The high-end Renoir-based systems with eight cores have a significant lead that the quad-core TGL-U systems are unable to match up to.

Transcoding - QuickSync H.264

Transcoding - QuickSync H.265 10bit

While the Renoir systems support VCE, we are not including results from those transcodes because the end results are not the same - and it wouldn't be an apples-to-apples comparison. The rate here is a function of the speed of the GPU clock feeding into the QuickSync block. All TGL-U systems end up with approximately the same numbers.

Archiving: 7-Zip 21.7

The 7-Zip benchmark is carried over from our previous test suite with an update to the latest version of the open source compression / decompression software.

7-Zip Compression Rate

7-Zip Decompression Rate

7-Zip is again a multi-threaded workload where Renoir's octa-core SKUs grab the top spots, and Tiger Lake's single-threaded performance advantage is not quite relevant.

Web Browsing: JetStream, Speedometer, and Principled Technologies WebXPRT4

Web browser-based workloads have emerged as a major component of the typical home and business PC usage scenarios. For headless systems, many applications based on JavaScript are becoming relevant too. In order to evaluate systems for their JavaScript execution efficiency, we are carrying over the browser-focused benchmarks from the WebKit developers used in our notebook reviews. Hosted at BrowserBench, JetStream 2.0 benchmarks JavaScript and WebAssembly performance, while Speedometer measures web application responsiveness.

BrowserBench - Jetstream 2.0

BrowserBench - Speedometer 2.0

From a real-life workload perspective, we also process WebXPRT4 from Principled Technologies. WebXPRT4 benchmarks the performance of some popular JavaScript libraries that are widely used in websites.

Principled Technologies WebXPRT4

Browsers enjoy multiple cores, but it is single-threaded performance that ends up as a performance limiter. Here, Tiger Lake shines across the board - particularly in the real-world Principled Technologies WebXPRT4 workload.

Application Startup: GIMP 2.10.30

A new addition to our systems test suite is AppTimer - a benchmark that loads up a program and determines how long it takes for it to accept user inputs. We use GIMP 2.10.30 with a 50MB multi-layered xcf file as input. What we test here is the first run as well as the cached run - normally on the first time a user loads the GIMP package from a fresh install, the system has to configure a few dozen files that remain optimized on subsequent opening. For our test we delete those configured optimized files in order to force a fresh load every second time the software is run.

AppTimer: GIMP 2.10.30 Startup

As it turns out, GIMP does optimizations for every CPU thread in the system, which requires that higher thread-count processors take a lot longer to run. So the test runs quick on systems with fewer threads, however fast cores are also needed. This combination works out well for Tiger Lake, and we see the four TGL-U systems take up the top spots.

System Performance: UL and BAPCo Benchmarks GPU Performance
Comments Locked

18 Comments

View All Comments

  • meacupla - Sunday, July 24, 2022 - link

    At that point, you may just want to use a white noise maker
  • abufrejoval - Sunday, July 24, 2022 - link

    Thanks, that's what I've been hearing, too!

    And in a way that's what I've been thinking without hinting it explicitely to Ryan1981: Getting yourself tuned to zero noise is both very expensive and counter-productive.

    Humanity has operated on communal and external noises for hundreds of thousands of years: a large part of our brain is designed to do nothing but discriminate between sounds that indicate danger and those that don't. A total absense of sound only has your brain increase the sensitivity of your receptors to the point where minute sounds become a bother.

    Instead of making electronics completely silient, we should have them emit a soft snore or other comforting noises akin to humans being human.

    There is an auditorium in the midst of Gibraltar's rock, that offers a level of silence no recording studio can match. Anyone left alone in there is bound to develop tinnitus as the brain keeps increasing the sensitivity in your in-ear "DSPs" to the point where they get the "social noise" evolution set as a base line.
  • abufrejoval - Saturday, July 23, 2022 - link

    Your previous Akasa tests had me hoping, that I’d be able to silence any NUC, if a passive Mini-ITX based solution, like the one I’d been using for Gemini Lake Atoms, wasn’t going to be available.

    I had sampled a NUC or Brix once before and was quite shocked at the nervous noise it generated: the fan gave you an audio variant of a CPU graph that you couldn’t just click away. And at top load, it was an unacceptable howler.

    I wanted something with a bit more punch than an Atom, but a similar idle power and obviously a notebook SoC should be able to do that. But the only way to get that stationary and at a reasonable price (with a full complement of RAM) was to get a NUC.

    When I hit across a NUC8i7BEK with the “double sized” Iris 655 48EU iGPU for only €300, my resistance to the form factor melted away and I gave it a try, even if the primary use case—a Linux based HCI server—had zero use for a GPU. After all you never know if it might be recycled as a desktop later and I was just curious to see how this “Apple spec” SoC would perform.

    It turned out that they key to making it unnoticeable was to ensure that the fan would never rev beyond 3200rpm and for that I had to ensure that PL2 would never last longer than 10 seconds nor exceed 50 Watts, while a PL1 of 15 Watts ensured low fan revs even for a power virus.
    I had just ordered another, when I saw a hexa-core i7-10700U based NUC (with a very ordinary 24EU iGPU) going for just €50 extra. So I cancelled and got that one instead. It turned out much more difficult to tame, because Intel was desperate to wring performance leadership out of 14nm in a tiny NUC and only Watts can get you there. I managed again, playing with the PL1/PL2/TAU to get a system rather good for those sprints where the Atoms were trying my patience, yet with a low-enough power and noise footprint to operate 24x7 as a server.

    Half a year later in February 2021 I landed a fresh Tiger Lake NUC11PAHi7, that’s played hard to get ever since. But mine is a Panther Canyon variant, evidently consumer optimized, with a completely different layout of ports for which Akasa doesn’t build a chassis. I don’t know if Intel already made these differentiations in earlier generations, but it’s rather annoying when only the number of models increases, not their availability.

    Again, that Tiger could also be tamed to unnoticeability via the excellent control Intel’s NUCs offer in the BIOS. Of course, even better would be a set of CLI tools which allow you to adjust these things from Linux…

    In terms of snappiness, none of them needs to hide, because at least for a couple of seconds they will all clock to 4.5 GHz or more and match any desktop. For brutal workloads I have other machines with 16 or 18 cores and 140-150 Watts of TDP made tolerable via lots of giant Noctua fans and coolers.

    While there is no noticeable difference in scalar performance between the NUC8 and NUC10, the two extra cores on the NUC10 i7-10700U will obviously deliver a bit of extra punch until TAU runs out. But the Tiger Lake annihilates their value with better IPC: with its four cores it matches pretty exactly the six cores of its predecessor on any parallelized workload while the single core performance is on par with a Zen 3 at the same clocks.

    The “double sized” Iris 655 with its 128MB of eDRAM on the NUC8 turned out to be a paper tiger, effectively adding only 50% of extra power vs. a normal 24EU UHD iGPU at the expense of quite a bit of silicon real-estate and production complexity. If Intel were to sell “Apple spec” chips only, I doubt they’d be nearly as profitable. The list price of an i7-8565U is $409 while the list price of an i7-8559U is only $22 higher. They are close to identical on the CPU side, but the GT3e extra die area and the 128MB eDRAM chip must have cost a pretty penny! I still own a notebook with an i5-6267U, a dual-core Skylake variant of GT3e where the CPU cores were probably the smallest piece of the chip’s silicon pie.

    Really astounding was how badly it got beaten by the 96 Tiger Lake Xe iGPU, which doesn’t have eDRAM for extra bandwidth: that one scaled rather nicely to 4x 24EU performance, beating my Ryzen 3 based 5800U APUs in most benchmarks, just as you describe.

    I don’t really know where that performance is coming from, because DRAM bandwidth is very similar across the board and only around 40GB/s. All my NUCs run with 64GB and while the timings may have gone from DDR4-2400 (NUC8) to DDR4-3200 (NUC11), that’s just adding wait states on these low power devise.

    I love running Google Maps in 3D globe view on Chrome derived browsers at 4k, because it really shows what this low power hardware is capable of with perhaps the most efficient 3D pipeline on the planet: it puts Microsoft’s best flight simulator to shame on an RTX 2080ti!

    It proves the main issue is software, not hardware. But existing real-world games are no fun on these boxes, even the Tiger Lake needs another power of 10 to become reasonably attractive at 4k.

    Another aftermarket NUC solution would evidently be one that adds a beefy active cooling, say a Noctua NH-L9i or even a Noctua NH-L9x65. Obviously these chips could sustain 65 Watts with proper cooling and then deliver quite a reasonable desktop performance in only a slightly bigger form factor.

    BTW: for my use as µ-servers I've added TB3 based 10Gbase-T NICs so the NVMe based SSDs contributing bricks to the Gluster file system don't get slowed down to inacceptable levels.

    I'd have preferred to make do with TB3 based networking via direct connect cables, but fell afoul the fact that Thunderbolt ports don't have MACs and will randomly generate them on every boot or plug event. It's the software.... again!
  • xane - Sunday, July 24, 2022 - link

    Interesting to see continued development, but for me nothing beats Cirrus7 cases from Germany. I do understand it's subjective, though.
  • Hixbot - Tuesday, July 26, 2022 - link

    Ganesh, I've been politely asking you add noise testing to your mini-pc tests for the last couple years. Noise is a very important characteristic to home theater PCs.
    Here we are with a fanless offering with some obvious thermal compromises, but your other reviews don't highlight noise at load and therefore cannot be compared.
  • ganeshts - Tuesday, July 26, 2022 - link

    If there is any noise / electrical coil whine, or anything of that sort, I do make a mention of it in the concluding section (like I did in the Zotac ZBOX CI660 nano).

    Other than that, the ambient noise / noise floor is too high in the environment where these systems are tested for a sound meter to pick up anything at all from them.
  • kepstin - Wednesday, August 24, 2022 - link

    You should really consider retiring/updating that Gimp application startup benchmark… The multithreaded scaling being weird is actually a bug where it's doing extra redundant work that it shouldn't have been, and has been fixed (or at least worked around) in newer versions.
  • storapa - Thursday, September 1, 2022 - link

    Had an old NUC3 with the old version of the Akasa Newton. Worked like a charm for years, until the board died (google results suggests it was a common problem with NUC3, not the case).

    But note that the kensington "lock" doesn't add any security, as you can remove the entire backplate with 4 screws..

Log in

Don't have an account? Sign up now