Compute

Jumping into pure compute performance, this is another scenario where the 290X shouldn’t throttle as much, and as such the performance differences between the 290 and 290X should be closer to what they are on paper. With compute workloads the ROPs aren’t being hit hard, and the resulting power and thermal savings let both cards operate at close to their maximum boost clocks.

As always we'll start with our DirectCompute game example, Civilization V, which uses DirectCompute to decompress textures on the fly. Civ V includes a sub-benchmark that exclusively tests the speed of their texture decompression algorithm by repeatedly decompressing the textures required for one of the game’s leader scenes. While DirectCompute is used in many games, this is one of the only games with a benchmark that can isolate the use of DirectCompute and its resulting performance.

As with the 290X, Civ V can’t tell us much of value because we’re running into CPU bottlenecks, not to mention increasingly absurd frame rates. The 290 is slower than the 290X due to its lower clockspeeds and missing CUs, but only marginally so.

Our next benchmark is LuxMark 2.0, the official benchmark of SmallLuxGPU 2.0. SmallLuxGPU is an OpenCL accelerated ray tracer that is part of the larger LuxRender suite. Ray tracing has become a stronghold for GPUs in recent years as ray tracing maps well to GPU pipelines, allowing artists to render scenes much more quickly than with CPUs alone.

With both cards unthrottled and bound solely by shader performance, it’s an outright foot race for the Radeon cards. The 290 trails the 290X by around 9%, closely mirroring the difference in CU count between the two cards. That said, the 290 is being very closely chased by the 280X, as Hawaii in general seems to have trouble getting the most out of its shader hardware on this benchmark.
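To make the "mirrors the CU count" point concrete, here is a minimal sketch of the arithmetic. The CU counts (44 for the 290X, 40 for the 290) are not stated in this section; they are taken from the cards' published specifications and should be treated as assumptions here.

```python
# Assumed CU counts from the cards' public spec sheets (not from this section).
CU_290X = 44
CU_290 = 40


def cu_deficit(smaller: int, larger: int) -> float:
    """Return the fraction by which `smaller` falls short of `larger`."""
    return 1.0 - smaller / larger


deficit = cu_deficit(CU_290, CU_290X)
print(f"290 has {deficit:.1%} fewer CUs than 290X")
```

With both cards holding similar clocks and the workload bound by shader throughput, a gap of roughly 9% is about what the CU deficit alone would predict.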

Our 3rd compute benchmark is Sony Vegas Pro 12, an OpenGL and OpenCL video editing and authoring package. Vegas can use GPUs in a few different ways, the primary uses being to accelerate the video effects and compositing process itself and to accelerate the video encoding step. With video encoding being increasingly offloaded to dedicated DSPs these days we’re focusing on the editing and compositing process, rendering to a low CPU overhead format (XDCAM EX). This specific test comes from Sony, and measures how long it takes to render a video.

There’s not enough of a GPU performance difference between the two cards to matter with this test. Both tie at 22 seconds.

Our 4th benchmark set comes from CLBenchmark 1.1. CLBenchmark contains a number of subtests; we’re focusing on the most practical of them, the computer vision test and the fluid simulation test. The former is a useful proxy for computer imaging tasks where systems are required to parse images and identify features (e.g. humans), while fluid simulations are common in professional graphics work and games alike.

In the CLBenchmark fluid simulation the 290X and 290 take the top spots as expected, with the 290 trailing once more by 9%. However, both Hawaii cards are still struggling with the computer vision benchmark, leading to the 290 being edged out by the 7970 of all things.

Moving on, our 5th compute benchmark is FAHBench, the official Folding @ Home benchmark. Folding @ Home is the popular Stanford-backed research and distributed computing initiative that has work distributed to millions of volunteer computers over the internet, each of which is responsible for a tiny slice of a protein folding simulation. FAHBench can test both single precision and double precision floating point performance, with single precision being the most useful metric for most consumer cards due to their low double precision performance. Each precision has two modes, explicit and implicit, the difference being whether water atoms are included in the simulation, which adds quite a bit of work and overhead. This is another OpenCL test, as Folding @ Home has moved exclusively to OpenCL this year with FAHCore 17.

Generally Tahiti and Hawaii are strong performers in the GPU compute arena, but that isn’t of particular help to the 290 here, as it loses out to the GTX 780 in every mode. In single precision, FAHBench has trouble putting Hawaii to good use at times, while in the double precision tests the 290 and 290X fall behind because their 1/8th DP rate leaves them with lower DP throughput than Tahiti.
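The double precision gap can be sketched from theoretical peak throughput. In the snippet below only the 1/8th DP rate for the 290 and 290X comes from the text above; the shader counts, boost clocks, and Tahiti's 1/4 DP rate are assumptions taken from public spec sheets, and the 280X is used as the Tahiti example. These are paper peaks at full boost, not measured FAHBench results.

```python
def peak_gflops(shaders: int, clock_ghz: float, dp_ratio: float = 1.0) -> float:
    """Theoretical peak GFLOPS: 2 FLOPs (one FMA) per shader per clock,
    scaled by the double precision ratio."""
    return shaders * 2 * clock_ghz * dp_ratio


# (shader count, boost clock in GHz, DP rate); spec-sheet values, assumed.
cards = {
    "290X (Hawaii)": (2816, 1.000, 1 / 8),
    "290 (Hawaii)": (2560, 0.947, 1 / 8),
    "280X (Tahiti)": (2048, 1.000, 1 / 4),
}

dp = {name: peak_gflops(s, c, r) for name, (s, c, r) in cards.items()}

# Print cards from highest to lowest theoretical DP throughput.
for name, gflops in sorted(dp.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {gflops:.0f} GFLOPS DP (theoretical peak)")
```

Under these assumptions Tahiti comes out ahead of both Hawaii cards on paper despite having fewer shaders, which is consistent with the double precision ordering described above.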

Wrapping things up, our final compute benchmark is an in-house project developed by our very own Dr. Ian Cutress. SystemCompute is our first C++ AMP benchmark, utilizing Microsoft’s simple C++ extensions to allow the easy use of GPU computing in C++ programs. SystemCompute in turn is a collection of benchmarks for several different fundamental compute algorithms, as described in this previous article, with the final score represented in points. DirectCompute is the compute backend for C++ AMP on Windows, so this forms our other DirectCompute test.

SystemCompute is another benchmark where the 290 and 290X do not experience meaningful throttling, and as such the two are separated by a wider margin than in our gaming benchmarks. In this case the 290 yet again trails the 290X by 9%, though it still enjoys a considerable lead over the GTX 780 and all other NVIDIA cards.


295 Comments

  • HisDivineOrder - Tuesday, November 5, 2013 - link

    Unless the cards heat up your water so much you start watching your CPU get too hot for whatever overclock you've got. ;)

    What will you do then? Weep? Shake your head? Get another radiator? You might want to give them their own loop.
  • techkitsune - Tuesday, November 5, 2013 - link

    Three Delta Fans. 9,000 RPM. -64dBA. If those don't keep whatever I have attached to them nice and cool, there's something wrong with them, there's a blockage in the lines/block/radiator, or I screwed up applying the thermal paste. :) Also, CPU always comes first in the loop since it's the lower power device versus a GPU. If anything, the CPU would be heating the GPU.
  • faster - Tuesday, November 5, 2013 - link

    I agree. With this chip set to run at 95C, it is going to put abnormally high load on any cooling loop stressing the other components. Best to have its own dedicated water cooling loop.
  • techkitsune - Tuesday, November 5, 2013 - link

    " Best to have its own dedicated water cooling loop."

    Given how low-power newer CPUs are, no, it makes sense to have the CPU first in the loop as running a second loop does nothing regarding your reservoir temperatures, you're still drawing from the same cooling source.

    I do liquid cooling with 1,000w pieces in form factors far smaller than that GPU (try 1,000w in 30mm x 30mm.)
  • DMCalloway - Tuesday, November 5, 2013 - link

    .... and what?.... put it in the closet? LOL
  • DMCalloway - Tuesday, November 5, 2013 - link

    .... and everyone knows those fans are whisper quiet. In essence what's the difference here? ; )
  • The Von Matrices - Tuesday, November 5, 2013 - link

    You can't rate a card on speculation on what a custom card would be. You have to rate it for what it is now, and the product being sold today is unacceptably loud. There will be separate reviews for custom cards in the future and they will be judged on their own merits.
  • techkitsune - Tuesday, November 5, 2013 - link

    " the product being sold today is unacceptably loud."

    One of my Delta fans is almost twice as loud as one of these GPUs with reference coolers at max speed. You're still looking at the raised indoor voice level of noise, I have three Deltas.
  • HisDivineOrder - Tuesday, November 5, 2013 - link

    We all know Delta fans are loud. Delta black fans are loudest. They are also unacceptably loud for the majority of users.
  • techkitsune - Tuesday, November 5, 2013 - link

    Delta blacks are not the loudest. I've got some 80mm and 120mm impeller fans that can fit in a case, and do sound like jet engines, -83dBA at the high end. You can find just about anything in China! :)
