The Detailed Explanation of GPU Turbo

Under the hood, Huawei uses TensorFlow neural network models that are pre-trained by the company on a title-by-title basis. By examining the title in detail, over many thousands of hours (real or simulated), the neural network can build its own internal model of how the game runs and its power/performance requirements. The end result can be put into one dense sentence:

Optimized Per-Device Per-Game DVFS Control using Neural Networks

In the training phase, the network analyzes and adjusts the SoC’s DVFS parameters in order to achieve the best possible performance while minimizing power consumption. This entails trying its best to hit the nearest DVFS states on the CPUs, GPU, and memory controllers that still allow for hitting 60fps, yet without going to any higher state than is necessary (in other words, minimizing performance headroom). The end result is that for every unit of work that the CPU/GPU/DRAM has to do or manage, the corresponding hardware block has the perfectly optimized amount of power needed. This has a knock-on effect for both performance and power consumption, but mostly in the latter.

The resulting model is then included in the firmware for devices that support GPU Turbo. Each title has a specific network model for each smartphone, as the workload varies with the title and the resources available vary with the phone model. As far as we understand the technology, on the device itself there appears to be an interception layer between the application and GPU driver which monitors render calls. These serve as inputs to the neural network model.  Because the network model was trained to output the DVFS settings that would be most optimal for a given scene, the GPU Turbo mechanism can apply this immediately to the hardware and adjust the DVFS accordingly.

For SoCs that have them, the inferencing (execution) of the network model is accelerated by the SoC’s own NPU. Where GPU Turbo is introduced in SoCs that don’t sport an NPU, a CPU software fall-back is used. This allows for extremely fast prediction. One thing that I do have to wonder is just how much rendering latency this induces, however it can’t be that much and Huawei says they focus a lot on this area of the implementation. Huawei confirmed that these models are all 16-bit floating point (FP16), which means that for future devices like the Kirin 980, further optimization might occur through using INT8 models based on the new NPU support.

Essentially, because GPU Turbo is in effect a DVFS mechanism that works in conjunction with the rendering pipeline and with a much finer granularity, it’s able to predict the hardware requirements for the coming frame and adjust accordingly. This is how GPU Turbo in particular is able to make claims of much reduced performance jitter versus more conventional "reactive" DVFS drivers, which just monitor GPU utilization rate via hardware counters and adapt after-the-fact.

Thoughts After A More Detailed Explanation

What Huawei has done here is certainly an interesting approach with the clear potential for real-world benefits. We can see how distributing resources optimally across available hardware within a limited power budget will help the performance, the efficiency, and the power consumption, all of which is already a careful balancing act in smartphones. So the detailed explanation makes a lot of technical sense, and we have no issues with this at all. It’s a very impressive feat that could have ramifications in a much wider technology space, eventually including PCs.

The downside to the technology is the per-device & per-game nature of it. Huawei did not go into detail about long it took to train a single game: the first version of GPU Turbo supports PUBG and a Chinese game called Mobile Legends: Bang Bang. The second version, coming with the Mate 20, includes NBA 2K18, Rules of Survival, Arena of Valor, and Vainglory.

Technically the granularity is per-SoC rather than per-device, although different devices will have different limits in thermal performance or memory performance. But it is obvious that while Huawei is very proud of the technology, it is a slow per-game roll out. There is no silver bullet here – while an ideal goal would be a single optimized network to deal with every game in the market, we have to rely on default mechanisms to get the job done.

Huawei is going after its core gaming market first with GPU Turbo, which means plenty of Battle Royale and MOBA action, like PUBG and Arena of Valor, as well as tie-ins with companies like EA/Tencent for NBA 2K18. I suspect on the back of this realization, some companies will want to get in contact with Huawei to add their title to the list of games to be optimized. Our only request is that you also include tools so we can benchmark the game and output frame-time data, please!

On the next page, we go into our analysis on GPU Turbo with devices on hand. We also come across an issue with how Arm’s Mali GPU (used in Huawei Kirin SoCs) renders games differently to Huawei’s competitor devices.

The Claimed Benefits of GPU Turbo: Huawei’s Figures The Difficulty in Analyzing GPU Turbo
POST A COMMENT

64 Comments

View All Comments

  • LiverpoolFC5903 - Wednesday, September 5, 2018 - link

    Very interesting to say the least. The improvements from this, although not as much as promised, are still tangible and will make a difference in supported games.

    Also, alarming to see the quality difference between an Adreno unit and a Mali unit, especially considering they are supposed to be close competitors. I have an S9 with the Mali g72mp18 unit and going by the results on PUBG, it performs much worse than its Adreno counterpart, both in render quality and framerate.

    Hisilicon and Samsung should consider using Powervr gpus again, given the clear inability of the Mali to keep up. I have noticed this in the real world as well, with my LG V20 with a Snapdragon 820 lasting MUCH longer than my S9 while running emulators (PSP and Neo Geo), despite being years old.
    Reply
  • Manch - Wednesday, September 5, 2018 - link

    Maybe its my screen but the Honor Play and the S9 pics make it look like the dude got no undies. LOL Reply
  • umano - Wednesday, September 5, 2018 - link

    Amazing article, thank you. Having a P20 pro ( I don't play games on phone ) that was particularly interesting and I really liked the "ethic" behind words, supporting both customers and the company, asking the latter to do the right thing. I think this is the way professional journalism has to be done.
    Chapeau
    Reply
  • AshokGupta - Wednesday, September 5, 2018 - link

    Hundreds of Huawei's competaters have tried round and round to prove GPU Turbo is a fake junk, and all of them failed. Now you take over their job. Good Luck, Man! Reply
  • GreenReaper - Wednesday, September 5, 2018 - link

    Fake? No. But the reality doesn't exactly match up to the marketing. Reply
  • s.yu - Friday, September 7, 2018 - link

    The reality doesn't match up to the marketing, AT ALL. Good as fake.
    Huawei in all practicality was trying to sell this off as a *universal* performance and efficiency gain of 60%, 30% respectively while in fact it only works on *a handful* of games for about *10%* each. When you're exaggerating your claims by 3x, 6x, it's lying, it's fake.
    Reply
  • AshokGupta - Saturday, September 8, 2018 - link

    If you read the Chinese media, what happens here is just repetition of what happened exactly right after the technology was launched in CHINA. Including this stupid guess of saying it only covers few games. Then approved by many independent tech media it's applicable for all. Your name indicates you are most probably from China. I suppose you should know it. Don't understand why you come here again giving the approved fake comments. Reply
  • s.yu - Monday, September 10, 2018 - link

    Because of the opaque operation of Chinese media. Obviously you're also from China, don't tell me you don't know about the fuss Huawei created buying ads on international sites and then buying fake journalism back in China.
    http://tech.ifeng.com/a/20180710/45057623_0.shtml
    This article was widely spread as legit news but the international content cited was intentionally twisted, it's highly misleading.
    When Huawei buys western ads at least the hosts declare bought articles, in China there's no way of telling real journalism from Huawei's smokescreen, so I put off reaching a conclusion until global availability of the technology.
    Now from Anandtech's analysis and *interview* it's obviously certain that the tech only works on a handful of games, Huawei even admitted that each profile is trained separately then pushed to devices.
    I know a Huawei troll when I see one, I'll be keeping an eye on you in the future.
    Reply
  • ET - Wednesday, September 5, 2018 - link

    What's with the annoying 'Buy the Right CPU' autoplaying video? Reply
  • psychobriggsy - Wednesday, September 5, 2018 - link

    It's really annoying, and on every article, on every page, and it doesn't remember if you pause it on one page then go to the next. Reply

Log in

Don't have an account? Sign up now