Thursday, September 6th 2018
UL Benchmarks Kicks Huawei Devices from its Database over Cheating
UL Benchmarks de-listed several popular Huawei devices from its database over proof of cheating in its benchmarks. Over the month, it was found that several of Huawei's devices, such as P20 Pro, Nova 3, and Play; overclocked their SoCs while ignoring all power and thermal limits, to achieve high benchmark scores, when it detected that a popular benchmark such as 3DMark, was being run. To bust this, UL Benchmarks tested the three devices with "cloaked" benchmarks, or "private benchmarks" as they call it. These apps are identical in almost every way to 3DMark, but lack the identification or branding that lets Huawei devices know when to overclock themselves to cheat the test.
The results were startling. When the devices have no clue that a popular benchmark is being run (or if has no way of telling that 3DMark is being run), it chugs along at its "normal" speed, which is 35% to 36% lower. The rules that bind device manufacturers from advertising UL's 3DMark scores explicitly state that the device must not detect the app and optimize its hardware on the fly to ace the test. Huawei responded to UL by stating that it will unlock a new "performance mode" to users that lets them elevate their SoCs to the same high clocks for any application.
The results were startling. When the devices have no clue that a popular benchmark is being run (or if has no way of telling that 3DMark is being run), it chugs along at its "normal" speed, which is 35% to 36% lower. The rules that bind device manufacturers from advertising UL's 3DMark scores explicitly state that the device must not detect the app and optimize its hardware on the fly to ace the test. Huawei responded to UL by stating that it will unlock a new "performance mode" to users that lets them elevate their SoCs to the same high clocks for any application.
47 Comments on UL Benchmarks Kicks Huawei Devices from its Database over Cheating
Funny thing, and although both of these companies went about it in very different ways, the end result was the same: both tried to show the newer products were better than they actually were.
Is it cheating to utilize an engine fully out while it's being dyno tested?
Just because other apps doesn't utilize the hardware fully out doesn't mean that apps that is made SPESIFICALLY to take full advantage of the hardware is not allowed to do that.
It's like saying 'Booohooo, Spotify for Windows is BARELY using 10% of my computers fully power while Adobe Premiere Pro is taking the full advantage of the CPU, GPU and the RAM in the computer. That must be cheating for Adobe Premiere Pro to utilize the hardware fully out just because Spotify isn't doing it'.
Are you guys figuring out on how stupid that is?
Saying a benchmarking app has to run like a normal app is the dumbest argument to ever happen in the smartphones history when that's not the freaking point of benchmarking apps. A normal app and a benchmarking app is 2 totally different types of apps.
If you want to find out how a smartphone performs under normal usage, then you have to test the smartphones out yourself. Because what one person is saying about normal usage performance can be totally different from what another person sees as normal usage performance. So it's pretty much impossible for benchmarking apps to say what's considered normal usage performance.
Benchmarks are (somewhat) a tool that we use to gauge performance of a product. It's a major part of how video card reviews are done here at TPU, by running actual benchmarking programs like 3DMARK on the card, as well as benchmarking it across a number of popular games. The results show the card's performance. What if the card was being overclocked during the review (and nobody said it was overclocked, or provided stock figures)? That would lead the reader to believe the card is faster than it actually is. In fact, a similar thing happened here on TPU, the manufacturer was sending us cards that were running faster than retail models. they mad
And you are calling this cheating and overclocking?
Overclocking is going over the speeds that is written down for the phones in the specs list.
The same applies to dyno testing engines. When you dyno tests a car engine, then you pushes it to it's highest RPM number possible. If you are doing the same on normal traffic, then you wont get any different RPM while being there. You get the same limit on how high up you can take the RPM on the engine. It doesn't matter where you are. And what do you think happens with the GPU's on computers while they are running 3DMark?
Yes, they get's up to 100% usage and the higest possible clock speeds that are written down in the specifications. Do you see the GPU only uses 70% of the power the GPU can deliver when running the latest 3DMark?
No you don't. 3DMark that is a benchmarking program is specifically made to push the GPU's to the limit of what the specs are saying it can deliver. So why should it be any different on smartphones?
Lastly. The thing with vendors giving out overcklocked GPU's to testers is not the same as utilizing the clock speeds and power of the genuine hardware fully out to what the specifications says it has.
Huawei didn't break the law like Volkswagen did. They broke UL's policy. ISPs are frequently caught doing the same when they detect a speed test running.
UL needs to try harder to obfuscate its tests.
The simplist solution is to record metrics of the system throughout the test. If clockspeeds go wild then those testing the system know the manufacturer is cheating it. They could also make the test very long to run so that if the system can't handle the clocks the clocks for extended times, it will be forced to thermal throttle which the metrics can catch.
CAN YOU?
What I dont get is how so many people dont understand the implications here. You might as well just ask them for their wallet and walk away.
If the clock speeds was going over those listed specs, then Samsung would have cheated as well. But they haven't done that so far.
No you don't. Because that program is not made to run at those speeds. However, benchmarking apps is made to do so. Just because other apps / programs doesn't utilizes the hardware fully out doesn't mean other apps / programs that is made to do so can't run the hardware at full speed. Putting every programs / apps under the same boat is not a good idea.