The Mobile CPU Core-Count Debate: Analyzing The Real World
by Andrei Frumusanu on September 1, 2015 8:00 AM EST- Posted in
- Smartphones
- CPUs
- Mobile
- SoCs
Real Racing 3 Playing
The little cores see at least 3 major threads loaded onto them. The 4th core is doing some work as well, but quite a bit less than the first 3. What is extremely interesting here is the frequency distribution graph: The cores don't settle for any one frequency and make use full use of the full range of the cluster.
The behaviour of the big-cluster is clear-cut. There's only 1 significant thread that ever gets placed on the big cores. This is an ideal scenario for a big.LITTLE architecture as would there have been more than 1 thread, that secondary thread would have suffered from diminished efficiency as it wouldn't be able to run at the best perf/W frequency due to ARM's synchronous frequency planes between CPUs in a cluster.
The power-distribution graph does show the worrying anomaly of seeing CPU4 come out its power-collapse state for very small periods of time. This would be a source of inefficiency of either the scheduler or the CPUIdle framework needing to wake up that core for the sake of simple clean-up work instead of real load.
I think it's pretty safe to come to the conclusion that Real Racing 3 is coded with quad-core CPUs in mind as we see exactly 4 major threads loading the SoC's CPUs to various extent.
157 Comments
View All Comments
Hrobertgar - Tuesday, September 1, 2015 - link
Your spikes on the video recording appear to be every ~4 secs of video, could the CPU spikes be app / memory related?badchris - Tuesday, September 1, 2015 - link
Thank you for this excited article.And one problem,how do we explain 2 big core Snapdragon 808 is more efficient than 4 big core Snapdragon 810?Andrei Frumusanu - Tuesday, September 1, 2015 - link
You cannot make comparisons between different SoCs even if they have the same CPU IP and the same manufacturing process. The S808 is different from the S810 which are again different from Nvidia's X1 even if all 3 have A57 cores on TSMC 20nm.badchris - Tuesday, September 1, 2015 - link
nvm,i should realize this comparison is not scientific.metafor - Tuesday, September 1, 2015 - link
The S808 and S810 should be fairly similar though. That's not to say you can say that the only difference is the CPU configuration but a similar study on what the behavior is like on a different SoC with fewer cores would be helpful.Threading isn't 100% free and neither is thread migration. It might be good to take a look at just what the S810 is doing over time compared to the S808 in terms of CPU activity.
Andrei Frumusanu - Tuesday, September 1, 2015 - link
I have data on all of that... It's just in need of being published in an orderly fashion.kpkp - Tuesday, September 1, 2015 - link
There are quite few other differences beside the 2 cores, starting with the memory controller.badchris - Tuesday, September 1, 2015 - link
thx for your notice.there're something i forgotnpp - Tuesday, September 1, 2015 - link
As an ex-Android developer I can remember that the SDK not only encourages, but sometimes straight out enforces extensive usage of threads. For example, around API level 14/15, making a network request in the main thread would throw an exception, which may seem obvious to experienced developers but wasn't enforced in earlier versions. This is a simple example, but having the API itself pushing towards multi-threaded coding has a positive effect on the way Android developers build their apps. I'm not sure then why Google's own browser would be surprising for its usage of high thread counts - even a very basic app would be very likely to spawn much more than 4 threads nowadays.Arbie - Tuesday, September 1, 2015 - link
"I was weary of creating this table..."That's not surprising, after all your work ;-).
Terrific article BTW which is up to Anandtech's long-time standards. Seems like a mini master's thesis.