All New Gaming Test Suite

On the gaming side, the changes are rather more substantial. We’ve decided to wipe the slate clean and select all new titles for 2012. Actually, that’s not entirely true—I’ve been running tests with Civilization V and Total War: Shogun 2 for a while, but we’re now updating the settings for the benchmarks and all laptop editors will test with the following games and settings. Note that we tried to get a good selection of game genres in the list; depending on how you want to classify the games, we have four games representing first person/third person action games, two strategy games, one RPG, and one simulation/driving game. We also have representatives of several major engines—Unreal Engine 3, Frostbite 2, and Source being the most noteworthy. We’ve tried to overlap our desktop gaming suite, and while we won’t use identical test suites, we do overlap on six of the titles.

The other big change is that we’re ramping up the quality settings this year. Previously, we had both Low/Minimum and Medium settings at 1366x768. Unfortunately, we often ran into games where minimum quality looked, frankly, awful; in other games the difference between our Low and Medium settings ended up being negligible. For 2012, then, we’ve decided to skip the “minimum” detail testing and select settings that we feel most gamers will actually like using. [Update: we're changing the naming convention to avoid name space conflicts.]

Our Value setting for the test suite loosely corresponds to last year’s “Medium” settings, all run at 1366x768; our new Mainstream settings bump the resolution to 1600x900 and increase quality to roughly match last year’s “High” settings; finally, our Enthusiast settings enable 4x MSAA in all seven titles and increase the resolution to 1920x1080, basically matching the “Ultra” settings of 2011. We’ve tested each game on several setups with the goal of choosing settings that will result in reasonable quality and performance differences between the three settings. With that out of the way, here’s a rundown of the games.

Batman: Arkham City: The sequel to 2009’s Batman: Arkham Asylum, Arkham City continues the story of the Dark Knight with a free roaming playground to explore at your leisure. Graphically, the game is similar to the original title, only now the PC version has (properly working) DX11 support. The DX11 features come at a serious performance cost, however, so our Value test setting will leave them off. Note that even in the game settings, DX11 features are disabled unless you choose the maximum “Extreme” preset—and you’ll really need a beefy PC to handle the workload at that point!

We’re using the built-in benchmark so that readers can compare our results with their own hardware. Our Value settings use the Medium defaults; for Mainstream we switch to the High defaults and enable DX11 features; finally, for Enthusiast we use the Extreme preset and enable 4x MSAA. If you’re wondering, leaving DX11 disabled largely removes performance differences between the various settings, at least on moderate hardware. We tested at 1366x768 with Low, Medium, High, and Very High settings and found the average frame rates only dropped around 20%; enabling DX11 on any of the other modes results in a drop of around 40%.

Also worth noting is that Batman supports PhysX, but we won’t be testing with PhysX as that’s only available on NVIDIA hardware. That said, we do want to mention that PhysX definitely improves the gaming experience, with enhanced fog effects, more debris, cloth effects, and certain weapons (e.g. the freeze gun in one scene) fire polygons instead of sprites/textures. My own impression is that Batman with PhysX enabled and DX11 disabled generally looks better than Batman with DX11 enabled and PhysX disabled. If you want all of the goodies enabled, you’ll need very high-end hardware, beyond what most laptops can support—we’d suggest GTX 560M SLI as a bare minimum for the Extreme preset with PhysX enabled at 1080p.

Battlefield 3: We’re switching our Battlefield choice from Bad Company 2 to Battlefield 3, though in practice performance is frequently similar. For this title, we’re using FRAPS and using a two minute tank “on rails” sequence from the Thunder Run single player mission. Performance in the single player missions is highly variable depending on the level, and multi-player is even more so, but we need something that provides consistency between test runs. The Frostbite 2 engine puts quite a hefty load on your GPU, and if you want all the eye candy enabled you’ll need more than your typical mobile GPU. For BF3, our Value settings use the Medium preset; Mainstream uses the High preset; and Enthusiast uses the Ultra preset. BF3 also supports DX10 and DX11, and we leave the DirectX version support set to “Auto”; outside of Intel’s HD 2000/3000 hardware, that means all laptops will run in DX11 mode.

Civilization V: Civ5 is an interesting title in that the use of driver command lists allowed NVIDIA to optimize performance and get a healthy boost in frame rates not long after it launched. AMD has yet to implement command lists (AFAIK), but as we showed in our HD 7970 review, there may be other factors at play. We continue to use the built-in LateGameView benchmark, and it’s worth noting that the turn-based nature of Civ5 makes lower frame rates more palatable than in shooters. For our detail settings, Value has all of the video settings at Low; Mainstream uses High settings on everything (with the High detail strategic view enabled); Enthusiast is the same, only with 4x MSAA enabled. We use the DX10/11 executable and set the configuration file to allow the use of both SM41 and SM50 (Shader Model 4.1/DX10.1 and Shader Model 5.0/DX11).

DiRT 3: Our replacement to DiRT 2 is a simple update to the latest title in the series. As with BF3, this time we’re letting all systems use DX11 hardware—early indications are that DX11 improves performance at Low to High presets, but it creates a pretty massive performance drop at the Ultra preset. We run the in-game benchmark. Our Value setting will use the Medium preset; Mainstream will use the High preset, and Enthusiast will use the Ultra preset with 4x MSAA. (Note that just moving the detail slider from High to Ultra results in a ~40% drop in frame rate while adding 4xAA accounts for another 10-15%, so there’s a pretty sizeable gap between our Mainstream and Enthusiast results.)

Elder Scrolls: Skyrim: Skyrim is one of two titles in our updated list that doesn’t (currently) support DX11. There may be a patch at some point to improve the situation—there are some old conflicting statements from earlier this year where Skyrim was claimed to support DX11—but for now we’re using whatever the game has in the latest patch (e.g. as of early January, 2012). Texture quality is not one of the strong points of Skyrim, with frequently blurry textures (thanks to the console cross-platform nature of development), but at least dragons are now properly attacking with the latest updates.

As far as benchmarking goes, Skyrim appears to be far more taxing on the CPU side of things than on graphics, particularly for desktop gamers, but mobile graphics hardware is several rungs down the performance ladder so we’re going to use it. Our Value setting uses the Medium preset with antialiasing off, anisotropic filtering set to 4x, and texture quality set to medium with FXAA disabled—the latter basically uses a full screen blur filter to remove jaggies while increasing blurriness. For Mainstream, we use the High preset and turn antialiasing off. Last, for Enthusiast, we use the Ultra preset but drop antialiasing to 4xAA. Note that enabling antialiasing, at least on a GTX 560M, appears to have a minimal impact on performance; however, that may not always be the case so we’re sticking with our standard of no-AA at Value and Mainstream settings and 4xAA at the Enthusiast settings.

Update: The 1.4 patch of Skyrim dramatically improved performance, and Bethesda also released a high resolution texture pack for the PC. We will use the high resolution texture pack at the Mainstream and Enthusiast settings going forward.

Portal 2: Portal 2 is our representative of the Source engine, and like the other Source games released so far from Valve, that means no DX10 or DX11 support. That doesn’t mean the game isn’t graphically demanding, though it may have different bottlenecks than many of the other titles. We use an in-house demo file where the player combines speed gel with portals, switches, and an Excursion Funnel to advance through the map. Like most Source engine games, frame rates tend to be quite a bit higher than other titles. Our Value settings use trilinear filtering with multicore rendering enabled, and Shader/Effect/Model/Texture detail are all at Medium (with paged pool memory available set to High). Mainstream maxes out all of the settings with the exception of antialiasing, which remains off, and Enthusiast adds 4x MSAA to the mix.

Total War: Shogun 2: Wrapping up our gaming list is Total War: Shogun 2, a game which holds the dubious honor of being the slowest loading title in our test suite—by a large margin. Initially launched as a DX9 title, a patch later added DX10/11 support. Graphically, it’s difficult to tell what differences the various rendering modes have, but DX11/SM5.0 does appear to have substantially better SSAO. The patch that added DX11 features also added a built-in benchmark, the introduction to one of the scenarios, which we use for our testing. Our Value settings use the Medium preset, Mainstream will use the High preset, and Enthusiast uses the Very High preset with 4x MSAA enabled. In addition to enabling the DX11 engine, all of our settings files are set to use SM5.0 code where applicable.

2012: Meet Our New Mobile Benchmark Suite Benchmarks and Closing Thoughts
POST A COMMENT

48 Comments

View All Comments

  • losonn - Saturday, January 07, 2012 - link

    A few thoughts for someone who went out and bought an XPS 15 L502x immediately after reading your L501x / L502X reviews.

    I've got to agree with an earlier poster that I always skim the mobile reviews here at AnandTech, but frequently find myself referring to laptop reviews elsewhere (including notebookcheck.net) for better formatted specs, more helpful product pics, more dense information, specific noise / heat info, power adapter weight / pics, build quality opinions, etc.. Your current article format is perfect for pro hardware and new tech reviews (7970 review, vertex 3 review, etc.) but your consumer / mobile reviews could be a lot more dense ( http://www.notebookcheck.net/Review-Samsung-305U1A... ) or at least a little bit less ugly (see http://www.theverge.com/2012/1/4/2677801/hp-envy-1... ).

    This may have more to do with your page layout formatting than anything else though, even on my 1080p screen I have to do a -lot- of scrolling on your site and the dropdown navigation for your articles is a bit clunky. This site has a solid reputation and I trust the recommendations of Anand more than any other tech site, but you could really use a facelift that was a little bit less all-business-all-the-time a la the asus transformer prime :)

    As for benchmarks a sleep, hibernate, wake from sleep, wake from hibernate could be really helpful.
    Reply
  • coolhardware - Sunday, January 08, 2012 - link

    Nice points here.

    +Notebookcheck definitely does a great job. :-)

    -To me though, theverge layout doesn't read/flow nearly as nicely as Anandtech does right now... :-(
    Reply
  • colonelclaw - Saturday, January 07, 2012 - link

    Ok, it's very niche, but if you would like to try a real ball-busting CPU test I would recommend a combination of 3DSMax 2012 and VRay. It's the most popular combination in the 3D world at the moment and nothing sucks the life out of a PC better than a VRay render.
    Plus these days VRay is available for Max, Maya, Sketchup, Rhino, Softimage and probably more. It also runs on OSX under the apps available for that platform. Something to think about.
    Reply
  • RoninX - Sunday, January 08, 2012 - link

    I'd like to request a benchmark for battery life when gaming. This wouldn't need to be the full set of gaming tests, but just one or two games run at value, mainstream, and enthusiast settings.

    I own a high-end gaming desktop for gaming at home, but I also do a lot of business travel, and I like to having a laptop that can run games during long waits at the airport. Right now, I have a Dell XPS 15, which does pretty well, but in the future, I'd be interested to know the tradeoffs in battery life and performance for dedicated gaming laptops (e.g. Alienware) as well as high-end mainstream laptops (e.g. XPS).

    I realize that laptop battery life is pretty poor across the board when running demanding 3D games, but for me, there's a big difference between 90 minutes (which I get from my XPS 15), and say, 20 minutes. With a spare battery, the former gives you 3 hours of gaming time -- more than enough for typical flight delays.
    Reply
  • JarredWalton - Sunday, January 08, 2012 - link

    Besides tripling the number of battery life tests (ouch!), there are other factors to consider. First, you need a "game test" that's repeatable. If you play Batman for a couple hours on a laptop, and then play it again for a couple hours, I'm not sure the load will be the same. Then again, I doubt that the difference between Value/Mainstream/Enthusiast and even different games will be much -- they all activate the GPU and put a decent load on the CPU, so they're pretty much the worst-case scenario.

    How about this one: start battery life timer, open Skyrim save, and let the laptop just sit there until it runs dead. (Note that Skyrim will have the camera start circling the player in third person mode after a minute or two of inactivity.) That's about as consistently repeatable as I can come up with, though not necessarily a realistic test case. I would also be willing to check battery life on one laptop at our three detail settings to verify whether there's a difference in battery life or not. I might also check out some other games to see if they show variance in battery life (and if I can find a good looping test). Let me see if we can come up with something reasonable -- and we'd also want to test performance on battery power, as most higher end GPUs really clamp down on maximum clocks when on DC power.
    Reply
  • RoninX - Sunday, January 08, 2012 - link

    That Skyrim test sounds good to me, and testing performance on battery power would also be very useful.

    Thanks!
    Reply
  • joelypolly - Monday, January 09, 2012 - link

    Would be nice to have a web based comparison table so you can compare things like viewing angles, general specs and performance. Reply
  • JamesAnthony - Thursday, January 12, 2012 - link

    I was wondering if you would be able to add to the video tests for games more notebooks with Nvidia Quadro and ATI FireGL video cards, as Dell specifically & several other vendors tend to use those cards as the only option available in business level or professional level laptops & it would be nice to know how well you could expect your $1500 Business laptop $3,000 CAD laptop to play some games when you get home with it.

    For Example
    AMD FirePro M8900 Mobility Pro
    NVIDIA Quadro 3000M
    NVIDIA Quadro 4000M
    NVIDIA Quadro 5010M

    AMD FirePro M5950 Mobility Pro
    NVIDIA Quadro 1000M
    NVIDIA Quadro 2000M

    NVIDIA NVS 4200M
    Reply

Log in

Don't have an account? Sign up now