Intel's Haswell - An HTPC Perspective: Media Playback, 4K and QuickSync Evaluated

by Ganesh T S on June 2, 2013 8:15 PM EST

Decoding and Rendering Benchmarks

Our decoding and rendering benchmarks consist of standardized test clips (varying codecs, resolutions and frame rates) played back through MPC-HC. GPU usage is tracked through GPU-Z logs, and power consumption at the wall is also reported. The former provides hints on whether frame drops could occur, while the latter is an indicator of the efficiency of the platform for the most common HTPC task: video playback.
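
As a rough illustration of how a GPU-Z log can be reduced to one utilization figure per clip, the sketch below averages a single column of a sensor log. It assumes the log is comma-separated text with a header row and a column named "GPU Load [%]". The column name, the helper summarize_gpu_load and the sample values are illustrative assumptions, not measurements from this review.

```python
import csv
import io
from statistics import mean


def summarize_gpu_load(log_text, column="GPU Load [%]"):
    """Return (average, peak) of one numeric column in a GPU-Z style log.

    Assumption: comma-separated text, first row is the header, and the
    column name matches after surrounding whitespace is stripped.
    """
    reader = csv.reader(io.StringIO(log_text))
    header = [name.strip() for name in next(reader)]
    idx = header.index(column)

    samples = []
    for row in reader:
        if len(row) <= idx:
            continue  # blank or truncated line
        cell = row[idx].strip()
        try:
            samples.append(float(cell))
        except ValueError:
            continue  # sensor had no reading for this sample

    if not samples:
        raise ValueError("no usable samples in column " + column)
    return mean(samples), max(samples)


# Made-up sample values, only to show the expected layout.
SAMPLE_LOG = """Date , GPU Load [%] ,
2013-06-02 20:15:01 , 21 ,
2013-06-02 20:15:02 , 25 ,
2013-06-02 20:15:03 , 23 ,
"""

avg_load, peak_load = summarize_gpu_load(SAMPLE_LOG)
print(f"average {avg_load:.1f}%, peak {peak_load:.1f}%")
```

A sustained average close to 100% during playback would be the hint of possible frame drops mentioned above; the peak value alone can be misleading because short spikes are normal.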

Enhanced Video Renderer (EVR) / Enhanced Video Renderer - Custom Presenter (EVR-CP)

The Enhanced Video Renderer is the default renderer made available by Windows 8. It is lean in its use of system resources, since most of the work is offloaded directly to the GPU drivers. EVR is mostly used in conjunction with native DXVA2 decoding. The GPU is not taxed much by the EVR despite hardware decoding also taking place. Deinterlacing and other post processing aspects were left at the default settings in the Intel HD Graphics Control Panel (and these are applicable when EVR is chosen as the renderer).

EVR-CP is the default renderer used by MPC-HC. It is usually used in conjunction with MPC-HC's video decoders, some of which are DXVA-enabled. However, for our tests, we used the DXVA2 mode provided by the LAV Video Decoder. In addition to DXVA2 Native, we also used the QuickSync decoder developed by Eric Gur (an Intel applications engineer) and made available to the open source community. It makes use of the specialized decoder blocks available as part of the QuickSync engine in the GPU.

Power consumption shows a tremendous decrease across all streams relative to our passive Ivy Bridge HTPC. Admittedly, that build uses a 55W TDP Core i3-3225, but, as we will see later, the power consumption at full load for the Haswell build is very close to that of the Core i3-3225 build despite the lower TDP of the Core i7-4765T.

In general, using the QuickSync decoder results in higher power consumption because the decoded frames are copied back to DRAM before being sent to the renderer. With native DXVA decoding, the frames are passed directly to the renderer without the copy-back step. The odd man out in the power numbers is the interlaced VC-1 clip, where QuickSync decoding is more efficient than 'native DXVA2'. This is because the open source native DXVA2 decoders currently have no support for interlaced VC-1 on Intel GPUs, and hence, that decode is done in software. On the other hand, the QuickSync decoder is able to handle it with the VC-1 bitstream decoder in the GPU.
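
To put the copy-back step in perspective, the following back-of-the-envelope sketch estimates how much decoded video data has to cross into system memory each second. It assumes the decoder outputs 8-bit NV12 (4:2:0, 1.5 bytes per pixel) and counts a single copy of each frame; both are assumptions for illustration, and the function names are hypothetical.

```python
def nv12_frame_bytes(width, height):
    """Size of one 8-bit NV12 frame: a full-resolution luma plane plus
    an interleaved chroma plane at half resolution in each dimension."""
    return width * height * 3 // 2


def copyback_mb_per_s(width, height, fps):
    """Decoded-frame traffic in MB/s (1 MB = 10**6 bytes), one copy per frame."""
    return nv12_frame_bytes(width, height) * fps / 1e6


for label, (w, h, fps) in {
    "1080p23.976": (1920, 1080, 23.976),
    "1080p60": (1920, 1080, 60),
    "2160p30": (3840, 2160, 30),
}.items():
    print(f"{label}: {copyback_mb_per_s(w, h, fps):.1f} MB/s")
```

Under these assumptions a 1080p60 stream moves roughly 187 MB/s of decoded frames, a small fraction of DDR3-1600 bandwidth but still extra memory traffic that the native DXVA2 path avoids.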

The GPU utilization numbers follow a similar track to the power consumption numbers. EVR is very lean on the GPU, as discussed earlier, and the utilization numbers bear that out. QuickSync appears to stress the GPU more, possibly because of the copy-back step for the decoded frames.

madVR

Videophiles often prefer madVR as their renderer because of the choice of scaling algorithms available as well as myriad other features. In our recent Ivy Bridge HTPC review, we found that with DDR3-1600 DRAM, it was straightforward to get madVR working with the default scaling algorithms for all material at 1080p60 or lower. In the meantime, Mathias Rauen (developer of madVR) has added more features. In order to alleviate the ringing artifacts introduced by the Lanczos algorithm, an option to enable an anti-ringing filter was introduced. A more intensive scaling algorithm (Jinc) was also added. Unfortunately, enabling either the anti-ringing filter with Lanczos or choosing any variant of Jinc resulted in a lot of dropped frames. Haswell's HD4600 is simply not powerful enough for these madVR features.
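
For readers wondering where Lanczos ringing comes from, the sketch below upsamples a one-dimensional hard edge with a 3-tap Lanczos kernel. The kernel has negative lobes, so the interpolated values dip below the darkest input and overshoot the brightest one on either side of the edge. This is a generic textbook illustration, not madVR's implementation, and it does not attempt to reproduce madVR's anti-ringing filter.

```python
import math


def lanczos(x, a=3):
    """Lanczos kernel with `a` lobes: sinc(x) * sinc(x / a) for |x| < a."""
    if x == 0:
        return 1.0
    if abs(x) >= a:
        return 0.0
    px = math.pi * x
    return a * math.sin(px) * math.sin(px / a) / (px * px)


def resample(samples, position, a=3):
    """Interpolate `samples` at a fractional position.

    Weights are normalized to sum to 1 and indices past either end
    are clamped to the nearest valid sample.
    """
    base = math.floor(position)
    total = 0.0
    weight_sum = 0.0
    for i in range(base - a + 1, base + a + 1):
        w = lanczos(position - i, a)
        clamped = min(max(i, 0), len(samples) - 1)
        total += w * samples[clamped]
        weight_sum += w
    return total / weight_sum


# A hard edge: black (0.0) followed by white (1.0).
step = [0.0] * 8 + [1.0] * 8

# 4x upscale by sampling at quarter-pixel positions.
upscaled = [resample(step, i / 4) for i in range(4 * (len(step) - 1) + 1)]

print(f"input range:  {min(step):.3f} .. {max(step):.3f}")
print(f"output range: {min(upscaled):.3f} .. {max(upscaled):.3f}")
```

The output range extends below 0 and above 1 even though the input never leaves that interval. On screen, that undershoot and overshoot appears as halos next to sharp edges, which is the artifact an anti-ringing pass tries to suppress at the cost of additional GPU work.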

~~It is not possible to use native DXVA2 decoding with madVR because the decoded frames are not made available to an external renderer directly~~. (Update: It is possible to use DXVA2 Native with madVR since v0.85. Future HTPC articles will carry updated benchmarks.) To work around this issue, LAV Video Decoder offers three options. The first is software decoding. The other two are QuickSync and DXVA2 Copy-Back; in either case, the decoded frames are brought back to system memory for madVR to take over.

One of the interesting features integrated into recent madVR releases is the option to perform DXVA scaling. This is particularly interesting for HTPCs running Intel GPUs because the Intel HD Graphics engine uses dedicated hardware to implement support for the DXVA scaling API calls. AMD and NVIDIA apparently implement those calls using pixel shaders. In order to obtain a frame of reference, we repeated our benchmark process using DXVA2 scaling for both luma and chroma instead of the default settings.

One of the interesting aspects to note here is the fact that the power consumption numbers show a much larger shift towards the lower end when using DXVA2 scaling. This points to more power efficient updates in the GPU video post processing logic.

DXVA scaling results in much lower GPU usage, for SD material in particular, with a corresponding decrease in average power consumption. Users with Intel GPUs can continue to enjoy other madVR features while giving up the wide choice of scaling algorithms.

95 Comments

jhoff80 - Sunday, June 2, 2013 - link
This article and the power consumption stats just make me wish that Intel would just make it easier to get a hold of their -T chips for end users. A 35W or 45W chip would be great for me, but the only thing that has full retail availability is the 65W one. (And it's not because it's so early in launch, it's always been way too difficult to get -T versions.)
EnzoFX - Sunday, June 2, 2013 - link
Not to mention expensive! You get the same results by undervolting/underclocking, typically.
Laststop311 - Monday, June 3, 2013 - link
You are correct in a way but you could undervolt the T series as well and get better thermal performance then the 65 watt version. atleast that is my experience. If i was making an HTPC i would use the i7-4770t or the i7-4650t if thats the equivalent of the i7-3770t this year. The power consumption is amazing and proper 24hz is great for 1080p24 playback. upgrade to the htpc just isn't in my budget right now and ivy bridge + gt 660 isnt a bad htpc. MY PC budget is going to an ultrabook upgrade this year. The increased battery life and performance is insane. i7-980x desktop still does not have a large enough upgrade to make it worth it. Ivy bridge-E is not THAT much faster and I dont think even haswell-e next year will be enough to upgrade the desktop.
Death666Angel - Tuesday, June 4, 2013 - link
"but you could undervolt the T series as well and get better thermal performance then the 65 watt version."
Not to the same extent. The T series will already be driving much tighter voltages than normal SKUs. While you may save 15% power consumption by undervolting normal SKUs, undervolting already power efficient SKUs would result in sub 5% probably.
vnangia - Sunday, June 2, 2013 - link
Well, it helps that there are 35W parts this time around - at least on the timeline. IVB didn't get any 35W parts, so the HTPC is still on SNB, and yeah, I could definitely use the incremental improvements to QuickSync.
jhoff80 - Sunday, June 2, 2013 - link
Yes, but I'm not talking about only 35W specific chips. The i7-3770T was just as difficult to get as any other -T series chip, because they don't sell them to end-users directly.
vnangia - Sunday, June 2, 2013 - link
I'm agreeing with you! What I was trying to say is, Intel did announce low-TDP SNB parts and delivered: SNB had a bunch of -T versions available to end-users at both low (G4xx, G5xxT, 2100T, 2120T) and high end (2390, 2500T). I bought my 2100T at Microcenter B&M for instance.

By contrast, Intel didn't announce any end-user -T (and just a handful of -S) parts and we saw that IVB had virtually no -T parts available. I'm optimistic that now they've announced a few -T parts at the high end, we might actually see these materialize in the retail chain and hopefully it bodes well for -T parts at the low end.

Fortunately (*knocks on wood*) the current SNB-based HTPC is still going strong, so I don't feel the need to upgrade. If and when I do, though, I expect that it won't be so clear cut - I may end up going with AMD's lineup, despite the relative paucity of AMD ITX boards.
jhoff80 - Monday, June 3, 2013 - link
Sorry, I must've misunderstood.
Krysto - Monday, June 3, 2013 - link
This is insane. Why use a $400 Intel Haswell media box for 4k video, when you can use the much cheaper and much more efficient Mali T622-based media boxes that should be appearing next year?

http://blogs.arm.com/multimedia/977-a-new-branch-f...
NirXY - Monday, June 3, 2013 - link
"should be appearing next year"
