Mobile Trinity Lineup

Trinity is of course coming in two flavors, just like Llano before it. On the desktop, we’ll have Virgo chips, but those are coming later this year (around Q3); right now, Trinity is only on laptops. On laptops the codename for Trinity is Comal. AMD has also dropped wattages on their mobile flavors, so where Llano saw 35W and 45W mobile parts, with Comal AMD will have 17W, 25W, and 35W parts. (The desktop Trinity chips will apparently retain their 65W and 100W targets.) There aren’t a ton of mobile Trinity chips launching today; instead, AMD has five different APUs and each one targets a distinct market segment. Here’s the quick rundown:

AMD Trinity A-Series Fusion APUs for Notebooks
APU Model A10-4600M A8-4500M A6-4400M A10-4655M A6-4455M
“Piledriver” CPU Cores 4 4 2 4 2
CPU Clock (Base/Max) 2.3/3.2GHz 1.9/2.8GHz 2.7/3.2GHz 2.0/2.8GHz 2.1/2.6GHz
L2 Cache (MB) 4 4 1 4 2
Radeon Model HD 7660G HD 7640G HD 7520G HD 7620G HD 7500G
Radeon Cores 384 256 192 384 256
GPU Clock (Base/Max) 497/686MHz 497/655MHz 497/686MHz 360/497MHz 327/424MHz
TDP 35W 35W 35W 25W 17W
Package FS1r2 FS1r2 FS1r2 FP2 FP2
DDR3 Speeds DDR3-1600
DDR3L-1600
DDRU-1333
DDR3-1600
DDR3L-1600
DDRU-1333
DDR3-1600
DDR3L-1600
DDRU-1333
DDR3-1333
DDR3L-1333
DDRU-1066
DDR3-1333
DDR3L-1333
DDRU-1066

As a Bulldozer-derived architecture, Trinity uses CPU modules that each contain two Piledriver CPU cores with a shared FP/SSE (Floating Point) unit. From one perspective, that makes Trinity a quad-core or dual-core processor; others would argue that it’s not quite the same as a “true” quad-core setup. We’re not going to worry too much about the distinction here, though, as we’ll let the performance results tell that story. Compared to Llano’s K10-derived CPU core, clock speeds in Trinity are substantially higher—both the base and Turbo Core clocks. The top-end A10-4600M has a base clock that’s 53% higher than the 1.5GHz A8-3500M we reviewed when Llano launched, while maximum turbo speeds are up 33%. Unfortunately, while clock speeds might be substantially higher, Trinity’s Piledriver cores have substantially longer pipelines than Llano’s K10+ cores; we’ll see in the benchmarks what that means for typical performance.

The GPU side of the equation is are also substantially different from Llano. Llano used a Redwood GPU core (e.g. Radeon 5600 series) with a VLIW5 architecture (e.g. the Evergreen family of GPUs), and the various APUs had either 400, 320, or 240 Radeon cores. Trinity changes out the GPU core for a VLIW4 design (Northern Islands family of GPU cores), and this is the only time we’ve seen AMD use VLIW4 outside of the 6900 series desktop GPUs. The maximum number of Radeon cores is now 384, but we should see better efficiency out of the design, and clock speeds are substantially higher than on Llano—the mobile clocks are typically 55-60% higher. Again, how this plays out in terms of actual performance is something we’ll look at momentarily.

Looking at the complete lineup of Trinity APUs, it’s interesting to see AMD using a new A10 branding for the top models while overlapping the existing A8 and A6 brands on lower spec models. We only have the A10-4600M in for testing right now, but AMD provided some performance estimates for the various performance levels. The A10-4600M delivers 56% better graphics performance and 29% better “productivity” performance than the A8-3500M—note that we put productivity in quotes because it’s not clear if AMD is talking specifically about CPU performance or some other metric. The new A8-4500M delivers 32% faster graphics performance than the A8-3500M and 19% higher productivity, which appears to be why it gets the same “A8” classification. Finally, even the single-module/dual-core A6-4400M delivers 16% better graphics than the A8-3500M and 5% higher productivity. I suspect that the various percentages AMD lists are more of an “up to” statement as opposed to being typical performance improvements, as it seems unlikely that 192 VLIW4 cores at 686MHz could consistently outperform 400 VLIW5 cores at 444MHz.

If we consider target markets, the A10-4600M will be the fastest Trinity APU for now, and it should go into mainstream laptops that will provide a well rounded experience with the ability for moderate gaming along with any other tasks you might want to run. The A8-4500M takes a pretty major chunk out of the GPU (one third of the GPU cores are gone along with a slight drop in maximum clock speed) while maintaining roughly 80% of the CPU performance, so it can fit into slightly cheaper laptops but will likely drop gaming performance from “moderate” to “light”. The A6-4400M ends up as the extreme budget offering, with higher clocks on the CPU making up for the removal of two cores; the GPU likewise gets a slight trim relative to the A8-4500M, and we’re now down to half the graphics performance potential of the A10-4600M. All of the standard voltage parts support up to DDR3-1600 memory, with low voltage DDR3-1600 and ultra low voltage DDR3-1333 also supported.

The other two APUs are low voltage and ultra low voltage parts, which should work well in laptops like HP’s “sleekbooks”—basically, they’re for AMD-based alternatives to ultrabooks. The A10-4655M has about 87% of the CPU performance potential of the A10-4600M, with 70% of the GPU performance potential, and it can fit into a 25W TDP. The A6-4455M drops the TDP to 17W, matching Intel’s ULV parts, but again the CPU and GPU cores get cut. This time we get two Piledriver cores, 256 Radeon cores, and lowered base and maximum clock speeds. The low/ultra low voltage parts also drop support for DDR3-1600 memory, moving all RAM options down one step to DDR3-1333, low voltage DDR3-1333 and ultra low voltage DDR3-1066.

The final piece of the puzzle for any platform is the chipset. AMD is using their A70M (Hudson M3) chipset, which is the same chipset used for Llano. That’s not really a problem, though, as the chipset provides everything Trinity needs: it has support for up to six native SATA 6Gbps ports, four USB 3.0 ports (and 10 USB 2.0 ports), RAID 0/1 support, and basically everything else you need for a mainstream laptop. PCI Express support in Trinity remains at PCIe 2.0, but that’s not really a problem considering the target market. PCIe 3.0 has been shown to improve performance in some GPGPU workloads with HD 7970, but that’s a GPU that provides nearly an order of magnitude more compute power (over 7X more based on clock speeds and shader count alone).

That takes care of the overview of AMD’s Mobile Trinity lineup, and Anand has covered the architectural information, so now it’s time to meet our prototype AMD Trinity laptop.

Improved Turbo, Beefy Interconnects and the Trinity GPU Meet the AMD Trinity/Comal Prototype
Comments Locked

271 Comments

View All Comments

  • mdeo - Tuesday, May 15, 2012 - link

    " For some of these applications, we don’t have any good way of measuring performance across a wide selection of hardware, and for some of those where benchmarks are possible I’ve run out of time to try to put anything concrete together"

    Please wait and spend the required time before you post results.
    Also, where are the graphs for WinZip, GIMP filters (19 of them ..you deemed 5 that you would use). Graphs make it easy to read that Trinity beats Intel chips in GIMP and equals in WinZip.

    This makes me wonder why should I trust Anandtech more then tomshardware reviews ...
  • JarredWalton - Tuesday, May 15, 2012 - link

    You apparently have no idea how much time goes into putting together a review and running all the benchmarks. Let's just say that after running (and rerunning) benchmarks for much of the last month on a variety of laptops, I finished a couple of graphs right at the 12:01 AM NDA time. That was after getting about ten hours of sleep total over the weekend, and never mind the fact that I've had a horrible cold the past week.

    Every new benchmark needs to be created and evaluated to see if it's useful. GIMP's new "Noise Reduction" and "Blur" functions can use OpenCL, but so can "Checkerboard". Um, really? We need OpenCL to fill an image with a checkerboard?

    Here are a few GIMP numbers (from Noise Reduction):
    A10 CPU: .396 MP/s
    A10 OCL: 4.10 MP/s
    IVB CPU: 1.49 MP/s
    IVB OCL: 4.04 MP/s
    SNB CPU: .689 MP/s
    SNB OCL: 3.56 MP/s
    DC SNB CPU: .586 MP/s
    DC SNB OCL: 2.01 MP/s
    Llano CPU: .321 MP/s
    Llano OCL: 2.39 MP/s

    I had a graph for WinZip, but then we pulled it because apparently WinZip's OpenCL performance is best using the legacy compression. I used their newer Zipx compression, which results in a smaller file but isn't as optimized (yet). So now I need to spend about two hours retesting WinZip and 7-zip. Thanks for understanding.
  • Beenthere - Tuesday, May 15, 2012 - link

    As expected Trinity delivers in all areas and should meet most people's needs quite well. Good job AMD. You get my money!
  • tipoo - Tuesday, May 15, 2012 - link

    I wonder what causes these odd results? The 7660 winning by a wide margin in most games, but losing by a small margin in some? Is it whether the games are pixel fill vs pixel shader (hd4000 is good at the former, bad at the latter) bound, or is there a driver issue with the 4000, or what?
  • Wolfpup - Tuesday, May 15, 2012 - link

    That has to be the most surprising thing in the review to me. While I know technically today's GPUs are really CPUs geared towards less branchy, more parallel code, it still caught me off guard that someone had thought to run a file compression utility on it!

    Also surprised Intel has OpenCL drivers at all...not surprised they're bad though. I wonder what they do? Like is their "GPU" portion of Sandy/Ivy bridge actually capable of doing that type of work, or are they mostly just using the CPU?

    Still hate "quicksync" and the graphics portion of those CPUs as it's wasting at least enough transistors for a 5th core.
  • jwcalla - Tuesday, May 15, 2012 - link

    File compression can be parallelized, but there are some unfortunate limitations... final compressed size is generally less optimal with parallel compression, and compressing large volumes of data become memory-bound fairly quickly. But for "ordinary" compression tasks it's quite effective.

    The article didn't indicate if the CPU compression tests were single-threaded or multi-threaded.
  • Brazos - Tuesday, May 15, 2012 - link

    Can I assume the improvements seen here will be implemented in the next version of Bulldozer (Vishera)?
  • mikato - Wednesday, May 16, 2012 - link

    Yes, it's supposed to use Piledriver modules, not Bulldozer.
  • silverblue - Thursday, May 17, 2012 - link

    Plus even the Piledriver implementation in Trinity should, clock for clock, be faster than Bulldozer even without L3 cache. This isn't a repeat of Llano vs. Phenom II where Husky cores were technically faster than Stars albeit lacked L3 cache which brought the performance back down again.

    Some information about the caches plus their latency would be really appreciated; if it bodes well here, Vishera might be a very decent chip.
  • jwcalla - Tuesday, May 15, 2012 - link

    Is there really a large market for gaming on a lower-end laptop? I can see us techies being interested in that sort of thing, but what percentage of PC buyers is actually concerned about gaming performance, let alone on a laptop? In real world terms, I'm not seeing AMD's strategy giving it much of an advantage.

    I'm willing to entertain the reality that Intel has been "overselling performance" to casual users for some time now, and so maybe low-end is more than good enough... but, if true, AMD seems to be focusing on a segment that is going to have an enormous amount of competition in the next 2-3 years.

Log in

Don't have an account? Sign up now