Time for New Server CPUs!

While the network fabric and optimized motherboard designs were the key features of the micro server, it is clear that there is room for a “specialized micro server CPU” between the current anemic Atoms and the low power Xeons and Opterons.  AMD announced last week the quadcore Opteron-X series, based upon the Jaguar core.   The quadcore X1150 is claimed to perform twice as fast as the current Atoms at 2 GHz, but needs 17W to achieve this. You can lower the power usage by fiddling with a p-state cap in the BIOS. Unfortunately the 9W number that so many publications talked about without further commentary is only achieved at 1 GHz. At that clockspeed the performance per Watt advantage will be negligible compared to the 2 GHz Atom S1260 at 8.5W. The best performance/watt will be achieved somewhere between 1.5 and 2 GHz, but the advantage that the new Opteron-X has over the Atom is not as large as many people thought. Unless of course you can make use of the floating point processing power of the integrated Radeon core in the Opteron X-1250 APU.

But although the new Opteron-X can not offer a massive performance/watt improvement over the Atom, it is a much more attractive micro server chip. It can deliver the “good enough” horsepower that an Atom can not deliver. But then again, the current low power Ivy Bridge based Xeon, the Xeon E3-1220LV2 was already a very good micro server chip.  But those who think that AMD will be contend with an underdog role once again, are wrong.  AMD has a pretty ambitious roadmap to attack this market.

Low power server CPUs Berlin: Radeon mixed with Steamroller
Comments Locked

80 Comments

View All Comments

  • andrewaggb - Wednesday, June 19, 2013 - link

    I like AMD. But they are in a tough spot and I don't see them getting out of it with any of their roadmaps. The only interesting product is their ARM server chip. Everything else appears to be more of the same with slight improvements. That's not going to change the market dynamics. Intel squeezed up to 50% better idle battery life (and 30-100% faster graphics) with haswell and is targeting 2x cpu performance for the next atom and has their new xeon phi powering the fastest supercomputer (even if it's slightly less efficient than some of the other machines). They are setting aggressive targets and meeting them.

    AMD needs to hit targets that are at least 30-100% better than current offerings at something that matters. Be it IPC, power consumption, graphics, maximum clocks, core counts... but something.
    They should just sell the PS4 or XBOne chips. 8 cores and gddr5 or edram. That would give probably a 100% increase in apu graphics performance and double the cores which would give some cpu tasks a large increase.
  • silverblue - Tuesday, June 18, 2013 - link

    I'm not sure I agree. Even with Steamroller fixing multithreading, that in itself won't result in a big gain. If Cinebench is anything to go by, FX scales decently enough there so there's not a huge amount of performance left on the table without a decent single core IPC gain.

    Each BD core only contains 2 ALUs and 2 AGUs. AMD argues that dropping the third of each was a necessary evil, however whereas Stars couldn't really utilise them, Core certainly can. Additionally, FlexFPU isn't helping - theoretically, AMD shouldn't lag behind in AVX but they do significantly, so they'd need to boost the FPU to 256-bit per pipe in order to gain parity... but they're not doing this with Steamroller as far as I've heard (in fact, the only thing they're doing is removing an MMX unit). FX's memory controller doesn't appear to be that good, either, and the significant power savings from high density libraries don't come until Excavator (though Steamroller does get the dynamic L2, which is neat).

    Not having an FX or Opteron Steamroller 8-core (or higher) is a bit daft regardless of the above, unless it's not a big enough jump, OR Excavator has been pushed forwards.
  • jjj - Tuesday, June 18, 2013 - link

    A bit alarming that there is no 12-16 Steamroller cores on 28 or 20nm on the roadmap and what does that tells us about desktop (they don't really have a public roadmap for 2014 desktop)?
    I do wonder if Warsaw is one huge die or they are still going with 2 dies with 8 cores each.
    On the ARM side it will be interesting to see power consumption on 28nm,at least this way they'll have the product sooner and i do wish the DIY players and the SoC makers would set some standards for ARM desktops. At the very least there is a HTPC and file server market that can expand to desktop.
  • SilentSin - Tuesday, June 18, 2013 - link

    I wonder if AMD is taking an Intel release approach to this. The high core-count EP server CPUs coming out a year after the smaller ones with some tweaks. Steamroller is set to be released for desktop in Kaveri form this year- same die as Berlin I am guessing. But you are right that I don't see a Steamroller FX chip or a server variant with 4m/8c which is puzzling. I hope that they aren't going to rely on fused-off Kaveri 2m/4c CPU only parts for clients as it simply isn't enough horsepower. The BD architecture is actually extremely competitive with anything Intel has when used in a proper Linux environment.

    Hopefully these products come out on time, AMD really needs timely execution to have any hope here. Other 64-bit ARM solutions are just around the corner and they will be left in the dust if they have constant delays.
  • Gigaplex - Tuesday, June 18, 2013 - link

    "The BD architecture is actually extremely competitive with anything Intel has when used in a proper Linux environment."

    Only for integer based workloads. It still struggles with floating point.
  • TiredOldFart2 - Tuesday, June 18, 2013 - link

    Integer workload is most of the workload for now. Fp intensive will naturally migrate onto opencl over time, as it is more efficient to do so.
  • sanaris - Monday, February 17, 2014 - link

    You are newbie in server market. Server folks will never go to closed language.
    Because they value their time of development.

    And of cause server processors need as much floating point as possible in 64/128 wide range.
    The funny "32 fp performance" numbers which opencl and cuda creators are showing to us are irrelevant because you need as much significant bits as possible - 32 minus exponent minus reserve - conclusion is 32 fp is irrelevant.
  • jjj - Tuesday, June 18, 2013 - link

    Intel got greedy and lazy ,AMD had an opportunity in 200+$ desktop with many cores, preferably on 20nm so they can fit 16 cores in a reasonable die size). AMD also has an opportunity to make an APU similar in perf to Xbox One, for a nice budged gaming box capable of playing console ports for quite a few years and more when the APU gets CFed with a discrete card.We'll see big of a GPU Kaveri has ,it's debatable if it's worth using a big enough GPU on 28nm.
    In ARM consumer, both Intel and M$ are somewhat alienating customers so the ARM players should push on all fronts.
    Sadly it seems AMD doesn't have the resources to be aggressive and fight too many battles.
  • sanaris - Monday, February 17, 2014 - link

    AMD had opportunity everywhere since the times they made opteron 180 and overall transition to manycore era. But they throw their chance with opteron decision to drop floating point. And of cause AMD needs manual cache regulation instruction set.
  • JDG1980 - Tuesday, June 18, 2013 - link

    I don't think we can necessarily infer desktop strategy from the server roadmap. AMD already has a really, really small market share (something like 4%) in big x86 server parts. Their market share on the desktop is nowhere near that bad. Sure, they're going to release the APUs first for Steamroller since that's a mainstream product and has a bigger market than the enthusiasts. But once they get that done, they're going to need to do something to keep themselves in the news until the next architecture, and putting together a FX chip wouldn't be too much of a stretch. I do think that Socket AM3+ is dead; the FX parts will most likely be on the same socket as the APUs from now on. This will minimize engineering costs.

Log in

Don't have an account? Sign up now