Meet AMD's Berlin: Radeon + Steamroller

The current Opteron 4310 EE (2 modules, 4 cores at 2.2-3 GHz, 40W TDP) and Opteron 4376 HE (4 modules, 8 cores at 2.6-3.6 GHz, 65W TDP) are about the best AMD can deliver for low power servers that need more processing power. These will be replaced by AMD’s Berlin CPU, the first Opteron with the new Steamroller cores.

The Quadcore chip integrates a next generation Radeon core, PCI-e 2.0/3.0, LPC, USB 2.0/3.0 and Serial-ATA controller.

AMD says that using the graphics core for the heavy scalar floating point will get as easy as C++ programming and as a result, Berlin should make a few heads turn in the HPC world. It even looks like SSE-x will get less and less important over time in that market. Andrew Feldman told us that Berlin will offer at least two time much CPU processing performance as the Opteron X-series. Although the TDP will be higher, the performance per Watt will be quite a bit better.  Berlin will use the same 28 nm process technology as the Opteron x1150 and x2150. This chip will be available in the first half of 2014, and might experience some stiff competition from Haswell based Xeons.

Time for new server CPUs! Seattle: the Atom killer?
Comments Locked

80 Comments

View All Comments

  • andrewaggb - Wednesday, June 19, 2013 - link

    I like AMD. But they are in a tough spot and I don't see them getting out of it with any of their roadmaps. The only interesting product is their ARM server chip. Everything else appears to be more of the same with slight improvements. That's not going to change the market dynamics. Intel squeezed up to 50% better idle battery life (and 30-100% faster graphics) with haswell and is targeting 2x cpu performance for the next atom and has their new xeon phi powering the fastest supercomputer (even if it's slightly less efficient than some of the other machines). They are setting aggressive targets and meeting them.

    AMD needs to hit targets that are at least 30-100% better than current offerings at something that matters. Be it IPC, power consumption, graphics, maximum clocks, core counts... but something.
    They should just sell the PS4 or XBOne chips. 8 cores and gddr5 or edram. That would give probably a 100% increase in apu graphics performance and double the cores which would give some cpu tasks a large increase.
  • silverblue - Tuesday, June 18, 2013 - link

    I'm not sure I agree. Even with Steamroller fixing multithreading, that in itself won't result in a big gain. If Cinebench is anything to go by, FX scales decently enough there so there's not a huge amount of performance left on the table without a decent single core IPC gain.

    Each BD core only contains 2 ALUs and 2 AGUs. AMD argues that dropping the third of each was a necessary evil, however whereas Stars couldn't really utilise them, Core certainly can. Additionally, FlexFPU isn't helping - theoretically, AMD shouldn't lag behind in AVX but they do significantly, so they'd need to boost the FPU to 256-bit per pipe in order to gain parity... but they're not doing this with Steamroller as far as I've heard (in fact, the only thing they're doing is removing an MMX unit). FX's memory controller doesn't appear to be that good, either, and the significant power savings from high density libraries don't come until Excavator (though Steamroller does get the dynamic L2, which is neat).

    Not having an FX or Opteron Steamroller 8-core (or higher) is a bit daft regardless of the above, unless it's not a big enough jump, OR Excavator has been pushed forwards.
  • jjj - Tuesday, June 18, 2013 - link

    A bit alarming that there is no 12-16 Steamroller cores on 28 or 20nm on the roadmap and what does that tells us about desktop (they don't really have a public roadmap for 2014 desktop)?
    I do wonder if Warsaw is one huge die or they are still going with 2 dies with 8 cores each.
    On the ARM side it will be interesting to see power consumption on 28nm,at least this way they'll have the product sooner and i do wish the DIY players and the SoC makers would set some standards for ARM desktops. At the very least there is a HTPC and file server market that can expand to desktop.
  • SilentSin - Tuesday, June 18, 2013 - link

    I wonder if AMD is taking an Intel release approach to this. The high core-count EP server CPUs coming out a year after the smaller ones with some tweaks. Steamroller is set to be released for desktop in Kaveri form this year- same die as Berlin I am guessing. But you are right that I don't see a Steamroller FX chip or a server variant with 4m/8c which is puzzling. I hope that they aren't going to rely on fused-off Kaveri 2m/4c CPU only parts for clients as it simply isn't enough horsepower. The BD architecture is actually extremely competitive with anything Intel has when used in a proper Linux environment.

    Hopefully these products come out on time, AMD really needs timely execution to have any hope here. Other 64-bit ARM solutions are just around the corner and they will be left in the dust if they have constant delays.
  • Gigaplex - Tuesday, June 18, 2013 - link

    "The BD architecture is actually extremely competitive with anything Intel has when used in a proper Linux environment."

    Only for integer based workloads. It still struggles with floating point.
  • TiredOldFart2 - Tuesday, June 18, 2013 - link

    Integer workload is most of the workload for now. Fp intensive will naturally migrate onto opencl over time, as it is more efficient to do so.
  • sanaris - Monday, February 17, 2014 - link

    You are newbie in server market. Server folks will never go to closed language.
    Because they value their time of development.

    And of cause server processors need as much floating point as possible in 64/128 wide range.
    The funny "32 fp performance" numbers which opencl and cuda creators are showing to us are irrelevant because you need as much significant bits as possible - 32 minus exponent minus reserve - conclusion is 32 fp is irrelevant.
  • jjj - Tuesday, June 18, 2013 - link

    Intel got greedy and lazy ,AMD had an opportunity in 200+$ desktop with many cores, preferably on 20nm so they can fit 16 cores in a reasonable die size). AMD also has an opportunity to make an APU similar in perf to Xbox One, for a nice budged gaming box capable of playing console ports for quite a few years and more when the APU gets CFed with a discrete card.We'll see big of a GPU Kaveri has ,it's debatable if it's worth using a big enough GPU on 28nm.
    In ARM consumer, both Intel and M$ are somewhat alienating customers so the ARM players should push on all fronts.
    Sadly it seems AMD doesn't have the resources to be aggressive and fight too many battles.
  • sanaris - Monday, February 17, 2014 - link

    AMD had opportunity everywhere since the times they made opteron 180 and overall transition to manycore era. But they throw their chance with opteron decision to drop floating point. And of cause AMD needs manual cache regulation instruction set.
  • JDG1980 - Tuesday, June 18, 2013 - link

    I don't think we can necessarily infer desktop strategy from the server roadmap. AMD already has a really, really small market share (something like 4%) in big x86 server parts. Their market share on the desktop is nowhere near that bad. Sure, they're going to release the APUs first for Steamroller since that's a mainstream product and has a bigger market than the enthusiasts. But once they get that done, they're going to need to do something to keep themselves in the news until the next architecture, and putting together a FX chip wouldn't be too much of a stretch. I do think that Socket AM3+ is dead; the FX parts will most likely be on the same socket as the APUs from now on. This will minimize engineering costs.

Log in

Don't have an account? Sign up now