Tuesday, May 28, 2013

AMD Opteron™ 6200 Series processors

  • AMD Opteron™ 6200 Series processors

AMD Opteron™ 6200 Series processors picture 2
AMD proudly presents the new AMD Opteron™ 6200 Series—the world’s first and only 16-core x86 server processor with the highest core density for incredible scalability to handle demanding multi-threaded workloads such as cloud computing, virtualization, high-performance computing (HPC), databases and business applications.

The innovative modular architecture helps pack in more cores, making it the natural choice for heavy-threaded cloud computing environments and enterprise applications that demand outstanding performance and scalability.

AMD Opteron™ 6200 Series Processor - Features and Benefits 

Performance


FEATURE FUNCTION  END-USER BENEFIT
New Core Architecture Optimized performance per watt:
  • Full performance from each core
  • High frequency and low-power design
  • Virtualization enhancements
  • Shared double-sized Flex FP
  • ISA enhancements
The new processor core can enhance power efficiency for server products:
  • Help minimize cooling solution complexities
  • Help reduce power at low utilization
  • Enable more control for IT managers
AMD Turbo CORE Technology Turns unused TDP headroom into added clock speed for improved performance The industry’s best full-core boost1 with up to a 500MHz frequency increase2
ISA Enhancements In addition to baseline instruction support, added SSE4.1, SSE4.2, SSSE3, AVX, AES, PCLMULQDQ, FMA4 and XOP Provides performance enhancements to a wide range of applications
Flex FP The world’s only flexible 256-bit FPU. Two 128-bit FMACs shared per module, allowing for dedicated 128-bit execution per core or shared 256-bit execution per module. Enhances flexibility and performance in technical computing

AMD-P 2.0

FEATURE FUNCTION  END-USER BENEFIT
APML (Advanced Platform Management Link) Provides an interface for processor and systems management monitoring and controlling of system resources (in APML-enabled platforms); Comprised of the Remote Power Management Interface (RPMI) and the Precision Thermal Monitor Remote Power Management Interface (RPMI):
  • Ability to monitor and control platform power consumption via p-state limits
  • Access to processor identification and health

Precision Thermal Monitor:
  • Provides accurate information about CPU thermals to closely monitor power/cooling and proactively alert the Base Management Controller (BMC)
  • Early notification helps save time and money by providing intelligence that can be used to more effectively monitor power and thermals to optimize cooling solutions in an IT datacenter
TDP Power Cap Allows the user to set the maximum processor power ceiling via BIOS or APML This technology gives enterprises the ability to customize their chips to meet power and workload demands, providing:
  • More control over power settings
  • Flexibility to set power limits without capping CPU frequencies
AMD CoolSpeed Technology Protects processor integrity by reducing power-states, when a temperature limit is reached
  • Server can automatically drop into lower power mode if processor’s thermal environment exceeds safe operational limits
  • Offers platform providers the ability to safely reduce system fan speeds, which helps deliver greater platform efficiency
C1E Reduces memory controller and HyperTransport™ technology links’ power This feature can equate to a significant power savings in your datacenter depending on system configuration, when the Northbridge and HyperTransport™ technology links are powered down and cores are at idle.
C6 Support Core power gating: When a core is halted its context is exported to system memory and voltage is removed from the core. Helps reduce power consumption at idle by up to 46% over the previous generation3
LV-DDR3 Support Supports 1.25V and 1.35V DDR3 Helps to reduce overall power consumption
Higher DDR3 Frequencies DDR3-1600 now supported for typical memory configurations; DDR3-1866 supported in slightly loaded memory configurations (1 of 1 DIMM per channel) Helps to improve overall system performance
LR-DIMM Support Support for LR-DIMM memory LR-DIMMs provide more capacity per DIMM, which can enable higher overall server memory configurations. Virtualization, cloud computing and other high-capacity applications greatly benefit from LR-DIMMs.

Direct Connect Architecture 2.0

FEATURE FUNCTION  END-USER BENEFIT
Quad Channel Memory Doubles the memory channels compared to Six-Core AMD Opteron™ processors Can increase performance especially in memory-intensive workloads
HyperTransport™ Technology Assist (HT Assist) Helps to increase coherent memory bandwidth and reduce latency in multi-node systems by reducing cache probe traffic between cores Can result in faster queries that can increase performance for cache sensitive applications such as database, virtualization and compute intensive applications
HyperTransport™ 3.0 Technology (HT3) Provides superior system bandwidth between CPUs and I/O, increasing interconnect rate up to a maximum 6.4GT/s with HT3 Helps improve overall system balance and scalability for scale-out computing environments like HPC, database and web serving
Cache and Core Count Integration of up to 16 cores within the same package, 1MB L2 cache per core (up to 16MB of L2 cache per socket), and a shared 16MB L3 cache per socket Offers improved performance and performance/watt (compared to prior generations) for multi-threaded environments like virtualization, database and web serving

  1. Compared to Intel’s Turbo Boost that reaches up to a 266MHz frequency increase. See http://ark.intel.com/products/47925/Intel-Xeon-Processor-E5620#infosectionessentials
  2. Based on internal AMD engineering projections of AMD Opteron 6200 Series processors with up to 500MHz in P1 boost state and up to 1.3GHz in P0 boost state over base P2 clock frequency
  3. Based on testing in AMD Performance Labs as of August, 2011, an AMD Opteron™ processor model 6174 (12-core 2.2GHz) consumes 11.7W in the active idle C1E power state while an AMD Opteron™ processor model 6276 (16-core 2.3GHz) consumes only 6.4W in the active idle C1E power state with new C6 power gating employed. System configuration: “Drachma” reference design kit, 32GB (8 x 4GB DDR3-1333) memory, 500GB SATA disk drive, Microsoft® Windows Server® 2008 x64 Enterprise Edition R2. SVR-60


Ground-Breaking Architecture

The new AMD Opteron™ 6200 Series processor delivers the worlds highest core density, lets users host more virtual machines per server, handle more database users and solve more complex HPC applications with fewer nodes and less power - offering a better option for the hyper-efficient, virtualized, cloud-ready world.

Performance for Data Centers
  • 84% greater performance for HPC than the competition1
  • Up to 160% more cores2 to handle more VMs per platform and minimize datacenter space
  • Low cost per VM lets you deploy fewer physical servers1 as business needs scale up

Scalability for Virtualization
  • 73% more memory throughput3 to scale as your workloads grow

Economics for the Cloud
  • Unparalleled efficiency with 1/2 the power-per-core4
  • Requires 2/3 less floor space5
  • 1/3 to 2/3 lower platform price6


AMD in the Cloud

Power the Cloud with AMD Opteron™ Processors

AMD Opteron™ 6200 Series processors picture 1
The new core architecture of AMD Opteron™ processors is designed to handle peak traffic loads with unparalleled energy efficiency to keep budgets under control. With ground-breaking architecture featuring up to 16 cores, AMD Opteron processors can quickly scale up to handle serious workloads so you can expand your business into the cloud.
Performance: Up to 84% greater performance than the competition7
Scalability: Build scalable clouds with up to 73% more memory throughput8
Efficiency: As low as 4.375W per core9 without compromising features

Energy Efficiency

An essential component of any cloud computing environment is the ability to balance performance with low energy consumption. The new AMD Opteron 4200 Series processor provides the low power consumption you need, without sacrificing key features such as cache and memory speed. Specialized features that combine outstanding energy efficiency with high core count processors help ensure AMD-based platforms not only scale well, but do so efficiently.


1LINPACK (2P) AMD Opteron processor Model 6276 generates 84% more FLOPS than Intel Xeon processor Model X5670. (239.1 FLOPS, 2 x AMD Opteron™ processors Model 6276 in Supermicro H8DGT server, 64GB (8 x 8GB DDR3-1600) memory, SuSE Linux® Enterprise Server 11 SP1 64-bit, gfortran compiler v4.6, OMPI 1.5.3, AMD Core Math Library 5.0.0.0. Compiler Flags: -fomit-frame-pointer -O3 -funroll-loops -W -Wall -mavx -mfma4 -fopenmp. 130.1 FLOPS, 2 x Intel Xeon processors Model X5670 in Supermicro 6026TT-BIBQF server, 24GB (6 x 4GB DDR3-1333) memory, SuSE Linux® Enterprise Server 11 SP1 64-bit, Intel Professional Compiler v11.1, OMPI 1.5.1, Intel Math Kernel Library 10.3, Hyper-Threading disabled, Turbo Boost Technology enabled. Compiler Flags: -O3 -w -ansi-alias -i-static -openmp -nocompchk. Assumes 1 VM per core.
2Comparison of 16-core AMD Opteron™ 6200 Series processor with 6-core Intel Xeon 5600 series processor and 10-core Intel Xeon E7 Series processor. SVR-30
3Based on STREAM benchmark results obtained by AMD Performance Labs as of October 2011. 73 GB/s: 2 x AMD Opteron™ processors Model 6276 in Supermicro H8DGT, 64GB (8 x 8GB DDR3-1600) memory, SuSE Linux® Enterprise Server 11 SP1 64-bit, x86 Open64 4.2.5-1 Compiler Suite. 42 GB/s: 2 x Intel Xeon processors Model X5670 in Supermicro X8DTT server, 24GB (6 x 4GB DDR3-1333) memory, SuSE Linux® Enterprise Server 11 SP1 64-bit, Intel Compiler v11.1.064.
4As of Nov 14, 2011, AMD Opteron™ processor Models 4200 EE have the lowest known power per core of any x86 server processor, at 35W TDP (35W/8 = 4.375W/core). Intels lowest power per core server processor, L5630, is 40W TDP (40W/4 = 10W/core). See http://ark.intel.com/products/series/47915. Previous record held by AMD Opteron processor Models 4100 EE at 35W TDP / 6 cores = 5.83 W/core.
5VMs/rack (2P and 4P) One rack of AMD Opteron 6200 Series-based servers can support 672 VMs (1 VM per core, 2U servers). This would take three racks of floor space and 56 2U Intel Xeon 5600 Series-based servers to do the same. Assumes 1 VM/core, AMD Opteron 6200 Series-based 2P 2U server has up to 32 cores, supports up to 32 VMs/server x 21 servers per rack, which equals 672 VMs per server. Intel Xeon 5600 Series-based 2P 2U server has up to 12 cores, supports up to 12 VMs/server x 21 servers per rack, which equals 252 VMs per rack, Intel specs as of 11/4/11 at www.intc.com/pricelist.cfm.
6Top bin comparisons. Dell R710 with two top bin Intel Xeon processor Model X5690s is $7,103. Since pricing for a Dell R715 with two top bin AMD Opteron processor Model 6282 SE (1ku $1019) is not yet available, the similarly priced AMD Opteron processor Model 6140 (1ku $989) was used and the server yielded a price of $4564. That is a 36% price savings. Both servers were configured with 32GB RAM, 146GB 10K hdd, and 3yr base warranty and large enterprise pricing is from www.dell.com as of 10/22/11. HP DL 580 with four top bin Intel Xeon processor Model E7-4870 is $29,336 at www.dell.com. Since pricing for an HP DL585 pricing with four top bin AMD Opteron Model 6828 SE (1ku $1019) is not yet available, the similarly priced AMD Opteron processor Model 6140 (1ku $989) was used and server yielded a price of $11,094. That is a 62% price savings. Both servers were configured with 64GB RAM, 72GB 15K hdd, and 3yr base warranty and large enterprise pricing is from www.hp.com as of 10/22/11. VMware vSphere pricing not included, assuming both servers configured with versions 5.0 or 4.1u2, which are the same price for AMD- and Intel-based servers.
7LINPACK (2P) AMD Opteron processor Model 6276 generates 84% more FLOPS than Intel Xeon processor Model X5670. 239.1 FLOPS, 2 x AMD Opteron™ processors Model 6276 in Supermicro H8DGT server, 64GB (8 x 8GB DDR3-1600) memory, SuSE Linux© Enterprise Server 11 SP1 64-bit, gfortran compiler v4.6, OMPI 1.5.3, AMD Core Math Library 5.0.0.0. Compiler Flags: -fomit-frame-pointer -O3 -funroll-loops -W -Wall -mavx -mfma4 -fopenmp. 130.1 FLOPS, 2 x Intel Xeon processors Model X5670 in Supermicro 6026TT-BIBQF server, 24GB (6 x 4GB DDR3-1333) memory, SuSE Linux© Enterprise Server 11 SP1 64-bit, Intel Professional Compiler v11.1, OMPI 1.5.1, Intel Math Kernel Library 10.3, Hyper-Threading disabled, Turbo Boost Technology enabled. Compiler Flags: -O3 -w -ansi-alias -i-static -openmp -nocompchk. Assumes 1 VM per core.
8Based on STREAM benchmark results obtained by AMD Performance Labs as of October, 2011. 73 GB/s: 2 x AMD Opteron™ processors Model 6276 in Supermicro H8DGT, 64GB (8 x 8GB DDR3-1600) memory, SuSE Linux© Enterprise Server 11 SP1 64-bit, x86 Open64 4.2.5-1 Compiler Suite. 42 GB/s: 2 x Intel Xeon processors Model X5670 in Supermicro X8DTT server, 24GB (6 x 4GB DDR3-1333) memory, SuSE Linux© Enterprise Server 11 SP1 64-bit, Intel Compiler v11.1.064.
9As of November 14, 2011, AMD Opteron™ processor Models 4200 EE have the lowest known power per core of any x86 server processor, at 35W TDP (35W/8 = 4.375W/core). Intels lowest power per core server processor, L5630, is 40W TDP (40W/4 = 10W/core). See http://ark.intel.com/products/series/47915. Previous record held by AMD Opteron processor Models 4100 EE at 35W TDP / 6 cores = 5.83 W/core. SVR-58



Data taken from http://server.amd.com/LP=237#1 and http://www.amd.com/us/products/server/processors/6000-series-platform/6200/Pages/6200-series-processors.aspx