SlideShare una empresa de Scribd logo
1 de 20
Descargar para leer sin conexión
OPTIMIZED	
  HADOOP	
  DEPLOYMENTS	
  WITH	
  
SEAMICRO	
  SM15000	
  
PRESENTED	
  AT	
  AMD	
  DEVELOPER	
  SUMMIT,	
  NOV	
  2013	
  
SATHEESH	
  NANNIYUR	
  
BIG	
  DATA	
  IS	
  A	
  STRATEGIC	
  DECISION:	
  CAPEX	
  AND	
  OPEX	
  

Devices	
  

2	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

Apps	
  

Cloud	
  
HIGHER	
  PERFORMANCE,	
  LESS	
  POWER	
  AND	
  SPACE	
  
Hadoop	
  Technology	
  Stack	
  

Data	
  
Warehouse	
  

Data	
  AnalyRcs	
  

Management	
  
Data	
  Access	
  
Data	
  Processing	
  
Data	
  Storage	
  
3	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  
SEAMICRO	
  SM15000™	
  ACCELERATES	
  APACHE™	
  HADOOP™	
  
DEPLOYMENTS	
  
!  Superior	
  high	
  availability	
  
‒  AcRve/standby	
  NameNode	
  
‒  AcRve/standby	
  JobTracker	
  
‒  Highly	
  resilient	
  fabric	
  for	
  inter-­‐node	
  
east-­‐west	
  traffic	
  

!  Reduced	
  down	
  Rme	
  
‒  Remap	
  or	
  rezone	
  disks	
  to	
  recover	
  
data	
  
‒  Hot-­‐swappable	
  upgrades	
  or	
  
component	
  replacements	
  

!  Hardware	
  redundancy	
  
‒  Power	
  supplies	
  
‒  Network	
  I/O	
  

4	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  
SM15000	
  OVERVIEW	
  
64	
  HDDs/SDDs	
  
• 
Share	
  drives	
  across	
  all	
  servers	
  
• 
Assign	
  one	
  server	
  to	
  one	
  or	
  more	
  drives	
  as	
  needed	
  
• 
In	
  service	
  upgrades	
  as	
  needed	
  
64	
  Industry	
  standard	
  x86	
  servers	
  
• 
AMD	
  Opteron™,	
  Intel	
  Xeon®,	
  Atom™	
  
• 
Energy	
  efficient	
  processor	
  
• 
20	
  Gbps	
  per	
  socket,	
  16X	
  tradiRonal	
  servers	
  

960	
  terabytes	
  Fabric	
  Storage	
  
• 
Extends	
  supercompute	
  fabric	
  to	
  external	
  storage	
  
• 
Up	
  to	
  3.84	
  PB	
  storage	
  capacity;	
  up	
  to	
  960	
  3.5”	
  SAS/
SATA	
  drives	
  
• 
Map	
  to	
  any	
  CPU—same	
  as	
  internal	
  drives	
  

5	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

160	
  Gbps	
  Network	
  I/O	
  
• 
Share	
  network	
  I/O	
  across	
  all	
  servers	
  
• 
Eliminate	
  TOR	
  switch	
  
• 
Minimize	
  cabling	
  
• 
In	
  service	
  upgrades	
  as	
  needed	
  
SEAMICRO	
  FREEDOM™	
  FABRIC	
  ASIC	
  PROVIDES	
  MASSIVE	
  
PERFORMANCE,	
  REDUCES	
  POWER	
  AND	
  SPACE	
  	
  
B E N E F I T S 	
  

Freedom™	
  

SeaMicro
IOVT

TIO

Freedom Supercompute
Fabric

6	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

Eliminates 90% of the components
on a motherboard shrinking
power used, cost and space

Reduces the power used by
any CPU by consolidating and
shutting off unused functionality

Provides massive bandwidth
while eliminating power hungry
top of rack switches
FS	
  4060-­‐L	
  FABRIC	
  STORAGE	
  ENCLOSURE	
  WITH	
  ZONING	
  
CAPABILITY	
  
!  High	
  density,	
  power	
  opRmized	
  4U	
  enclosure	
  with	
  60	
  3.5”	
  drives	
  
!  Up	
  to	
  16	
  enclosures	
  per	
  SM15000,	
  960	
  drives,	
  and	
  3.84	
  PB	
  
storage	
  capacity	
  
!  Redundant	
  controllers,	
  ports,	
  fans,	
  and	
  PSUs	
  
!  Support	
  cost	
  opRmized	
  24x7	
  operaRons	
  SATA	
  HDD	
  for	
  high	
  
density	
  Big	
  Data	
  and	
  Object	
  Storage	
  deployments	
  

!  OpRonal	
  configuraRon	
  to	
  logically	
  parRRon	
  an	
  enclosure	
  into	
  
two	
  30	
  3.5”	
  drive	
  enclosures	
  
!  Balanced	
  disk	
  to	
  core	
  raRo	
  (1:1)	
  for	
  opRmizing	
  Hadoop	
  
performance	
  
!  Field	
  configurable	
  to	
  provide	
  utmost	
  	
  flexibility	
  to	
  balance	
  
density	
  and	
  performance	
  

7	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  
FREEDOM	
  FABRIC	
  DISAGGREGATES	
  SERVER	
  RESOURCES	
  
PROVIDES	
  FLEXIBILITY	
  FOR	
  EXPANSION	
  AND	
  INFRASTRUCTURE	
  OPTIMIZATION	
  

!  SM15000	
  provides	
  independent	
  scaling	
  of	
  Compute,	
  Storage,	
  and	
  Network	
  
!  Centrally	
  managed	
  provisioning	
  of	
  storage	
  and	
  network	
  resources	
  to	
  compute	
  
nodes	
  enabled	
  by	
  CLI	
  and	
  API	
  interfaces	
  
SeaMicro	
  SM15000	
  Server	
  

Hadoop	
  OpRmizaRon	
  

Compute	
  and	
  Memory	
  
Pool	
  
CPU	
  

CPU	
  

CPU	
  

CPU	
  

Deploy	
  

CPU	
  

CPU	
  

Fabric	
  Interconnect	
  

Shared	
  Storage	
  
Pool	
  

8	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

Network	
  
Pool	
  

Cost	
  and	
  
Performance	
  

Tune	
  and	
  
OpWmize	
  

Run	
  and	
  
Analyze	
  
SM15000	
  FLEXIBLE	
  STORAGE	
  ALLOWS	
  ITERATIVELY	
  
OPTIMIZING	
  APACHE®	
  HADOOP™	
  DEPLOYMENT	
  
!  Flexible	
  shared	
  storage	
  with	
  commodity	
  hardware	
  enabled	
  by	
  SeaMicro	
  fabric	
  
technology	
  
!  Decoupled	
  from	
  Compute	
  and	
  Network	
  to	
  grow	
  storage	
  independently	
  
!  IteraRvely	
  opRmize	
  Hadoop	
  disk	
  to	
  core	
  raRo	
  as	
  applicaRon	
  needs	
  evolve	
  
Captive DAS with Rigid
Storage to Compute Ratio

Flexible scale-out Fabric Storage
up to 5PB

Freedom	
  Fabric	
  

Traditional
Rackmount

9	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

Intel	
  /AMD	
  X86	
  
servers	
  
APACHE®	
  HADOOP™	
  DEPLOYMENT	
  ON	
  THE	
  SEAMICRO	
  
SM15000™	
  
SM15000	
  
ZooKeeper	
  

HDFS	
  
NameNode	
  

ZooKeeper	
  

MapReduce	
  
JobTracker	
  

MapReduce	
  
JobTracker	
  

Up	
  to	
  160	
  Gb/s	
   Redundant	
  NameNode	
  and	
  
JobTracker	
  
bandwidth	
  for	
  data	
  
ingesWon	
  

10	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

DataNode/	
  
TaskTracker	
  

DataNode/	
  
TaskTracker	
  

DataNode/	
  
TaskTracker	
  

DataNode/	
  
TaskTracker	
  

HDFS	
  
NameNode	
  

DataNode/	
  
TaskTracker	
  

DataNode/	
  
TaskTracker	
  

10Gb/s	
  network	
  
bandwidth/node	
  

DataNode/	
  
TaskTracker	
  

DataNode/	
  
TaskTracker	
  

60	
  DataNode/	
  TaskTracker	
  
Nodes	
  	
  	
  

•  8GB/s	
  storage	
  bandwidth	
  
•  Flexible	
  storage	
  capacity	
  
SEAMICRO	
  REFERENCE	
  ARCHITECTURE	
  FOR	
  APACHE®	
  
HADOOP™	
  
!  Hadoop	
  HA	
  with	
  NameNode	
  and	
  JobTracker	
  AcRve/Standby	
  ConfiguraRon	
  
!  Up	
  to	
  60	
  DataNode/TaskTracker	
  nodes,	
  512	
  x86	
  cores,	
  960	
  TB	
  raw	
  capacity,	
  10	
  
Gb/s	
  Internode	
  bandwidth	
  and	
  160	
  Gb/s	
  uplink	
  bandwidth	
  in	
  28	
  RU	
  and	
  5.8	
  	
  kW	
  
SoluWon	
  Components	
  
• 
• 
• 
• 
• 
• 

Highly	
  Available	
  AcRve/Standby	
  NameNode	
  
Highly	
  Available	
  AcRve/Standby	
  JobTracker	
  
DataNode/TaskTracker	
  
SM15000	
  Internal	
  Drives	
  for	
  NameNode	
  and	
  JobTracker	
  
11	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

	
  
• 
• 

SeaMicro	
  SM15000	
  AMD	
  Opteron	
  or	
  
Intel	
  Xeon	
  (Ivy	
  Bridge)	
  CPU	
  chassis	
  
32	
  GB	
  memory	
  per	
  node	
  
2	
  10GbE	
  Network	
  cards	
  
8	
  Storage	
  Controller	
  cards	
  
4	
  FS	
  4060-­‐L	
  enclosures	
  with	
  SAS	
  
zoning	
  enabled	
  
60	
  4TB	
  3.5”	
  SAS	
  or	
  SATA	
  drives	
  per	
  
enclosure	
  
Any	
  Hadoop	
  distribuRon	
  (CDH,	
  HDP,	
  
MapR,	
  Apache	
  Hadoop	
  etc.)	
  
ZooKeeper	
  for	
  NameNode	
  and	
  
JobTracker	
  HA	
  
SM15000	
  HADOOP	
  DEPLOYMENT	
  
SoluWon	
  Components	
  

SM15000	
  Intel	
  
Xeon	
  

SM15000	
  
AMD	
  Opteron	
  

Performance	
  
OpRmized	
  

Cost	
  OpRmized	
  

Racks	
  

<1	
  

<1	
  

Servers	
  

64	
  

64	
  

Cores	
  

256	
  

512	
  

DRAM	
  

2TB	
  

4	
  TB	
  

240/960	
  TB	
  

240/960	
  TB	
  

Cable	
  Management	
  

0	
  RU	
  

0	
  RU	
  

ToR	
  SwRches	
  

None	
  

None	
  

Downlink	
  Network	
  Cables	
  

None	
  

None	
  

60	
  

60	
  

10	
  Gb/s	
  

10	
  Gb/s	
  

Uplink	
  bandwidth	
  

Up	
  to	
  160	
  Gb/s	
  

Up	
  to	
  160	
  Gb/s	
  

Storage	
  bandwidth	
  

8	
  GigaBytes/s	
  

8	
  GigaBytes/s	
  

Use	
  case	
  

Hard	
  Drives	
  

Data	
  Nodes	
  
Bandwidth	
  per	
  node	
  

12	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  
SM15000	
  –	
  INDUSTRY’S	
  ONLY	
  FLEXIBLE	
  SYSTEM	
  FOR	
  
OPTIMIZING	
  HADOOP	
  CLUSTERS	
  
Storage	
  
Intensive	
  

Compute	
  
Intensive	
  

Network	
  
Intensive	
  

Compute	
  
Intensive	
  

Storage	
  
Intensive	
  

Map	
  
Reduce	
  

Map	
  

Reduce	
  

Map	
  

HDFS	
  
Input	
  

Up	
  to	
  512	
  x86	
  
cores	
  with	
  4TB	
  
DRAM	
  per	
  Fabric	
  
Server	
  in	
  10RU	
  

13	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

Map	
  and	
  
Intermediate	
  
Data	
  Write	
  

Flexible	
  scale-­‐out	
  
storage	
  with	
  over	
  
1400	
  spindles	
  and	
  
5	
  Petabytes	
  of	
  
capacity	
  

Shuffle	
  

Reduce	
  

10	
  Gpbs	
  Inter-­‐
Node	
  Bandwidth	
  
per	
  server	
  	
  

HDFS	
  
Output	
  

160	
  Gbps	
  	
  shared	
  	
  
uplink	
  for	
  Inter-­‐
Rack	
  traffic	
  
SM15000	
  HADOOP	
  PERFORMANCE	
  BETTER	
  THAN	
  
COMPETITIVE	
  OFFERINGS	
  
!  77%	
  less	
  power	
  per	
  node	
  
!  30%	
  less	
  power	
  per	
  core	
  
!  63%	
  more	
  data	
  sorted	
  per	
  second	
  per	
  Wat	
  than	
  Large	
  Vendor	
  
Large	
  Vendor	
  

Terasort	
  
CompleRon	
  

7	
  min	
  
13	
  seconds	
  

8	
  min	
  
33	
  seconds	
  

Nodes	
  

62	
  incl.	
  HA	
  

18	
  

248	
  

216	
  

5800	
  W	
  

7200	
  W	
  

MB/s	
  per	
  Wat	
  

0.4	
  

0.24	
  

MB/s	
  per	
  CPU	
  
core	
  

9.3	
  

9.0	
  

Wats/Node	
  

94	
  W	
  

400	
  W	
  

Wats/Core	
  

23	
  W	
  

33	
  W	
  

SeaMicro	
  SM15000	
  with	
  64	
  Nodes	
  based	
  on	
  	
  
Intel	
  Xeon®	
  (Ivy	
  Bridge)	
  1265	
  L-­‐v2	
  CPU,	
  32	
  GB	
  
memory,	
  and	
  2	
  3.5”	
  3TB	
  SAS	
  drives	
  per	
  node	
  

CPU	
  Cores	
  
Power	
  

CompeRRve	
  soluRon	
  consists	
  of	
  dual	
  socket	
  2U	
  rack-­‐
mount	
  servers	
  with	
  Intel	
  Xeon	
  E5-­‐2667	
  2.9	
  GHz	
  octal	
  
core	
  CPUs	
  with	
  64	
  GB	
  memory,	
  16	
  disks,	
  and	
  4	
  GbE	
  
network	
  links	
  per	
  node	
  
14	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

Terasort	
  -­‐	
  Sort	
  rate	
  per	
  Wa`	
  

MB/s	
  per	
  Wa`	
  

SM15000	
  

0.4	
  
0.35	
  
0.3	
  
0.25	
  
0.2	
  
0.15	
  
0.1	
  
0.05	
  
0	
  
SM15000	
  

Large	
  Vendor	
  
WAYFAIR.COM:	
  PERSONALIZED	
  SHOPPING	
  EXPERIENCE	
  
FOR	
  “A	
  ZILLION	
  THINGS”	
  
Applica'ons:	
  Apache®	
  Hadoop™,	
  SQL	
  server,	
  
PHP	
  
!  Challenge	
  
‒  Space	
  and	
  power	
  constraints	
  hindered	
  availability	
  of	
  “shared	
  
nothing”	
  servers	
  for	
  development	
  
‒  Too	
  costly	
  in	
  space	
  and	
  power	
  to	
  use	
  tradiRonal	
  servers	
  for	
  the	
  
number	
  of	
  servers	
  required	
  and	
  accurately	
  test	
  applicaRon	
  
performance	
  

!  SoluRon	
  
‒  SeaMicro	
  SM	
  high	
  density	
  server	
  	
  
‒  256	
  Intel®	
  Xeon®	
  cores	
  in	
  10	
  RU	
  system	
  
‒  64	
  servers,	
  1.28	
  Tbps	
  SeaMicro	
  Freedom™	
  Supercompute	
  Fabric	
  

!  Results	
  
‒  Reduced	
  development	
  cycles	
  and	
  shortened	
  Rme	
  to	
  market	
  for	
  
new	
  products	
  
‒  Increased	
  producRvity	
  of	
  development	
  engineers	
  by	
  providing	
  
abundant	
  access	
  to	
  “shared	
  nothing”	
  servers	
  versus	
  developing	
  
on	
  virtualized	
  server	
  farms	
  
‒  Eliminated	
  unnecessary	
  equipment	
  such	
  as	
  top	
  of	
  rack	
  switches	
  
and	
  terminal	
  servers;	
  simplified	
  network	
  and	
  power	
  cabling	
  

15	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

“The	
  SeaMicro	
  SM	
  server	
  is	
  helping	
  us	
  operate	
  at	
  a	
  large	
  scale	
  
and	
  fast	
  pace.	
  	
  The	
  key	
  benefits	
  are	
  reduced	
  operaRng	
  costs	
  and	
  
increased	
  efficiency	
  for	
  our	
  big	
  data	
  development	
  infrastructure.	
  	
  
It	
  provides	
  the	
  highest	
  density	
  and	
  flexibility	
  while	
  slashing	
  
energy	
  consumpRon:	
  256	
  Intel	
  Xeon	
  cores,	
  64	
  hosts.	
  	
  It	
  
consumes	
  50	
  percent	
  less	
  power	
  and	
  doubled	
  our	
  compuRng	
  
capacity...”	
  
	
  
Ben	
  Clark,	
  Director	
  of	
  So@ware	
  Engineering	
  
EHARMONY:	
  INCREASE	
  COMPUTING	
  WHILE	
  	
  
REDUCING	
  TOTAL	
  COST	
  OF	
  OWNERSHIP	
  
Applica'ons:	
  Apache®	
  Hadoop™	
  
!  Challenge	
  
‒  Provide	
  cost	
  effecRve	
  compuRng	
  	
  
plaxorm	
  for	
  Apache	
  Hadoop	
  
‒  Reduce	
  costs	
  incurred	
  from	
  external	
  	
  
cloud	
  compuRng	
  

!  SoluRon	
  
‒  SeaMicro	
  SM10000-­‐64	
  high	
  density	
  server	
  	
  
‒  512	
  Intel®	
  Atom™	
  cores	
  in	
  10	
  RU	
  system	
  

!  Results	
  
‒  Reduce	
  TCO	
  by	
  more	
  than	
  74	
  percent	
  
‒  Save	
  thousands	
  per	
  month	
  spent	
  on	
  	
  
cloud	
  compuRng	
  service	
  
‒  URlize	
  compuRng	
  resources	
  7	
  x	
  24	
  

16	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

“We	
  purchased	
  SeaMicro	
  servers	
  and	
  immediately	
  
reduced	
  our	
  operaRng	
  expenses…The	
  system	
  has	
  
been	
  in	
  place	
  for	
  over	
  two	
  years,	
  and	
  we	
  have	
  had	
  
zero	
  down	
  Rme.”	
  
	
  
Cormac	
  Twomey,	
  Data	
  Center	
  Opera'ons	
  
AMD	
  SEAMICRO	
  PARTNERS	
  WITH	
  INDUSTRY	
  LEADERS	
  

Hadoop	
  

Partner

17	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

OS/Hypervisor	
  

OpenStack	
  
Discussion	
  
FS	
  4060-­‐L	
  SAS	
  ZONING	
  
!  SAS	
  zoning	
  allows	
  the	
  logical	
  parRRoning	
  of	
  FS	
  4060-­‐L	
  enclosure	
  into	
  two	
  30	
  
disk	
  enclosures	
  in	
  4U	
  
‒  Provides	
  fully	
  independent	
  	
  end	
  to	
  end	
  path	
  to	
  all	
  30	
  drives	
  in	
  each	
  zone	
  
‒  Up	
  to	
  2	
  S-­‐cards	
  connected	
  to	
  a	
  storage	
  enclosure	
  

SM15000	
  S-­‐card	
  

SM15000	
  S-­‐card	
  

Zone	
  1	
  

FS	
  4060-­‐L	
  

Zone	
  2	
  
FS	
  4060-­‐L	
  with	
  SAS	
  zoning	
  

19	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  
DISCLAIMER	
  &	
  ATTRIBUTION	
  
The	
  informaRon	
  presented	
  in	
  this	
  document	
  is	
  for	
  informaRonal	
  purposes	
  only	
  and	
  may	
  contain	
  technical	
  inaccuracies,	
  omissions	
  and	
  
typographical	
  errors.	
  
	
  
The	
  informaRon	
  contained	
  herein	
  is	
  subject	
  to	
  change	
  and	
  may	
  be	
  rendered	
  inaccurate	
  for	
  many	
  reasons,	
  including	
  but	
  not	
  limited	
  to	
  
product	
  and	
  roadmap	
  changes,	
  component	
  and	
  motherboard	
  version	
  changes,	
  new	
  model	
  and/or	
  product	
  releases,	
  product	
  differences	
  
between	
  differing	
  manufacturers,	
  so{ware	
  changes,	
  BIOS	
  flashes,	
  firmware	
  upgrades,	
  or	
  the	
  like.	
  AMD	
  assumes	
  no	
  obligaRon	
  to	
  update	
  or	
  
otherwise	
  correct	
  or	
  revise	
  this	
  informaRon.	
  However,	
  AMD	
  reserves	
  the	
  right	
  to	
  revise	
  this	
  informaRon	
  and	
  to	
  make	
  changes	
  from	
  Rme	
  to	
  
Rme	
  to	
  the	
  content	
  hereof	
  without	
  obligaRon	
  of	
  AMD	
  to	
  noRfy	
  any	
  person	
  of	
  such	
  revisions	
  or	
  changes.	
  
	
  
AMD	
  MAKES	
  NO	
  REPRESENTATIONS	
  OR	
  WARRANTIES	
  WITH	
  RESPECT	
  TO	
  THE	
  CONTENTS	
  HEREOF	
  AND	
  ASSUMES	
  NO	
  RESPONSIBILITY	
  FOR	
  
ANY	
  INACCURACIES,	
  ERRORS	
  OR	
  OMISSIONS	
  THAT	
  MAY	
  APPEAR	
  IN	
  THIS	
  INFORMATION.	
  
	
  
AMD	
  SPECIFICALLY	
  DISCLAIMS	
  ANY	
  IMPLIED	
  WARRANTIES	
  OF	
  MERCHANTABILITY	
  OR	
  FITNESS	
  FOR	
  ANY	
  PARTICULAR	
  PURPOSE.	
  IN	
  NO	
  
EVENT	
  WILL	
  AMD	
  BE	
  LIABLE	
  TO	
  ANY	
  PERSON	
  FOR	
  ANY	
  DIRECT,	
  INDIRECT,	
  SPECIAL	
  OR	
  OTHER	
  CONSEQUENTIAL	
  DAMAGES	
  ARISING	
  FROM	
  
THE	
  USE	
  OF	
  ANY	
  INFORMATION	
  CONTAINED	
  HEREIN,	
  EVEN	
  IF	
  AMD	
  IS	
  EXPRESSLY	
  ADVISED	
  OF	
  THE	
  POSSIBILITY	
  OF	
  SUCH	
  DAMAGES.	
  
	
  
ATTRIBUTION	
  
©	
  2013	
  Advanced	
  Micro	
  Devices,	
  Inc.	
  All	
  rights	
  reserved.	
  AMD,	
  the	
  AMD	
  Arrow	
  logo,	
  AMD	
  Opteron,	
  Freedom	
  and	
  combinaRons	
  thereof	
  
are	
  trademarks	
  of	
  Advanced	
  Micro	
  Devices,	
  Inc.	
  in	
  the	
  United	
  States	
  and/or	
  other	
  jurisdicRons..	
  Other	
  names	
  are	
  for	
  informaRonal	
  
purposes	
  only	
  and	
  may	
  be	
  trademarks	
  of	
  their	
  respecRve	
  owners.	
  

20	
   |	
  	
  	
  RIGHT	
  SIZING	
  HADOOP	
  DEPLOYMENTS	
  	
  

Más contenido relacionado

La actualidad más candente

CC-4001, Aparapi and HSA: Easing the developer path to APU/GPU accelerated Ja...
CC-4001, Aparapi and HSA: Easing the developer path to APU/GPU accelerated Ja...CC-4001, Aparapi and HSA: Easing the developer path to APU/GPU accelerated Ja...
CC-4001, Aparapi and HSA: Easing the developer path to APU/GPU accelerated Ja...AMD Developer Central
 
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...AMD Developer Central
 
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...AMD Developer Central
 
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey Pavlenko
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey PavlenkoMM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey Pavlenko
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey PavlenkoAMD Developer Central
 
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...AMD Developer Central
 
PG-4039, RapidFire API, by Dmitry Kozlov
PG-4039, RapidFire API, by Dmitry KozlovPG-4039, RapidFire API, by Dmitry Kozlov
PG-4039, RapidFire API, by Dmitry KozlovAMD Developer Central
 
Open compute technology
Open compute technologyOpen compute technology
Open compute technologyAMD
 
GS-4136, Optimizing Game Development using AMD’s GPU PerfStudio 2, by Gordon ...
GS-4136, Optimizing Game Development using AMD’s GPU PerfStudio 2, by Gordon ...GS-4136, Optimizing Game Development using AMD’s GPU PerfStudio 2, by Gordon ...
GS-4136, Optimizing Game Development using AMD’s GPU PerfStudio 2, by Gordon ...AMD Developer Central
 
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon SelleyPT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon SelleyAMD Developer Central
 
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28AMD and the new “Zen” High Performance x86 Core at Hot Chips 28
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28AMD
 
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APU
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APUDelivering a new level of visual performance in an SoC AMD "Raven Ridge" APU
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APUAMD
 
PL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor Miller
PL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor MillerPL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor Miller
PL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor MillerAMD Developer Central
 
ISCA 2014 | Heterogeneous System Architecture (HSA): Architecture and Algorit...
ISCA 2014 | Heterogeneous System Architecture (HSA): Architecture and Algorit...ISCA 2014 | Heterogeneous System Architecture (HSA): Architecture and Algorit...
ISCA 2014 | Heterogeneous System Architecture (HSA): Architecture and Algorit...HSA Foundation
 
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian Ballantyne
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian BallantyneWT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian Ballantyne
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian BallantyneAMD Developer Central
 
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla MahGS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla MahAMD Developer Central
 
AMD EPYC 7002 World Records
AMD EPYC 7002 World RecordsAMD EPYC 7002 World Records
AMD EPYC 7002 World RecordsAMD
 
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...AMD
 
GS-4139, RapidFire for Cloud Gaming, by Dmitry Kozlov
GS-4139, RapidFire for Cloud Gaming, by Dmitry KozlovGS-4139, RapidFire for Cloud Gaming, by Dmitry Kozlov
GS-4139, RapidFire for Cloud Gaming, by Dmitry KozlovAMD Developer Central
 

La actualidad más candente (20)

CC-4001, Aparapi and HSA: Easing the developer path to APU/GPU accelerated Ja...
CC-4001, Aparapi and HSA: Easing the developer path to APU/GPU accelerated Ja...CC-4001, Aparapi and HSA: Easing the developer path to APU/GPU accelerated Ja...
CC-4001, Aparapi and HSA: Easing the developer path to APU/GPU accelerated Ja...
 
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...
CE-4030, Optimizing Photo Editing Application with HSA Technology, by Stanley...
 
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...
CC-4006, Deliver Hardware Accelerated Applications Using RemoteFX vGPU with W...
 
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey Pavlenko
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey PavlenkoMM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey Pavlenko
MM-4097, OpenCV-CL, by Harris Gasparakis, Vadim Pisarevsky and Andrey Pavlenko
 
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...
MM-4092, Optimizing FFMPEG and Handbrake Using OpenCL and Other AMD HW Capabi...
 
PG-4039, RapidFire API, by Dmitry Kozlov
PG-4039, RapidFire API, by Dmitry KozlovPG-4039, RapidFire API, by Dmitry Kozlov
PG-4039, RapidFire API, by Dmitry Kozlov
 
Open compute technology
Open compute technologyOpen compute technology
Open compute technology
 
GS-4136, Optimizing Game Development using AMD’s GPU PerfStudio 2, by Gordon ...
GS-4136, Optimizing Game Development using AMD’s GPU PerfStudio 2, by Gordon ...GS-4136, Optimizing Game Development using AMD’s GPU PerfStudio 2, by Gordon ...
GS-4136, Optimizing Game Development using AMD’s GPU PerfStudio 2, by Gordon ...
 
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon SelleyPT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
 
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28AMD and the new “Zen” High Performance x86 Core at Hot Chips 28
AMD and the new “Zen” High Performance x86 Core at Hot Chips 28
 
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APU
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APUDelivering a new level of visual performance in an SoC AMD "Raven Ridge" APU
Delivering a new level of visual performance in an SoC AMD "Raven Ridge" APU
 
PL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor Miller
PL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor MillerPL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor Miller
PL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor Miller
 
ISCA 2014 | Heterogeneous System Architecture (HSA): Architecture and Algorit...
ISCA 2014 | Heterogeneous System Architecture (HSA): Architecture and Algorit...ISCA 2014 | Heterogeneous System Architecture (HSA): Architecture and Algorit...
ISCA 2014 | Heterogeneous System Architecture (HSA): Architecture and Algorit...
 
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian Ballantyne
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian BallantyneWT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian Ballantyne
WT-4066, The Making of Turbulenz’ Polycraft WebGL Benchmark, by Ian Ballantyne
 
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla MahGS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
GS-4106 The AMD GCN Architecture - A Crash Course, by Layla Mah
 
AMD EPYC 7002 World Records
AMD EPYC 7002 World RecordsAMD EPYC 7002 World Records
AMD EPYC 7002 World Records
 
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...
AMD 2014 A Series and Performance Mobile Accelerated Processing Units (Codena...
 
AMD It's Time to ROC
AMD It's Time to ROCAMD It's Time to ROC
AMD It's Time to ROC
 
GS-4139, RapidFire for Cloud Gaming, by Dmitry Kozlov
GS-4139, RapidFire for Cloud Gaming, by Dmitry KozlovGS-4139, RapidFire for Cloud Gaming, by Dmitry Kozlov
GS-4139, RapidFire for Cloud Gaming, by Dmitry Kozlov
 
HSA Introduction
HSA IntroductionHSA Introduction
HSA Introduction
 

Similar a CC-4009, "Optimizing Hadoop Deployments with SeaMicro SM15000" by Satheesh Nanniyur and Anil Rao

S016827 pendulum-swings-nola-v1710d
S016827 pendulum-swings-nola-v1710dS016827 pendulum-swings-nola-v1710d
S016827 pendulum-swings-nola-v1710dTony Pearson
 
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...Red_Hat_Storage
 
3PAR and VMWare
3PAR and VMWare3PAR and VMWare
3PAR and VMWarevmug
 
IBM Power9 Features and Specifications
IBM Power9 Features and SpecificationsIBM Power9 Features and Specifications
IBM Power9 Features and Specificationsinside-BigData.com
 
JetStor NAS ZFS based 716U 724U Network Attached Storage
JetStor NAS ZFS based 716U 724U Network Attached StorageJetStor NAS ZFS based 716U 724U Network Attached Storage
JetStor NAS ZFS based 716U 724U Network Attached StorageGene Leyzarovich
 
PAS 8 Datasheet
PAS 8 DatasheetPAS 8 Datasheet
PAS 8 DatasheetPanasas
 
HP Storage: Delivering Storage without Boundaries
HP Storage: Delivering Storage without BoundariesHP Storage: Delivering Storage without Boundaries
HP Storage: Delivering Storage without Boundariesjameshub12
 
Panasas ActiveStor 11 and 12: Parallel NAS Appliance for HPC Workloads
Panasas ActiveStor 11 and 12: Parallel NAS Appliance for HPC WorkloadsPanasas ActiveStor 11 and 12: Parallel NAS Appliance for HPC Workloads
Panasas ActiveStor 11 and 12: Parallel NAS Appliance for HPC WorkloadsPanasas
 
Panasas ActiveStor Parallel Storage
Panasas ActiveStor Parallel StoragePanasas ActiveStor Parallel Storage
Panasas ActiveStor Parallel StoragePanasas
 
DDN: Massively-Scalable Platforms and Solutions Engineered for the Big Data a...
DDN: Massively-Scalable Platforms and Solutions Engineered for the Big Data a...DDN: Massively-Scalable Platforms and Solutions Engineered for the Big Data a...
DDN: Massively-Scalable Platforms and Solutions Engineered for the Big Data a...inside-BigData.com
 
JetStor 780JH/JHD JBOD CLOUD BIG DATA HADOOP
JetStor 780JH/JHD JBOD CLOUD BIG DATA HADOOPJetStor 780JH/JHD JBOD CLOUD BIG DATA HADOOP
JetStor 780JH/JHD JBOD CLOUD BIG DATA HADOOPGene Leyzarovich
 
Optimizing the Upstreaming Workflow: Flexibly Scale Storage for Seismic Proce...
Optimizing the Upstreaming Workflow: Flexibly Scale Storage for Seismic Proce...Optimizing the Upstreaming Workflow: Flexibly Scale Storage for Seismic Proce...
Optimizing the Upstreaming Workflow: Flexibly Scale Storage for Seismic Proce...Avere Systems
 
Storage and performance, Whiptail
Storage and performance, Whiptail Storage and performance, Whiptail
Storage and performance, Whiptail Internet World
 
JetStor high density raid series 42bay 64bay units
JetStor high density raid series 42bay 64bay unitsJetStor high density raid series 42bay 64bay units
JetStor high density raid series 42bay 64bay unitsGene Leyzarovich
 

Similar a CC-4009, "Optimizing Hadoop Deployments with SeaMicro SM15000" by Satheesh Nanniyur and Anil Rao (20)

S016827 pendulum-swings-nola-v1710d
S016827 pendulum-swings-nola-v1710dS016827 pendulum-swings-nola-v1710d
S016827 pendulum-swings-nola-v1710d
 
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
Red Hat Storage Day Seattle: Supermicro Solutions for Red Hat Ceph and Red Ha...
 
3PAR and VMWare
3PAR and VMWare3PAR and VMWare
3PAR and VMWare
 
IBM Power9 Features and Specifications
IBM Power9 Features and SpecificationsIBM Power9 Features and Specifications
IBM Power9 Features and Specifications
 
JetStor NAS ZFS based 716U 724U Network Attached Storage
JetStor NAS ZFS based 716U 724U Network Attached StorageJetStor NAS ZFS based 716U 724U Network Attached Storage
JetStor NAS ZFS based 716U 724U Network Attached Storage
 
@IBM Power roadmap 8
@IBM Power roadmap 8 @IBM Power roadmap 8
@IBM Power roadmap 8
 
Qnap NAS TS Serie x53u-catalogo
Qnap NAS TS Serie x53u-catalogoQnap NAS TS Serie x53u-catalogo
Qnap NAS TS Serie x53u-catalogo
 
PAS 8 Datasheet
PAS 8 DatasheetPAS 8 Datasheet
PAS 8 Datasheet
 
HP Storage: Delivering Storage without Boundaries
HP Storage: Delivering Storage without BoundariesHP Storage: Delivering Storage without Boundaries
HP Storage: Delivering Storage without Boundaries
 
Panasas ActiveStor 11 and 12: Parallel NAS Appliance for HPC Workloads
Panasas ActiveStor 11 and 12: Parallel NAS Appliance for HPC WorkloadsPanasas ActiveStor 11 and 12: Parallel NAS Appliance for HPC Workloads
Panasas ActiveStor 11 and 12: Parallel NAS Appliance for HPC Workloads
 
Panasas ActiveStor Parallel Storage
Panasas ActiveStor Parallel StoragePanasas ActiveStor Parallel Storage
Panasas ActiveStor Parallel Storage
 
DDN: Massively-Scalable Platforms and Solutions Engineered for the Big Data a...
DDN: Massively-Scalable Platforms and Solutions Engineered for the Big Data a...DDN: Massively-Scalable Platforms and Solutions Engineered for the Big Data a...
DDN: Massively-Scalable Platforms and Solutions Engineered for the Big Data a...
 
San Presentation
San PresentationSan Presentation
San Presentation
 
JetStor 780JH/JHD JBOD CLOUD BIG DATA HADOOP
JetStor 780JH/JHD JBOD CLOUD BIG DATA HADOOPJetStor 780JH/JHD JBOD CLOUD BIG DATA HADOOP
JetStor 780JH/JHD JBOD CLOUD BIG DATA HADOOP
 
Optimizing the Upstreaming Workflow: Flexibly Scale Storage for Seismic Proce...
Optimizing the Upstreaming Workflow: Flexibly Scale Storage for Seismic Proce...Optimizing the Upstreaming Workflow: Flexibly Scale Storage for Seismic Proce...
Optimizing the Upstreaming Workflow: Flexibly Scale Storage for Seismic Proce...
 
Storage and performance, Whiptail
Storage and performance, Whiptail Storage and performance, Whiptail
Storage and performance, Whiptail
 
IBM System Storage DCS9900 Data Sheet
IBM System Storage DCS9900 Data SheetIBM System Storage DCS9900 Data Sheet
IBM System Storage DCS9900 Data Sheet
 
JetStor high density raid series 42bay 64bay units
JetStor high density raid series 42bay 64bay unitsJetStor high density raid series 42bay 64bay units
JetStor high density raid series 42bay 64bay units
 
V L S
V L SV L S
V L S
 
Qnap event v1.6
Qnap   event v1.6Qnap   event v1.6
Qnap event v1.6
 

Más de AMD Developer Central

DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIsDX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIsAMD Developer Central
 
Leverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math LibrariesLeverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math LibrariesAMD Developer Central
 
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware WebinarAn Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware WebinarAMD Developer Central
 
Webinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceWebinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceAMD Developer Central
 
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...AMD Developer Central
 
TressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozTressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozAMD Developer Central
 
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellRendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellAMD Developer Central
 
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonLow-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonAMD Developer Central
 
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornDirect3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornAMD Developer Central
 
Introduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevIntroduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevAMD Developer Central
 
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasHoly smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasAMD Developer Central
 
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...AMD Developer Central
 
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14AMD Developer Central
 
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14AMD Developer Central
 

Más de AMD Developer Central (20)

DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIsDX12 & Vulkan: Dawn of a New Generation of Graphics APIs
DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
 
Leverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math LibrariesLeverage the Speed of OpenCL™ with AMD Math Libraries
Leverage the Speed of OpenCL™ with AMD Math Libraries
 
Introduction to Node.js
Introduction to Node.jsIntroduction to Node.js
Introduction to Node.js
 
Media SDK Webinar 2014
Media SDK Webinar 2014Media SDK Webinar 2014
Media SDK Webinar 2014
 
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware WebinarAn Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
An Introduction to OpenCL™ Programming with AMD GPUs - AMD & Acceleware Webinar
 
DirectGMA on AMD’S FirePro™ GPUS
DirectGMA on AMD’S  FirePro™ GPUSDirectGMA on AMD’S  FirePro™ GPUS
DirectGMA on AMD’S FirePro™ GPUS
 
Webinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop IntelligenceWebinar: Whats New in Java 8 with Develop Intelligence
Webinar: Whats New in Java 8 with Develop Intelligence
 
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
The Small Batch (and other) solutions in Mantle API, by Guennadi Riguer, Mant...
 
Inside XBox- One, by Martin Fuller
Inside XBox- One, by Martin FullerInside XBox- One, by Martin Fuller
Inside XBox- One, by Martin Fuller
 
TressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas ThibierozTressFX The Fast and The Furry by Nicolas Thibieroz
TressFX The Fast and The Furry by Nicolas Thibieroz
 
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnellRendering Battlefield 4 with Mantle by Yuriy ODonnell
Rendering Battlefield 4 with Mantle by Yuriy ODonnell
 
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil PerssonLow-level Shader Optimization for Next-Gen and DX11 by Emil Persson
Low-level Shader Optimization for Next-Gen and DX11 by Emil Persson
 
Gcn performance ftw by stephan hodes
Gcn performance ftw by stephan hodesGcn performance ftw by stephan hodes
Gcn performance ftw by stephan hodes
 
Inside XBOX ONE by Martin Fuller
Inside XBOX ONE by Martin FullerInside XBOX ONE by Martin Fuller
Inside XBOX ONE by Martin Fuller
 
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave OldcornDirect3D12 and the Future of Graphics APIs by Dave Oldcorn
Direct3D12 and the Future of Graphics APIs by Dave Oldcorn
 
Introduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan NevraevIntroduction to Direct 3D 12 by Ivan Nevraev
Introduction to Direct 3D 12 by Ivan Nevraev
 
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth ThomasHoly smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
Holy smoke! Faster Particle Rendering using Direct Compute by Gareth Thomas
 
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...Productive OpenCL Programming An Introduction to OpenCL Libraries  with Array...
Productive OpenCL Programming An Introduction to OpenCL Libraries with Array...
 
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
Rendering Battlefield 4 with Mantle by Johan Andersson - AMD at GDC14
 
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14
RapidFire - the Easy Route to low Latency Cloud Gaming Solutions - AMD at GDC14
 

Último

Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 

Último (20)

Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 

CC-4009, "Optimizing Hadoop Deployments with SeaMicro SM15000" by Satheesh Nanniyur and Anil Rao

  • 1. OPTIMIZED  HADOOP  DEPLOYMENTS  WITH   SEAMICRO  SM15000   PRESENTED  AT  AMD  DEVELOPER  SUMMIT,  NOV  2013   SATHEESH  NANNIYUR  
  • 2. BIG  DATA  IS  A  STRATEGIC  DECISION:  CAPEX  AND  OPEX   Devices   2   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     Apps   Cloud  
  • 3. HIGHER  PERFORMANCE,  LESS  POWER  AND  SPACE   Hadoop  Technology  Stack   Data   Warehouse   Data  AnalyRcs   Management   Data  Access   Data  Processing   Data  Storage   3   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS    
  • 4. SEAMICRO  SM15000™  ACCELERATES  APACHE™  HADOOP™   DEPLOYMENTS   !  Superior  high  availability   ‒  AcRve/standby  NameNode   ‒  AcRve/standby  JobTracker   ‒  Highly  resilient  fabric  for  inter-­‐node   east-­‐west  traffic   !  Reduced  down  Rme   ‒  Remap  or  rezone  disks  to  recover   data   ‒  Hot-­‐swappable  upgrades  or   component  replacements   !  Hardware  redundancy   ‒  Power  supplies   ‒  Network  I/O   4   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS    
  • 5. SM15000  OVERVIEW   64  HDDs/SDDs   •  Share  drives  across  all  servers   •  Assign  one  server  to  one  or  more  drives  as  needed   •  In  service  upgrades  as  needed   64  Industry  standard  x86  servers   •  AMD  Opteron™,  Intel  Xeon®,  Atom™   •  Energy  efficient  processor   •  20  Gbps  per  socket,  16X  tradiRonal  servers   960  terabytes  Fabric  Storage   •  Extends  supercompute  fabric  to  external  storage   •  Up  to  3.84  PB  storage  capacity;  up  to  960  3.5”  SAS/ SATA  drives   •  Map  to  any  CPU—same  as  internal  drives   5   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     160  Gbps  Network  I/O   •  Share  network  I/O  across  all  servers   •  Eliminate  TOR  switch   •  Minimize  cabling   •  In  service  upgrades  as  needed  
  • 6. SEAMICRO  FREEDOM™  FABRIC  ASIC  PROVIDES  MASSIVE   PERFORMANCE,  REDUCES  POWER  AND  SPACE     B E N E F I T S   Freedom™   SeaMicro IOVT TIO Freedom Supercompute Fabric 6   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     Eliminates 90% of the components on a motherboard shrinking power used, cost and space Reduces the power used by any CPU by consolidating and shutting off unused functionality Provides massive bandwidth while eliminating power hungry top of rack switches
  • 7. FS  4060-­‐L  FABRIC  STORAGE  ENCLOSURE  WITH  ZONING   CAPABILITY   !  High  density,  power  opRmized  4U  enclosure  with  60  3.5”  drives   !  Up  to  16  enclosures  per  SM15000,  960  drives,  and  3.84  PB   storage  capacity   !  Redundant  controllers,  ports,  fans,  and  PSUs   !  Support  cost  opRmized  24x7  operaRons  SATA  HDD  for  high   density  Big  Data  and  Object  Storage  deployments   !  OpRonal  configuraRon  to  logically  parRRon  an  enclosure  into   two  30  3.5”  drive  enclosures   !  Balanced  disk  to  core  raRo  (1:1)  for  opRmizing  Hadoop   performance   !  Field  configurable  to  provide  utmost    flexibility  to  balance   density  and  performance   7   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS    
  • 8. FREEDOM  FABRIC  DISAGGREGATES  SERVER  RESOURCES   PROVIDES  FLEXIBILITY  FOR  EXPANSION  AND  INFRASTRUCTURE  OPTIMIZATION   !  SM15000  provides  independent  scaling  of  Compute,  Storage,  and  Network   !  Centrally  managed  provisioning  of  storage  and  network  resources  to  compute   nodes  enabled  by  CLI  and  API  interfaces   SeaMicro  SM15000  Server   Hadoop  OpRmizaRon   Compute  and  Memory   Pool   CPU   CPU   CPU   CPU   Deploy   CPU   CPU   Fabric  Interconnect   Shared  Storage   Pool   8   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     Network   Pool   Cost  and   Performance   Tune  and   OpWmize   Run  and   Analyze  
  • 9. SM15000  FLEXIBLE  STORAGE  ALLOWS  ITERATIVELY   OPTIMIZING  APACHE®  HADOOP™  DEPLOYMENT   !  Flexible  shared  storage  with  commodity  hardware  enabled  by  SeaMicro  fabric   technology   !  Decoupled  from  Compute  and  Network  to  grow  storage  independently   !  IteraRvely  opRmize  Hadoop  disk  to  core  raRo  as  applicaRon  needs  evolve   Captive DAS with Rigid Storage to Compute Ratio Flexible scale-out Fabric Storage up to 5PB Freedom  Fabric   Traditional Rackmount 9   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     Intel  /AMD  X86   servers  
  • 10. APACHE®  HADOOP™  DEPLOYMENT  ON  THE  SEAMICRO   SM15000™   SM15000   ZooKeeper   HDFS   NameNode   ZooKeeper   MapReduce   JobTracker   MapReduce   JobTracker   Up  to  160  Gb/s   Redundant  NameNode  and   JobTracker   bandwidth  for  data   ingesWon   10   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     DataNode/   TaskTracker   DataNode/   TaskTracker   DataNode/   TaskTracker   DataNode/   TaskTracker   HDFS   NameNode   DataNode/   TaskTracker   DataNode/   TaskTracker   10Gb/s  network   bandwidth/node   DataNode/   TaskTracker   DataNode/   TaskTracker   60  DataNode/  TaskTracker   Nodes       •  8GB/s  storage  bandwidth   •  Flexible  storage  capacity  
  • 11. SEAMICRO  REFERENCE  ARCHITECTURE  FOR  APACHE®   HADOOP™   !  Hadoop  HA  with  NameNode  and  JobTracker  AcRve/Standby  ConfiguraRon   !  Up  to  60  DataNode/TaskTracker  nodes,  512  x86  cores,  960  TB  raw  capacity,  10   Gb/s  Internode  bandwidth  and  160  Gb/s  uplink  bandwidth  in  28  RU  and  5.8    kW   SoluWon  Components   •  •  •  •  •  •  Highly  Available  AcRve/Standby  NameNode   Highly  Available  AcRve/Standby  JobTracker   DataNode/TaskTracker   SM15000  Internal  Drives  for  NameNode  and  JobTracker   11   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS       •  •  SeaMicro  SM15000  AMD  Opteron  or   Intel  Xeon  (Ivy  Bridge)  CPU  chassis   32  GB  memory  per  node   2  10GbE  Network  cards   8  Storage  Controller  cards   4  FS  4060-­‐L  enclosures  with  SAS   zoning  enabled   60  4TB  3.5”  SAS  or  SATA  drives  per   enclosure   Any  Hadoop  distribuRon  (CDH,  HDP,   MapR,  Apache  Hadoop  etc.)   ZooKeeper  for  NameNode  and   JobTracker  HA  
  • 12. SM15000  HADOOP  DEPLOYMENT   SoluWon  Components   SM15000  Intel   Xeon   SM15000   AMD  Opteron   Performance   OpRmized   Cost  OpRmized   Racks   <1   <1   Servers   64   64   Cores   256   512   DRAM   2TB   4  TB   240/960  TB   240/960  TB   Cable  Management   0  RU   0  RU   ToR  SwRches   None   None   Downlink  Network  Cables   None   None   60   60   10  Gb/s   10  Gb/s   Uplink  bandwidth   Up  to  160  Gb/s   Up  to  160  Gb/s   Storage  bandwidth   8  GigaBytes/s   8  GigaBytes/s   Use  case   Hard  Drives   Data  Nodes   Bandwidth  per  node   12   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS    
  • 13. SM15000  –  INDUSTRY’S  ONLY  FLEXIBLE  SYSTEM  FOR   OPTIMIZING  HADOOP  CLUSTERS   Storage   Intensive   Compute   Intensive   Network   Intensive   Compute   Intensive   Storage   Intensive   Map   Reduce   Map   Reduce   Map   HDFS   Input   Up  to  512  x86   cores  with  4TB   DRAM  per  Fabric   Server  in  10RU   13   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     Map  and   Intermediate   Data  Write   Flexible  scale-­‐out   storage  with  over   1400  spindles  and   5  Petabytes  of   capacity   Shuffle   Reduce   10  Gpbs  Inter-­‐ Node  Bandwidth   per  server     HDFS   Output   160  Gbps    shared     uplink  for  Inter-­‐ Rack  traffic  
  • 14. SM15000  HADOOP  PERFORMANCE  BETTER  THAN   COMPETITIVE  OFFERINGS   !  77%  less  power  per  node   !  30%  less  power  per  core   !  63%  more  data  sorted  per  second  per  Wat  than  Large  Vendor   Large  Vendor   Terasort   CompleRon   7  min   13  seconds   8  min   33  seconds   Nodes   62  incl.  HA   18   248   216   5800  W   7200  W   MB/s  per  Wat   0.4   0.24   MB/s  per  CPU   core   9.3   9.0   Wats/Node   94  W   400  W   Wats/Core   23  W   33  W   SeaMicro  SM15000  with  64  Nodes  based  on     Intel  Xeon®  (Ivy  Bridge)  1265  L-­‐v2  CPU,  32  GB   memory,  and  2  3.5”  3TB  SAS  drives  per  node   CPU  Cores   Power   CompeRRve  soluRon  consists  of  dual  socket  2U  rack-­‐ mount  servers  with  Intel  Xeon  E5-­‐2667  2.9  GHz  octal   core  CPUs  with  64  GB  memory,  16  disks,  and  4  GbE   network  links  per  node   14   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     Terasort  -­‐  Sort  rate  per  Wa`   MB/s  per  Wa`   SM15000   0.4   0.35   0.3   0.25   0.2   0.15   0.1   0.05   0   SM15000   Large  Vendor  
  • 15. WAYFAIR.COM:  PERSONALIZED  SHOPPING  EXPERIENCE   FOR  “A  ZILLION  THINGS”   Applica'ons:  Apache®  Hadoop™,  SQL  server,   PHP   !  Challenge   ‒  Space  and  power  constraints  hindered  availability  of  “shared   nothing”  servers  for  development   ‒  Too  costly  in  space  and  power  to  use  tradiRonal  servers  for  the   number  of  servers  required  and  accurately  test  applicaRon   performance   !  SoluRon   ‒  SeaMicro  SM  high  density  server     ‒  256  Intel®  Xeon®  cores  in  10  RU  system   ‒  64  servers,  1.28  Tbps  SeaMicro  Freedom™  Supercompute  Fabric   !  Results   ‒  Reduced  development  cycles  and  shortened  Rme  to  market  for   new  products   ‒  Increased  producRvity  of  development  engineers  by  providing   abundant  access  to  “shared  nothing”  servers  versus  developing   on  virtualized  server  farms   ‒  Eliminated  unnecessary  equipment  such  as  top  of  rack  switches   and  terminal  servers;  simplified  network  and  power  cabling   15   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     “The  SeaMicro  SM  server  is  helping  us  operate  at  a  large  scale   and  fast  pace.    The  key  benefits  are  reduced  operaRng  costs  and   increased  efficiency  for  our  big  data  development  infrastructure.     It  provides  the  highest  density  and  flexibility  while  slashing   energy  consumpRon:  256  Intel  Xeon  cores,  64  hosts.    It   consumes  50  percent  less  power  and  doubled  our  compuRng   capacity...”     Ben  Clark,  Director  of  So@ware  Engineering  
  • 16. EHARMONY:  INCREASE  COMPUTING  WHILE     REDUCING  TOTAL  COST  OF  OWNERSHIP   Applica'ons:  Apache®  Hadoop™   !  Challenge   ‒  Provide  cost  effecRve  compuRng     plaxorm  for  Apache  Hadoop   ‒  Reduce  costs  incurred  from  external     cloud  compuRng   !  SoluRon   ‒  SeaMicro  SM10000-­‐64  high  density  server     ‒  512  Intel®  Atom™  cores  in  10  RU  system   !  Results   ‒  Reduce  TCO  by  more  than  74  percent   ‒  Save  thousands  per  month  spent  on     cloud  compuRng  service   ‒  URlize  compuRng  resources  7  x  24   16   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     “We  purchased  SeaMicro  servers  and  immediately   reduced  our  operaRng  expenses…The  system  has   been  in  place  for  over  two  years,  and  we  have  had   zero  down  Rme.”     Cormac  Twomey,  Data  Center  Opera'ons  
  • 17. AMD  SEAMICRO  PARTNERS  WITH  INDUSTRY  LEADERS   Hadoop   Partner 17   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS     OS/Hypervisor   OpenStack  
  • 19. FS  4060-­‐L  SAS  ZONING   !  SAS  zoning  allows  the  logical  parRRoning  of  FS  4060-­‐L  enclosure  into  two  30   disk  enclosures  in  4U   ‒  Provides  fully  independent    end  to  end  path  to  all  30  drives  in  each  zone   ‒  Up  to  2  S-­‐cards  connected  to  a  storage  enclosure   SM15000  S-­‐card   SM15000  S-­‐card   Zone  1   FS  4060-­‐L   Zone  2   FS  4060-­‐L  with  SAS  zoning   19   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS    
  • 20. DISCLAIMER  &  ATTRIBUTION   The  informaRon  presented  in  this  document  is  for  informaRonal  purposes  only  and  may  contain  technical  inaccuracies,  omissions  and   typographical  errors.     The  informaRon  contained  herein  is  subject  to  change  and  may  be  rendered  inaccurate  for  many  reasons,  including  but  not  limited  to   product  and  roadmap  changes,  component  and  motherboard  version  changes,  new  model  and/or  product  releases,  product  differences   between  differing  manufacturers,  so{ware  changes,  BIOS  flashes,  firmware  upgrades,  or  the  like.  AMD  assumes  no  obligaRon  to  update  or   otherwise  correct  or  revise  this  informaRon.  However,  AMD  reserves  the  right  to  revise  this  informaRon  and  to  make  changes  from  Rme  to   Rme  to  the  content  hereof  without  obligaRon  of  AMD  to  noRfy  any  person  of  such  revisions  or  changes.     AMD  MAKES  NO  REPRESENTATIONS  OR  WARRANTIES  WITH  RESPECT  TO  THE  CONTENTS  HEREOF  AND  ASSUMES  NO  RESPONSIBILITY  FOR   ANY  INACCURACIES,  ERRORS  OR  OMISSIONS  THAT  MAY  APPEAR  IN  THIS  INFORMATION.     AMD  SPECIFICALLY  DISCLAIMS  ANY  IMPLIED  WARRANTIES  OF  MERCHANTABILITY  OR  FITNESS  FOR  ANY  PARTICULAR  PURPOSE.  IN  NO   EVENT  WILL  AMD  BE  LIABLE  TO  ANY  PERSON  FOR  ANY  DIRECT,  INDIRECT,  SPECIAL  OR  OTHER  CONSEQUENTIAL  DAMAGES  ARISING  FROM   THE  USE  OF  ANY  INFORMATION  CONTAINED  HEREIN,  EVEN  IF  AMD  IS  EXPRESSLY  ADVISED  OF  THE  POSSIBILITY  OF  SUCH  DAMAGES.     ATTRIBUTION   ©  2013  Advanced  Micro  Devices,  Inc.  All  rights  reserved.  AMD,  the  AMD  Arrow  logo,  AMD  Opteron,  Freedom  and  combinaRons  thereof   are  trademarks  of  Advanced  Micro  Devices,  Inc.  in  the  United  States  and/or  other  jurisdicRons..  Other  names  are  for  informaRonal   purposes  only  and  may  be  trademarks  of  their  respecRve  owners.   20   |      RIGHT  SIZING  HADOOP  DEPLOYMENTS