
IT organizations can deploy a Hadoop cluster by buying an appliance, going to IaaS/PaaS public clouds, or building the solution themselves. Many major enterprise hardware vendors offer Big Data Hadoop appliances, as shown in the table below (Merv Adrian has more detail on the systems. Adrian, Arun Chandrasekaran, Svetlana Sicular and Marcus Collins are my go-to analyst for Big Data Hadoop questions):
| Vendor | Appliance Name |
|---|---|
| EMC | Greenplum Data Computing Appliance (DCA) |
| HP | AppSystem for Apache Hadoop |
| IBM | PureData System for Analytics |
| NetApp | Open Solution for Hadoop |
| Oracle | Big Data Appliance |
| Teradata | Aster Big Analytic Appliance |
These appliances provide an out-of-the-box solution for customers who want to do as little as possible to get Hadoop up and running. However, there will always be some organizations that prefer the DIY approach, the same way some leading-edge users set up HPC clusters themselves using Beowulf. These users will select standard servers for handling different Hadoop functions, which they then assemble into a complete Hadoop environment. Cloudera, a Big Data Hadoop provider, certifies hardware specifically for this purpose. Cloudera-certified servers include the Supermicro 815 and Supermicro 826, which are optimized for Hadoop Name Nodes (which should be memory-heavy) and Data Nodes (which are more storage-heavy), respectively. It is possible to use Gartner Tech Planner to find other systems that have processors and other specifications similar to the two mentioned Supermicro servers. The table of servers below shows some candidates:
| Suitable for Hadoop Name Node | Suitable for Hadoop Data Node |
|---|---|
| IBM – System x3550 M4 | IBM – System x3630 M4 |
| Dell – PowerEdge R620 | Dell – PowerEdge R520 |
| HP – ProLiant DL160 Gen8 | HP – ProLiant DL380e Gen8 |
| HP – ProLiant DL360 Gen8 |
The list is incomplete, and there are likely to be other vendors and server products that are well suited for deploying Hadoop. Please use the comment section if you know of any other servers that fit into these two groups.
Follow Chris Gaun on Twitter
Note: This is an individual analyst’s blog and not a piece of peer reviewed, actionable, Gartner research.
Category: Big Data Tags:

Chris Gaun




































































































1 response so far ↓
1 Selecting Hadoop Server Hardware For Big Data Workloads | Big Data Analytics March 11, 2013 at 12:53 pm
[...] on blogs.gartner.com 이것이 좋아요:좋아하기 가져오는 [...]
Leave a Comment