IT organizations can deploy a Hadoop cluster by buying an appliance, going to IaaS/PaaS public clouds, or building the solution themselves. Many major enterprise hardware vendors offer Big Data Hadoop appliances, as shown in the table below (Merv Adrian has more detail on the systems. Adrian, Arun Chandrasekaran, Svetlana Sicular and Marcus Collins are my go-to analyst for Big Data Hadoop questions):
|EMC||Greenplum Data Computing Appliance (DCA)|
|HP||AppSystem for Apache Hadoop|
|IBM||PureData System for Analytics|
|NetApp||Open Solution for Hadoop|
|Oracle||Big Data Appliance|
|Teradata||Aster Big Analytic Appliance|
These appliances provide an out-of-the-box solution for customers who want to do as little as possible to get Hadoop up and running. However, there will always be some organizations that prefer the DIY approach, the same way some leading-edge users set up HPC clusters themselves using Beowulf. These users will select standard servers for handling different Hadoop functions, which they then assemble into a complete Hadoop environment. Cloudera, a Big Data Hadoop provider, certifies hardware specifically for this purpose. Cloudera-certified servers include the Supermicro 815 and Supermicro 826, which are optimized for Hadoop Name Nodes (which should be memory-heavy) and Data Nodes (which are more storage-heavy), respectively. It is possible to use Gartner Tech Planner to find other systems that have processors and other specifications similar to the two mentioned Supermicro servers. The table of servers below shows some candidates:
|Suitable for Hadoop Name Node||Suitable for Hadoop Data Node|
|IBM – System x3550 M4||IBM – System x3630 M4|
|Dell – PowerEdge R620||Dell – PowerEdge R520|
|HP – ProLiant DL160 Gen8||HP – ProLiant DL380e Gen8|
|HP – ProLiant DL360 Gen8|
The list is incomplete, and there are likely to be other vendors and server products that are well suited for deploying Hadoop. Please use the comment section if you know of any other servers that fit into these two groups.
Follow Chris Gaun on Twitter
Note: This is an individual analyst’s blog and not a piece of peer reviewed, actionable, Gartner research.
Read Complimentary Relevant Research
Organizing for Big Data Through Better Process and Governance
With big data past the Peak of Inflated Expectations on the Hype Cycle, organizations are addressing next-level challenges and asking,...
View Relevant Webinars
What Big Data Means Today and How to Position Effectively
Gartner's original prediction that the term "Big Data" would become meaningless by 2020 was actually a bit off its largely useless already...
Comments or opinions expressed on this blog are those of the individual contributors only, and do not necessarily represent the views of Gartner, Inc. or its management. Readers may copy and redistribute blog postings on other blogs, or otherwise for private, non-commercial or journalistic purposes, with attribution to Gartner. This content may not be used for any other purposes in any other formats or media. The content on this blog is provided on an "as-is" basis. Gartner shall not be liable for any damages whatsoever arising out of the content or use of this blog.