
Hadoop FAQs - April Webinar Q&A
by Merv Adrian | April 16, 2017
Nick Heudecker and I received numerous questions during our April Hadoop webinar with several hundred attendees, and we have summarized and answered them below.
How can Hadoop-Spark interface with conventional RDBMS such as Oracle/UDB/Teradata...

Hadoop Tracker - March 2017
by Merv Adrian | March 16, 2017
Stack expansion has ground to a halt. The last time an Apache project was added to the list of those most supported by leading Hadoop distribution vendors was July 2016, when...

Hadoop Project Commercial Support Tracker July 2016
by Merv Adrian | July 30, 2016
There are now 15 projects supported by all 5 distributors I track, and several have had new releases since April. Kafka is the newest addition, and I believe the remaining 4-supporter offerings, Mahout...

Hadoop Apache Project Commercial Support Tracker April 2016
by Merv Adrian | April 27, 2016
There are now 20 commonly supported projects: Avro, Flume and Solr join the group supported by all 5 distributors and other changes appear as well.
For this version of the...

Supported Hadoop Stack Continues Expansion
by Merv Adrian | December 24, 2015
For the past year and a half I've been tracking the path from 6 broadly supported (4 or more distributors) "Hadoop" projects in 2012 to 15 in June 2014, and now 17 in December...

Now, What Is Hadoop? And What's Supported?
by Merv Adrian | July 2, 2015
Updated August 11, 2015
This perennial question resurfaced recently in a thoughtful blog post by Andreas Neumann, Chief Architect of Cask, called What is Hadoop, anyway?. Ultimately, after a careful deconstruction of...

Hadoop Is A Recursive Acronym
by Merv Adrian | October 13, 2014
Hopefully, that title got your attention. A recursive acronym - the term first appeared in the book Gödel, Escher, Bach: An Eternal Golden Braid and is likely more familiar to tech folks...

Hadoop is in the Mind of the Beholder
by Merv Adrian | March 24, 2014
This post was jointly authored by Merv Adrian (@merv) and Nick Heudecker (@nheudecker) and appears on both blogs.
In the early days of Hadoop (versions up through 1.x), the project...

Hadoop Summit Recap Part Two - SELECT FROM hdfs WHERE bigdatavendor USING SQL
by Merv Adrian | July 15, 2013
Probably the most widespread, and commercially imminent, theme at the Summit was “SQL on Hadoop.” Since last year, many offerings have been touted, debated, and some have even shipped. In...

Hadoop 2013 - Part Two: Projects
by Merv Adrian | February 21, 2013
In Part One of this series, I pointed out that how significant attention is being lavished on performance in 2013. In this installment, the topic is projects, which are proliferating precipitously. One...