Entries Categorized as 'Apache Yarn'
by Merv Adrian | September 6, 2013 | 12 Comments
Many things have changed in the software industry in an era when the use of open source software has pervaded the mainstream IT shop. One of them is the significance – and descriptive adequacy – of the word “proprietary.” Merriam-Webster defines it as “something that is used, produced, or marketed under exclusive legal right of [...]
Category: Apache Apache Yarn Big Data BigInsights Cassandra Cloudera Hadoop Hbase IBM MapR MapReduce open source OSS Pig YARN Tags: Apache, big data, BigInsights, Cassandra, Cloudera, Datastax, Hadapt, Hadoop, Hbase, HDFS, IBM, open source, OSS, Pig, Yarn
by Merv Adrian | July 15, 2013 | 10 Comments
Probably the most widespread, and commercially imminent, theme at the Summit was “SQL on Hadoop.” Since last year, many offerings have been touted, debated, and some have even shipped. In this post, I offer a brief look at where things stood at the Summit and how we got there. To net it out: offerings today [...]
Category: Apache Apache Drill Apache Yarn Aster Big Data Cloudera data warehouse DBMS Gartner Hadapt Hadoop HCatalog HDFS Hive Hortonworks IBM MapR MapReduce Microsoft Netezza Oozie Oracle Rainstor RDBMS Real-time SQL Server Sqoop Teradata YARN Tags: Apache, Aster, big data, BigSQL, CDH, Cloudera, data warehouse, DB2, Drill, EMC, ETL, Greenplum, Hadapt, Hadoop, HAWQ, Hbase, HCatalog, HDFS, Hive, Hortonworks, HP, Impala, Isilon, Kognitio, MapR, MapReduce, MPP, MySQL, OneFS, Oracle, Paraccel, Platfora, Polybase, Postgres, Rainstor, SQL, Sqoop, Stinger, Teradata, Tez, Vertica
by Merv Adrian | July 10, 2013 | 7 Comments
I had the privilege of keynoting this year’s Hadoop Summit, so I may be a bit prejudiced when I say the event confirmed my assertion that we have arrived at a turning point in Hadoop’s maturation. The large number of attendees (2500, a solid increase – and more “suits”) and sponsors (70, also a significant uptick) made [...]
Category: Apache Apache Yarn Big Data Cloudera Gartner graph databases Hadoop HDFS Hortonworks IBM Intel MapR MapReduce Storm Yahoo! YARN Tags: Apache, big data, Cloudera, Hadoop, HDFS, Hortonworks, IBM, MapR, MapReduce, Yahoo!
by Merv Adrian | February 21, 2013 | 1 Comment
In Part One of this series, I pointed out that how significant attention is being lavished on performance in 2013. In this installment, the topic is projects, which are proliferating precipitously. One of my most frequent client inquiries is “which of these pieces make Hadoop?” As recently as a year ago, the question was pretty simple for [...]
Category: Accumulo Ambari Apache Apache Drill Apache Yarn BigInsights Cassandra Cloudera Dataguise EMC Gartner Giraph graph databases Hadapt Hadoop Hbase HCatalog HDFS Hive Hortonworks Hstreaming IBM InfoSphere Lucense MapReduce Mshout Oozie open source Pig Rainstor Serengeti Solr SQLstream Sqoop VMware Zookeeper Tags: Apache, BigInsights, Cassandra, Cloudera, Flume, Hadapt, Hadoop, Hbase, HDFS, Hive, Hortonworks, Hstreaming, IBM, InfoSphere, MapR, MapReduce, Oozie, Pig, SQLStream, Sqoop, zookeeper