Lessons Learned Deploying Hadoop

So you’ve done your research and quickly realized that Hadoop is going to be at the very core of your company’s Big Data Platform, given that data storage and processing will never get more cost effective than open-source software running on commodity hardware. The next level down the rabbit hole has you in a quandary, though. Beyond Hadoop, you start running into technologies like Hive, Impala, Pig, Storm, YARN and other elements from Hadoop’s periodic table of technologies.

Read more

Big Data Toolset Smackdown

If you’ve spent more than five minutes researching big data, you’ve already come across technologies like Hadoop, BigQuery, EMR and RedShift, and you may even know what OLAP and NoSQL means and where they fit in. What you may not have been exposed to, however, is the next and very important level down the rabbit hole—technologies like Tableu, Microstrategy, QlikView, Hive and Impala. Let’s take a look at each of these technologies and see where they fit and how they compare to each other.

Read more