Tuesday, January 27, 2015

Hive : SORT BY vs ORDER BY vs DISTRIBUTE BY vs CLUSTER BY

SORT BY: Hive uses the columns in SORT BY to sort the rows before feeding them to a reducer. The sort order depends on the column types. If the column is of numeric type, the sort order is numeric. If the column is of string type, then the …
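The difference between a per-reducer sort and a total sort, and between numeric and string ordering, can be sketched in a few lines of plain Python. This is only an illustration, not Hive code: the function names `sort_by` and `order_by` are hypothetical, and the round-robin split of rows across reducers is an assumption made for the example (Hive decides the actual distribution itself).

```python
def sort_by(rows, num_reducers):
    """Simulate SORT BY: each reducer sorts only the rows it received."""
    # Assumption for illustration: rows are dealt round-robin to reducers.
    partitions = [rows[i::num_reducers] for i in range(num_reducers)]
    return [sorted(partition) for partition in partitions]


def order_by(rows):
    """Simulate ORDER BY: one total ordering over all rows."""
    return sorted(rows)


numeric = [10, 9, 2, 33, 1, 100]
as_strings = [str(n) for n in numeric]

numeric_sorted = order_by(numeric)      # numeric column: numeric order
string_sorted = order_by(as_strings)    # string column: lexicographic order
per_reducer = sort_by(numeric, 2)       # sorted within each reducer only

print(numeric_sorted)
print(string_sorted)
print(per_reducer)
```

Each inner list in `per_reducer` is sorted, but concatenating the lists does not generally give a globally sorted result, which is the practical difference from ORDER BY.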


The post Hive : SORT BY vs ORDER BY vs DISTRIBUTE BY vs CLUSTER BY appeared first on Saurz{Code}.


Friday, January 23, 2015

How-To : Setup Development Environment for Hadoop MapReduce

This post is intended for folks who are looking for a quick start on developing a basic Hadoop MapReduce application. We will see how to set up a basic MR application for WordCount using Java, Maven and Eclipse, and run a basic MR program in local mode, which is easy for debugging at …




Thursday, January 8, 2015

How to Connect HiveServer2 service with JDBC Client ?

HiveServer2 (HS2) is a server interface that enables remote clients to execute queries against Hive and retrieve the results. The current implementation, based on Thrift RPC, is an improved version of HiveServer and supports multi-client concurrency and authentication. It is designed to provide better support for open API clients like JDBC and ODBC. In this post, …




Wednesday, January 7, 2015

How to Configure MySQL Metastore for Hive

Hive by default comes with Derby as its metastore storage, which is suited only for testing purposes; in most production scenarios it is recommended to use MySQL as the metastore. This is a step-by-step guide on how to configure a MySQL metastore for Hive in place of the default Derby metastore. Assumptions …

