Tuesday, January 27, 2015

Hive : SORT BY vs ORDER BY vs DISTRIBUTE BY vs CLUSTER BY

SORT BY: Hive uses the columns in SORT BY to sort the rows before feeding them to a reducer. The sort order depends on the column type. If the column is of numeric type, the rows are sorted in numeric order. If the column is of string type, then the …
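To make the per-reducer behaviour concrete, here is a small Python sketch that imitates what SORT BY does; it is not Hive code. The `sort_by` helper and the round-robin split of rows across reducers are assumptions made for illustration only, since Hive decides the actual distribution itself.

```python
def sort_by(rows, key, num_reducers):
    """Imitate SORT BY: every reducer sorts only the rows it receives."""
    # Assumption: rows are dealt to reducers round-robin. This is a
    # stand-in for illustration, not how Hive assigns rows.
    buckets = [rows[i::num_reducers] for i in range(num_reducers)]
    return [sorted(bucket, key=key) for bucket in buckets]


rows = [10, 9, 2, 33, 1, 7]  # a numeric column, so ordering is numeric
per_reducer = sort_by(rows, key=lambda value: value, num_reducers=2)

# Each reducer's output is sorted on its own.
for reducer_id, output in enumerate(per_reducer):
    print(f"reducer {reducer_id}: {output}")

# Concatenating the reducer outputs does not give a total order.
combined = [value for output in per_reducer for value in output]
print("globally sorted:", combined == sorted(rows))
```

With more than one reducer, each output is ordered on its own but the combined result is not, which is the usual reason to reach for ORDER BY when a total order is needed.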


The post Hive : SORT BY vs ORDER BY vs DISTRIBUTE BY vs CLUSTER BY appeared first on Saurz{Code}.

