Comparison of Storage Formats in Hive – TEXTFILE vs ORC vs PARQUET

This post compares the storage formats available in Hive. The comparison is based on the size of the data on HDFS and the time taken to execute a simple query.

Cluster summary

The performance is benchmarked on a 5-node Hadoop cluster. Each node is an 8-core, 8…
