Bucketing in Hive

Bucketing Bucketing is a method to evenly distributed the data across many files. Create multiple buckets and then place each record into one of the buckets based on some logic mostly some hashing algorithm. Bucketing feature of Hive can be used to distribute/organize the table/partition data into multiple files such…

Continue reading

Some Configuration Properties in Hive

We will see some of the configuration properties available in Hive. Hive Warehouse Directory hive.metastore.warehouse.dir Location of directory on HDFS which will be used for storing the hive warehouse data. Default Value: /user/hive/warehouse 0: jdbc:hive2://localhost:10000> show conf “hive.metastore.warehouse.dir”; +———————–+———+————————————————-+–+ |        default        |  type   |                      desc                       | +———————–+———+————————————————-+–+ | /user/hive/warehouse  |…

Continue reading