Create, Use and Drop a Database in Hive

We will see how to Create, Use and Drop a database in Hive. Create a Database #List all the databases 0: jdbc:hive2://localhost:10000> show databases; +—————-+–+ | database_name  | +—————-+–+ | default        | +—————-+–+ #Create a new Database 0: jdbc:hive2://localhost:10000> create database mydb; #List all the databases 0: jdbc:hive2://localhost:10000> show databases;…

Continue reading

Different Approaches for Inserting Data Using Dynamic Partitioning into a Partitioned Hive Table

We will see different ways for inserting data using Dynamic partitioning into a Partitioned Hive table. To know how to create partitioned tables in Hive, go through the following links:- Creating Partitioned Hive table and importing data Creating Hive Table Partitioned by Multiple Columns and Importing Data Dynamic Partitioning Dynamic…

Continue reading

Different Approaches for Inserting Data Using Static Partitioning into a Partitioned Hive Table

We will see different ways for inserting data using static partitioning into a Partitioned Hive table. To know how to create partitioned tables in Hive, go through the following links:- Creating Partitioned Hive table and importing data Creating Hive Table Partitioned by Multiple Columns and Importing Data Static Partitioning Static…

Continue reading

Different Approaches for Inserting Data into a Hive Table

We will see different ways for inserting data into a Hive table. We have a table Employee in Hive with the following schema:- 0: jdbc:hive2://localhost:10000> desc employee; +———–+————+———-+–+ | col_name  | data_type  | comment  | +———–+————+———-+–+ | id        | bigint     |          | | name      | string     |          | | age      …

Continue reading

Hive Queries- Sort by, Order by, Cluster by, and Distribute By

In Hive queries, we can use Sort by, Order by, Cluster by, and Distribute by to manage the ordering and distribution of the output of a SELECT query. We will see this with an example. We have a table Employee in Hive, partitioned by Department. 0: jdbc:hive2://localhost:10000> desc employee; +————————–+———————–+———————–+–+…

Continue reading

Start Hiveserver2, Connect Through Beeline and Run Hive Queries

Hiveserver2 HiveServer2 is an enhanced Hive server designed for multi-client concurrency and improved authentication. It also provides better support for clients connecting through JDBC and ODBC. Start Hiverserver2 We have our hive installation under the directory – /home/hadoop/hive. Go to the ‘bin‘ directory under hive installation directory. To start the…

Continue reading