bigdata – Page 3 – My IT Learnings

Create, Use and Drop a Database in Hive

rajesh • March 17, 2016bigdata

We will see how to Create, Use and Drop a database in Hive. Create a Database #List all the databases 0: jdbc:hive2://localhost:10000> show databases; +—————-+–+ | database_name | +—————-+–+ | default | +—————-+–+ #Create a new Database 0: jdbc:hive2://localhost:10000> create database mydb; #List all the databases 0: jdbc:hive2://localhost:10000> show databases;…

Different Approaches for Inserting Data Using Dynamic Partitioning into a Partitioned Hive Table

rajesh • March 17, 2016bigdata

We will see different ways for inserting data using Dynamic partitioning into a Partitioned Hive table. To know how to create partitioned tables in Hive, go through the following links:- Creating Partitioned Hive table and importing data Creating Hive Table Partitioned by Multiple Columns and Importing Data Dynamic Partitioning Dynamic…

Different Approaches for Inserting Data Using Static Partitioning into a Partitioned Hive Table

rajesh • March 16, 2016bigdata

We will see different ways for inserting data using static partitioning into a Partitioned Hive table. To know how to create partitioned tables in Hive, go through the following links:- Creating Partitioned Hive table and importing data Creating Hive Table Partitioned by Multiple Columns and Importing Data Static Partitioning Static…

Different Approaches for Inserting Data into a Hive Table

rajesh • March 16, 2016bigdata

Hive Queries- Sort by, Order by, Cluster by, and Distribute By

rajesh • March 14, 2016bigdata

In Hive queries, we can use Sort by, Order by, Cluster by, and Distribute by to manage the ordering and distribution of the output of a SELECT query. We will see this with an example. We have a table Employee in Hive, partitioned by Department. 0: jdbc:hive2://localhost:10000> desc employee; +————————–+———————–+———————–+–+…

Start Hiveserver2, Connect Through Beeline and Run Hive Queries

rajesh • March 11, 2016bigdata

Hiveserver2 HiveServer2 is an enhanced Hive server designed for multi-client concurrency and improved authentication. It also provides better support for clients connecting through JDBC and ODBC. Start Hiverserver2 We have our hive installation under the directory – /home/hadoop/hive. Go to the ‘bin‘ directory under hive installation directory. To start the…

Creating Hive Table Partitioned by Multiple Columns and Importing Data

rajesh • March 11, 2016bigdata

We will see how to create a Hive table partitioned by multiple columns and how to import data into the table. Partitioning We can use partitioning feature of Hive to divide a table into different partitions. Each partition of a table is associated with a particular value(s) of partition column(s).…

Creating External Hive table and importing data

rajesh • March 9, 2016bigdata

We will see how to create an external table in Hive and how to import data into the table. External Table External tables in Hive do not store data for the table in the hive warehouse directory. External table in Hive stores only the metadata about the table in the…

Creating Hive table using ORC format and importing data

rajesh • February 2, 2016bigdata

We will see how to create a table in Hive using ORC format and how to import data into the table. ORC format ORC (Optimized Row Columnar) file format provides a highly efficient way to store Hive data. Using ORC format improves performance when reading, writing, and processing data in…

Creating Hive table using SEQUENCEFILE format and importing data

rajesh • January 28, 2016bigdata

We will see how to create a table in Hive using SEQUENCEFILE format and how to import data into the table. Create table CREATE TABLE Employee( ID BIGINT, NAME STRING, AGE INT, SALARY BIGINT ) COMMENT ‘This is Employee table stored as sequencefile’ ROW FORMAT DELIMITED FIELDS TERMINATED BY ‘,’…

Tag Archives: bigdata