YouTube Twitter RSS
My IT Learnings
Posts related to computer science, algorithms, software development, databases etc
 
Skip to content
  • Home
  • Posts
Archive by category "bigdata"

Category Archives: bigdata

Creating Hive table using ORC format and importing data

rajesh • February 2, 2016bigdata bigdata, create table, hadoop, hive, load data, orc
0

We will see how to create a table in Hive using ORC format and how to import data into the table. ORC format ORC (Optimized Row Columnar) file format provides a highly efficient way to store Hive data. Using ORC format improves performance when reading, writing, and processing data in…

Continue reading

Creating Hive table using SEQUENCEFILE format and importing data

rajesh • January 28, 2016bigdata bigdata, create table, hadoop, hive, load data, sequencefile
0

We will see how to create a table in Hive using SEQUENCEFILE format and how to import data into the table. Create table CREATE TABLE Employee( ID BIGINT, NAME STRING, AGE INT, SALARY BIGINT ) COMMENT ‘This is Employee table stored as sequencefile’ ROW FORMAT DELIMITED FIELDS TERMINATED BY ‘,’…

Continue reading

Creating Hive table using TEXTFILE format and importing data

rajesh • January 28, 2016bigdata bigdata, create table, hadoop, hive, load data
1

We will see how to create a table in Hive using TEXTFILE format and how to import data into the table. TEXTFILE Textfile format stores data as plain text files. Textfile format enables rapid development due to its simplicity but other file formats like ORC are much better when it comes…

Continue reading

  • YouTube Twitter RSS
  • Archives

    • February 2016 (2)
    • January 2016 (5)
    • December 2015 (3)
    • November 2015 (4)
    • October 2015 (6)
  • Categories

    • algorithms (3)
    • bigdata (3)
    • computer science (2)
    • data structures (2)
    • database (5)
    • eclipse (1)
    • hadoop (3)
    • java (4)
    • maven (1)
  • Tags

    algorithms ArrayList base64 bfs bigdata binary search BlockingQueue consumer create table database data structures eclipse encoding graph hadoop hdfs hive IN operator insert java jdbc jsch load data maven mysql oracle orc performance producer remote script select sequencefile sql ssh ssh2 stack unicode update