SreeRam Hadoop Notes: Hive(10AmTo1:00Pm) Lab1 notes : Hive Inner and External Tables

Tuesday, 1 August 2017

hive> create table samp1(line string);
-- here we did not select any database.
the default database in hive is "default", so samp1 is created there.

the hdfs location of default database is
/user/hive/warehouse

-- when you create a table in the default database, a directory with the table's name is created under the warehouse location.

in hdfs,
/user/hive/warehouse/samp1 directory is created.

hive> create database mydb;

when a database is created, a directory with the database name and the extension ".db" is created under the warehouse location (here, /user/hive/warehouse/mydb.db).

How to select database:

hive> use mydb;

hive> create table test1(line string);

under the mydb.db directory, a directory for the test1 table is created:

/user/hive/warehouse/mydb.db/test1
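
The directory rule above can be sketched in a few lines of Python. This is only a model of the layout described in these notes, not Hive code: `table_dir` is a hypothetical helper, and the warehouse root is assumed to be the `/user/hive/warehouse` path used here (a real cluster may configure a different one).

```python
import posixpath

# Warehouse root as stated in these notes (assumption: not reconfigured).
WAREHOUSE = "/user/hive/warehouse"


def table_dir(table, database="default"):
    """Return the hdfs directory these notes describe for a table.

    Hypothetical helper for illustration only; Hive does this internally.
    """
    if database == "default":
        # Tables of the default database sit directly under the warehouse root.
        return posixpath.join(WAREHOUSE, table)
    # Any other database first gets its own "<name>.db" directory.
    return posixpath.join(WAREHOUSE, database + ".db", table)


print(table_dir("samp1"))
print(table_dir("test1", "mydb"))
```

The two calls print the same two paths that appear above for samp1 and test1.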

[cloudera@quickstart ~]$ ls file*
file1 file2 file3
[cloudera@quickstart ~]$ cat file1
aaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
[cloudera@quickstart ~]$ cat file2
bbbbbbbbbbbbbbbbb
bbbbbbbbbbbbbbbbbbbb
bbbbbbbbbbbbbb
[cloudera@quickstart ~]$ cat file3
cccccccccccccccccccc
ccccccccccccccccccc
ccccccccccccc
[cloudera@quickstart ~]$

hive> use default;
hive> load data local inpath 'file1'
into table samp1;
-- when you load a local file into a table,
the file is copied into the table's backend directory.

in hdfs,
/user/hive/warehouse/samp1/file1

hive> load data local inpath 'file2'
into table samp1;

now table directory has two files,
file1 and file2.
hive> select * from samp1;
o/p:
aaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
bbbbbbbbbbbbbbbbb
bbbbbbbbbbbbbbbbbbbb
bbbbbbbbbbbbbb

-- hive reads all rows of all files in the
table directory.
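
To make the "all rows of all files" idea concrete, here is a small Python sketch that treats a local temporary directory as the table directory. It is a toy stand-in, not how Hive reads data: `read_table_dir` is a hypothetical name, the sample rows are shortened, and the files are sorted only so the demo output is deterministic.

```python
import os
import tempfile


def read_table_dir(path):
    """Yield every line of every file in `path` (toy stand-in for select *)."""
    for name in sorted(os.listdir(path)):  # sorted only to keep the demo deterministic
        with open(os.path.join(path, name)) as f:
            for line in f:
                yield line.rstrip("\n")


# A local temp directory plays the role of /user/hive/warehouse/samp1.
with tempfile.TemporaryDirectory() as samp1:
    for name, lines in [("file1", ["aaa", "aaaa"]), ("file2", ["bbb"])]:
        with open(os.path.join(samp1, name), "w") as f:
            f.write("\n".join(lines) + "\n")
    rows = list(read_table_dir(samp1))

print(rows)
```

Adding one more file to the directory adds its rows to the result, which mirrors what happens when file2 is loaded after file1.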

another way of loading a file into a table: copy it directly into the table directory.

$ hadoop fs -copyFromLocal file3 /user/hive/warehouse/samp1

hive> select * from samp1;
OK
aaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
bbbbbbbbbbbbbbbbb
bbbbbbbbbbbbbbbbbbbb
bbbbbbbbbbbbbb
cccccccccccccccccccc
ccccccccccccccccccc
ccccccccccccc

hive> use mydb;
hive> show tables;
test1
hive> load data local inpath 'file1' into table test1;
hive>
in hdfs,
/user/hive/warehouse/mydb.db/test1/file1

===============================

Hive tables are basically of two types.

1) Inner tables [managed tables]
2) External tables.

when an inner table is dropped,
both the metadata and the data (in hdfs) are deleted.

when an external table is dropped,
only the metadata is deleted,
but the data is still safely available in the table's backend location in hdfs,

so you can reuse the data in future.

where is the metadata of hive tables stored?
-- in an rdbms,
under the metastore database.

when you submit a query in hive,
hive contacts the metastore, identifies the table's backend hdfs location, and reads the data from there.

by default every table is an inner table [managed table].

to create an external table:

hive> create external table etab1(line string);

hive> load data local inpath 'file1'
into table etab1;

hive> load data local inpath 'file2'
into table etab1;

now etab1 is created under the mydb database,
and the etab1 table directory has 2 files (file1 and file2).

the table's location is recorded in the hive metastore (in rdbms).

when this table is dropped from hive:

hive> drop table etab1;

-- the metadata of this table is deleted from the rdbms.
-- but in hdfs, the table directory and its files are still available.
[data is not lost]

so in future, hive or other ecosystem tools can use this data. [adv: reusability]
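
The difference between dropping an inner table and an external table can be modelled in Python with a dictionary standing in for the metastore and local directories standing in for hdfs. This is a minimal sketch under those assumptions, not Hive's implementation; `create_table`, `drop_table` and the table name `itab` are hypothetical.

```python
import os
import shutil
import tempfile

# Stands in for the metastore (rdbms): table name -> location and table type.
metastore = {}


def create_table(name, location, external=False):
    """Record the table in the toy metastore and make sure its directory exists."""
    os.makedirs(location, exist_ok=True)
    metastore[name] = {"location": location, "external": external}


def drop_table(name):
    """Always remove the metadata; remove the data only for inner tables."""
    entry = metastore.pop(name)
    if not entry["external"]:
        shutil.rmtree(entry["location"])


root = tempfile.mkdtemp()  # plays the role of the database directory
create_table("itab", os.path.join(root, "itab"))                   # inner table
create_table("etab1", os.path.join(root, "etab1"), external=True)  # external table

drop_table("itab")
drop_table("etab1")

inner_dir_left = os.path.exists(os.path.join(root, "itab"))
external_dir_left = os.path.exists(os.path.join(root, "etab1"))
print(inner_dir_left, external_dir_left)

shutil.rmtree(root)  # clean up the demo directory
```

After both drops the toy metastore is empty, the inner table's directory is gone, and the external table's directory is still there.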

How to reuse it.
----------------

hive> use mydb;
hive> create table etab1(line string);
hive> select * from etab1;
aaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
bbbbbbbbbbbbbbbbb
bbbbbbbbbbbbbbbbbbbb
bbbbbbbbbbbbbb

-- when you create etab1,
hive would normally create a directory for it
in hdfs, under the database location.
but under /user/hive/warehouse/mydb.db,
the etab1 directory already exists with two files.

if the directory exists, hive uses it;
if it does not exist, hive creates it.
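
The "use it if it exists, otherwise create it" behaviour is the same idea as `os.makedirs(..., exist_ok=True)` on a local filesystem. The sketch below only illustrates that idea with a temporary directory; `create_table_dir` is a hypothetical helper, not a Hive API.

```python
import os
import shutil
import tempfile


def create_table_dir(location):
    """Create the directory only if it is missing; return True if it already existed."""
    existed = os.path.isdir(location)
    os.makedirs(location, exist_ok=True)
    return existed


root = tempfile.mkdtemp()  # plays the role of the warehouse location
etab1 = os.path.join(root, "mydb.db", "etab1")

first = create_table_dir(etab1)                   # directory missing, so it is created
open(os.path.join(etab1, "file1"), "w").close()   # a data file left behind by an earlier table
second = create_table_dir(etab1)                  # directory present, so it is reused
files = os.listdir(etab1)

print(first, second, files)

shutil.rmtree(root)  # clean up the demo directory
```

The second call finds the directory already in place and leaves file1 untouched, which is why the re-created etab1 immediately shows the old rows.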

============

hive> drop table etab1;
-- both data and metadata will be deleted,
because this time we created etab1 as an "inner" table.

=================

Both inner and external tables can use
custom hdfs locations.

hive> create table mytab(line string)
location '/user/mydata';

in hdfs,
/user/mydata directory will be created
hive> load data local inpath 'file1'
into table mytab;

now file1 will be copied into /user/mydata.

hive> drop table mytab;
here mytab was created as an inner table,
so both the metadata and the data (/user/mydata)
will be deleted.

hive> create external table urtab(line string)
location '/user/urdata';

now in hdfs,
/user/urdata directory will be created.

hive> load data local inpath 'file1'
into table urtab;

hive> load data local inpath 'file2'
into table urtab;

hive> load data local inpath 'file3'
into table urtab;

now file1, file2 and file3 are copied into
the /user/urdata directory of hdfs.

hive> drop table urtab;
-- only the metadata in the rdbms is deleted. the /user/urdata directory is still available with file1, file2 and file3.

Reusing next time:

hive> create table ourtab(line string)
location '/user/urdata';

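here /user/urdata already exists in hdfs, so hive will use it. if it did not exist, hive would create it.
note that ourtab is created here as an inner table, so dropping it would delete /user/urdata as well.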

=====================================
