[cloudera@quickstart ~]$ cat saleshistory
01/01/2011,2000
01/01/2011,3000
01/02/2011,5000
01/02/2011,4000
01/02/2011,1000
01/03/2011,2000
01/25/2011,3000
01/25/2011,5000
01/29/2011,4000
01/29/2011,1000
02/01/2011,2000
02/01/2011,3000
02/02/2011,8000
03/02/2011,9000
03/02/2011,3000
03/03/2011,5000
03/25/2011,7000
03/25/2011,2000
04/29/2011,5000
04/29/2011,3000
05/01/2011,2000
05/01/2011,3000
05/02/2011,5000
05/02/2011,4000
06/02/2011,1000
06/03/2011,2000
06/25/2011,3000
07/25/2011,5000
07/29/2011,4000
07/29/2011,1000
08/01/2011,2000
08/01/2011,3000
08/02/2011,5000
09/02/2011,4000
09/02/2011,1000
09/03/2011,2000
09/25/2011,3000
10/25/2011,5000
10/29/2011,4000
10/29/2011,1000
10/29/2011,5000
11/01/2011,2000
11/01/2011,3000
11/02/2011,5000
11/02/2011,4000
11/02/2011,1000
11/03/2011,2000
11/25/2011,3000
12/25/2011,5000
12/29/2011,4000
12/29/2011,1000
12/30/2011,9000
12/30/2011,40000
[cloudera@quickstart ~]$
-- raw staging table: date kept as MM/DD/YYYY text, amount as int
create table myraw(dt string, amt int)
row format delimited
fields terminated by ',';
load data local inpath 'saleshistory' into table myraw;
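-- optional sanity check (not part of the walkthrough): the file above
-- has 53 rows, so count(*) should return 53 if the load worked
select * from myraw limit 5;
select count(*) from myraw;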
-- urraw copies myraw's schema; it will hold the expanded data set
create table urraw like myraw;
-----------------
-- synthesize five more years (2012-2016) by rewriting the last digit
-- of the year and shifting the amounts
insert overwrite table urraw
select * from (
select dt, amt from myraw
union all
select concat(substr(dt,1,9),'2') as dt, amt+1000 as amt
from myraw
union all
select concat(substr(dt,1,9),'3') as dt, amt+4000 as amt
from myraw
union all
select concat(substr(dt,1,9),'4') as dt, amt+500 as amt
from myraw
union all
select concat(substr(dt,1,9),'5') as dt, amt+8000 as amt
from myraw
union all
select concat(substr(dt,1,9),'6') as dt, amt+1000 as amt
from myraw ) s;
-- urraw now holds six years of data: the 2011 rows plus the
-- synthesized 2012-2016 copies
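-- optional check: each year 2011 through 2016 should come back with 53 rows
select substr(dt,7,4) as yr, count(*) as cnt
from urraw
group by substr(dt,7,4);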
-- ourraw stores each date split into its [month, day, year] parts
create table ourraw(dt array<string>, amt int);
insert overwrite table ourraw
select split(dt,'/') , amt from urraw;
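-- split() yields a three-element array: dt[0] is the month, dt[1] the
-- day, dt[2] the year; a quick look at the pieces:
select dt[0] as m, dt[1] as d, dt[2] as y, amt from ourraw limit 3;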
-- rebuild each date as yyyy-MM-dd so Hive's date functions can parse it
create table sales(dt string, amt int);
insert overwrite table sales
select concat(dt[2],'-',dt[0],'-',dt[1]), amt from ourraw;
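-- with dates in yyyy-MM-dd form, Hive's built-in date functions apply
-- directly; for example, a per-month rollup:
select year(dt) as y, month(dt) as m, sum(amt) as total
from sales
group by year(dt), month(dt);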
-- final table, partitioned by year / month / day
create table salespart(dt string, amt int)
partitioned by (y int, m int, d int)
row format delimited
fields terminated by ',';
-- enable dynamic partitioning; nonstrict mode lets every partition
-- column be derived from the data instead of being named in the query
set hive.exec.dynamic.partition=true;
set hive.exec.dynamic.partition.mode=nonstrict;
-- raise the limits: one partition is created per distinct (y, m, d)
set hive.exec.max.dynamic.partitions=100000;
set hive.exec.max.dynamic.partitions.pernode=10000;
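-- for comparison, a static-partition load (a sketch for a single day)
-- names every partition key explicitly and works even in strict mode:
insert overwrite table salespart partition (y=2011, m=1, d=1)
select dt, amt from sales where dt='2011-01-01';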
-- the last three select columns feed the y, m, d partition keys in order
insert overwrite table salespart
partition (y, m, d)
select dt, amt, year(dt), month(dt), day(dt)
from sales;
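-- verify the partitions and run a pruned query: only the y=2011/m=12
-- directories are scanned
show partitions salespart;
select sum(amt) from salespart where y=2011 and m=12;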
------ sankara.deva2016@gmail.com
------ sub: partitions case study