hive中的lateral view 用法详解下篇

来源:互联网 发布:知的笔顺怎么写呀 编辑:程序博客网 时间:2024/06/11 12:03

例子

假设我们有一张表pageAds,它有两列数据,第一列是pageid string,第二列是adid_list,即用逗号分隔的广告ID集合:

string pageidArray<int> adid_list"front_page"[1, 2, 3]"contact_page"[3, 4, 5]

 

 

 

要统计所有广告ID在所有页面中出现的次数。

首先分拆广告ID:

SELECT pageid, adid FROM pageAds LATERAL VIEW explode(adid_list) adTable AS adid;

执行结果如下:

string pageidint adid"front_page"1"front_page"2"front_page"3"contact_page"3"contact_page"4"contact_page"5

 

 

 

 

 

接下来就是一个聚合的统计:

SELECT adid, count(1) FROM pageAds LATERAL VIEW explode(adid_list) adTable AS adidGROUP BY adid;

执行结果如下:

int adidcount(1)1121324151

 

 

 

 

多个lateral view语句

一个FROM语句后可以跟多个lateral view语句,后面的lateral view语句能够引用它前面的所有表和列名。 以下面的表为例:

Array<int> col1Array<string> col2[1, 2][a", "b", "c"][3, 4][d", "e", "f"]



> LATERAL VIEW explode(col1) myTable1 AS myCol1;

执行结果为:

int mycol1Array<string> col21[a", "b", "c"]2[a", "b", "c"]3[d", "e", "f"]4[d", "e", "f"]

 

 

 


加上一个lateral view:

SELECT myCol1, myCol2 FROM baseTable LATERAL VIEW explode(col1) myTable1 AS myCol1    LATERAL VIEW explode(col2) myTable2 AS myCol2;

它的执行结果为:

int myCol1string myCol21"a"1"b"1"c"2"a"2"b"2"c"3"d"3"e"3"f"4"d"4"e"4"f"







 




多个lateral view语句

一个FROM语句后可以跟多个lateral view语句,后面的lateral view语句能够引用它前面的所有表和列名。 以下面的表为例:

Array<int> col1Array<string> col2[1, 2][a", "b", "c"][3, 4][d", "e", "f"]
  >SELECT myCol1, col2 FROM baseTable    LATERAL VIEW explode(col1) myTable1 AS myCol1;

执行结果为:

int mycol1Array<string> col21[a", "b", "c"]2[a", "b", "c"]3[d", "e", "f"]4[d", "e", "f"]

 

 

 

 
加上一个lateral view:

SELECT myCol1, myCol2 FROM baseTable LATERAL VIEW explode(col1) myTable1 AS myCol1    LATERAL VIEW explode(col2) myTable2 AS myCol2;

它的执行结果为:

int myCol1string myCol21"a"1"b"1"c"2"a"2"b"2"c"3"d"3"e"3"f"4"d"4"e"4"f"









注意上面语句中,两个lateral view按照出现的次序被执行。

转自 https://cwiki.apache.org/confluence/display/Hive/LanguageManual+LateralView#

      http://blog.csdn.net/inte_sleeper/article/details/7196114

0 0
原创粉丝点击