這篇文章主要介紹“hive怎么導入mysql腳本”,在日常操作中,相信很多人在hive怎么導入mysql腳本問題上存在疑惑,小編查閱了各式資料,整理出簡單好用的操作方法,希望對大家解答”hive怎么導入mysql腳本”的疑惑有所幫助!接下來,請跟著小編一起來學習吧!
import subprocess
import sys
print "腳本名:", sys.argv[0]
tableName=sys.argv[1]
startdate=sys.argv[2]
enddate=sys.argv[3]
if startdate=="":
print "please input statdate !"
sys.exit()
if enddate=="" :
enddate=startdate
com=""
if tableName=="t_netflow_pc_city_cate_pv" :
#t_netflow_pc_city_cate_pv_yyyymm hql
com="hive -e \"use tmpdb; select statdate , chlname, pagetype,area1 ,cate1, cate2 , pv, uv ,newuv , visit,newvisit , avgpage , avgtime , jumpin , jumpout , outrate from ext_pctrack_is where statdate>='"+startdate+"' and statdate<='"+enddate+"' and cate3='A' and cate4='A' and cate5='A' and area2='A' and area3='A' and area4='A' and source1='A' and source2='A';\" > tmp/t_netflow_pc_city_cate_pv_"+startdate+".txt;"
elif tableName=="t_netflow_pc_source_cate_pv":
com="hive -e \"use tmpdb; select statdate , chlname, pagetype,source1 ,source2 , cate1, cate2,cate3 , pv, uv ,newuv , visit,newvisit , avgpage , avgtime , jumpin , jumpout , outrate from ext_pctrack_is where statdate>='"+startdate+"' and statdate<='"+enddate+"' and cate4='all' and cate5='all' and area1='all' and area2='all' and area3='all' and area4='all' ;\" > tmp/t_netflow_pc_source_cate_pv_"+enddate+".txt;"
elif tableName=="t_netflow_pc_pv":
com="hive -e \"use tmpdb; select statdate , chlname,pagetype,source1 , area1,cate1, pv, uv ,newuv , visit,newvisit , avgpage , avgtime , jumpin , jumpout , outrate from ext_pctrack_is where statdate>='"+startdate+"' and statdate<='"+enddate+"' and cate2='all' and cate3='all' and cate4='all' and cate5='all' and area2='all' and area3='all' and area4='all' and source2='all';\" > tmp/t_netflow_pc_pv_"+startdate+".txt;"
print com
def executeHql(hql):
p = subprocess.Popen(hql, shell=True, stdout=subprocess.PIPE, stderr=subprocess.STDOUT)
res="1"
for line in p.stdout.readlines():
print line
if "Exception" in line :
executeHql(com)
res="0"
else :
res="1"
retval = p.wait()
return res
print "hive select start ------------------------------------------"
count=1
while 1:
print "count is ----------------------" +str(count)
count=count+1
res=executeHql(com)
if res=="1":
break
print "hive select end ------------------------------------------"
print "mysql load start ------------------------------------------"
mload="mysql -uXXX -pXXXX -hXXXX -PXXX -DXXX -e\"LOAD DATA LOCAL INFILE 'tmp/"+tableName+"_"+startdate+".txt' INTO TABLE "+tableName+" FIELDS TERMINATED BY '\t' LINES TERMINATED BY '\n'\""
p = subprocess.Popen(mload, shell=True, stdout=subprocess.PIPE, stderr=subprocess.STDOUT)
for line in p.stdout.readlines():
print line
retval = p.wait()
print "mysql load end ------------------------------------------"
到此,關于“hive怎么導入mysql腳本”的學習就結束了,希望能夠解決大家的疑惑。理論與實踐的搭配能更好的幫助大家學習,快去試試吧!若想繼續學習更多相關知識,請繼續關注億速云網站,小編會繼續努力為大家帶來更多實用的文章!
免責聲明:本站發布的內容(圖片、視頻和文字)以原創、轉載和分享為主,文章觀點不代表本網站立場,如果涉及侵權請聯系站長郵箱:is@yisu.com進行舉報,并提供相關證據,一經查實,將立刻刪除涉嫌侵權內容。