In this part of the project, you will various queries usingHive, Pig and Hadoop streaming. The schema is available below, butdonāt forget to apply the correct delimiter:
http://rasinsrv07.cstcis.cti.depaul.edu/CSC555/SSBM1/SSBM_schema_hive.sql
The data is available at:
http://rasinsrv07.cstcis.cti.depaul.edu/CSC553/data/ (we will useScale4)
Use Lineorder.tbl
Implement the following query: select sum(lo_revenue), p_brand1from lineorder, part, supplier where lo_partkey = p_partkey andlo_suppkey = s_suppkey and p_category = ‘MFGR#12′ and s_region =’EUROPE’ group by p_brand1;
using Hive and Pig .
Expert Answer
An answer will be send to you shortly. . . . .