hadoop - How to pass pig option as parameter in oozie? -
in order execute pig script, need turn off optimizer. using below command in command line , scripts work fine.
pig -t columnmapkeyprune population.pig
how pass option in oozie?
i tried passing argument.
<action> <pig> <job-tracker>${jobtracker}</job-tracker> <name-node>${namenode}</name-node> <script>population.pig</script> <argument>-t</argument> <argument>columnmapkeyprune</argument> <param>piggybankjar=${piggybankjar}</param> <param>datafujar=${datafujar}</param> <param>inputpath=${inputpath}</param> <param>outputpath=${outputpath}</param> </pig> </action>
received below error:
e0701: xml schema error, cvc-complex-type.2.4.a: invalid content found starting element 'param'. 1 of '{"uri:oozie:workflow:0.4":argument, "uri:oozie:workflow:0.4":file, "uri:oozie:workflow:0.4":archive}' expected.
tried using param, didnt worked
<action> <pig> <job-tracker>${jobtracker}</job-tracker> <name-node>${namenode}</name-node> <script>population.pig</script> <param>-t</param> <param>columnmapkeyprune</param> <param>piggybankjar=${piggybankjar}</param> <param>datafujar=${datafujar}</param> <param>inputpath=${inputpath}</param> <param>outputpath=${outputpath}</param> </pig> </action>
for pig oozie action allows param, archive, file , argument tags. how pass optimizer_off option ?
set below mentioned property in pig script , try once.
set pig.optimizer.rules.disabled 'columnmapkeyprune';
for reference, please go through below mentioned link
[http://pig.apache.org/docs/r0.14.0/perf.html#optimization-rules]
Comments
Post a Comment