
how to use dynamic allocation in a oozie spark action on CDH5
using spark’s dynamic allocation feature in a oozie spark action can be a tricky.
using spark’s dynamic allocation feature in a oozie spark action can be a tricky.
If you execute spark jobs within an oozie workflow using a action node on a Cloudera CDH5 cluster, your job may not show up on your spark history server.
This blogpost will guide you to the process of cloning, patching, building and deploying a custom version of the oozie workflow engine based on the cdh 5.
how to inject the configuration of a remote ha-hdfs in a distcp call without modifing the local cluster configuration.
oozie’s ‘capture-output’ is a powerful method the pass dynamic configuration properties from action to action, but you may hit the maximum size limit quite fast.
How to use Accumulos RangePartioner to increase your mr-job ingest rate (and the neccessary pieces to include it into an oozie worflow)