Passing many parameters from Java action to Oozie workflow

oozie’s ‘capture-output’ is a powerful method the pass dynamic configuration properties from action to action, but you may hit the maximum size limit quite fast. Read more

Using Accumulos RangePartioner in a m/r job (and Oozie workflow)

How to use Accumulos RangePartioner to increase your mr-job ingest rate (and the neccessary pieces to include it into an oozie worflow)
Read more

Upgrading an existing Accumulo 1.6 CDH5 cluster to HDFS HA

Theses steps are neccessary for Accumulo 1.6 after upgrading an existing CDH5 cluster to HDFS HA.
Read more

Moving a Cloudera Manager 5 Installation to a new host – don’t forget the repo folder

Having trouble provisioning new nodes after you have moved you cloudera manager installation to a new host ?
Read more

oozie-graphite available for CDH5

Our open source project oozie-graphite is now available for CDH5.
Read more

Accumulo 1.6 (CDH5) not available for Ubuntu 14.04 (Trusty Tahr)

If you want to deploy a CDH5 Cluster including Accumulo 1.6 on Ubuntu, you will need to stick to Ubuntu 12.04 Precise Pangolin for the time being.
Read more

Increasing mapreduce.job.counters.max on CDH5 YARN (MR2)

How to increase mapreduce.job.counters.max on YARN (MR2) for HUE / HIVE / OOZIE.
Read more

Reinitializing an Accumulo cluster on CDH4 or CDH5

How to manually reinitialize an existing Accumulo cluster on CDH4 or CDH5
Read more

Fixing DFSMiniCluster ExceptionInInitializerError on Bamboo OnDemand

How to fix ExceptionInInitializerError in DFSMiniCluster caused by wrong default file permissions on certain linux distros on your bamboo OnDemand setup.
Read more

Oozie bundle monitoring: tapping into hadoop counters

This is the first post about GraphiteMRCounterExecutor use cases: we start by utilizing already available hadoop counters that deliver very valueable graphs.
Read more