Who we are

We are located in Hanover (Germany), and we embrace big data technologies. We help our international customers to understand new technologies, to select the right building blocks and to tailor them to individual business cases.

What we do

We design scalable systems for structured and unstructured data. We build resilient connections between systems. We implement big data technologies and coach teams in their use.

Technologies

We scale your data processing with Hadoop, Spark or streaming technologies like Apache Flink or Apache Storm. We create analytics tools based on Apache Hue, Apache Pig, Presto, Hive, Cassandra or HBase. We implement resilient interconnected data processing with Oozie, Airflow or Schedoscope.

Latest posts from our developer blog

how to use dynamic allocation in an oozie spark action on CDH5

Using Spark's dynamic allocation feature in an Oozie Spark action can be tricky. First you need to make sure that dynamic allocation is actually available on your cluster. Navigate to your "Spark" service, then "Configuration" and search…

spark oozie action jobs not showing up on spark history server

If you execute Spark jobs within an Oozie workflow using a <spark> action node on a Cloudera CDH5 cluster, your jobs may not show up on your Spark history server. Even if you configured all these things using the Cloudera Manager, your…

fixing spark classpath issues on CDH5 accessing Accumulo 1.7.2

We experienced a strange NoSuchMethodError while migrating an Accumulo-based application running on CDH5 from 1.6.0 to 1.7.2. A couple of code changes were necessary when moving from 1.6.0 to 1.7.2, but these were pretty straightforward (members…

how to collect cloudera manager usage data with google analytics

The Cloudera Manager is already capable of tracking usage data via Google Analytics, but that data is being sent to a Cloudera account. This blog post is about configuring the Cloudera Manager and changing the tracking ID so that these usage…