
GDELT on SCDF 2.2.0: Implementing an advanced processor to drop duplicate data with Kafka Streams
In the fourth part of our blog post series “processing GDELT data with SCDF on Kubernetes”, we reimplement the deduplication filter from the previous post as a Kafka Streams application, including custom SerDes.