Monthly Archives: October 2020

I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.

TL;DR: I made a Docker Compose setup that runs Hadoop, Spark and Hive in a multi-container environment. You can find the necessary files for it here:

How it started

We at DIKW are working on a Certified Data Engineering … Continue reading

Posted in Howto, Learning Big Data, Spark | 10 Comments