Tag Archives: docker-compose

I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.

TL;DR: I made a Docker Compose setup that runs Hadoop, Spark and Hive in a multi-container environment. You can find the necessary files here: https://github.com/Marcel-Jan/docker-hadoop-spark

How it started

We at DIKW are working on a Certified Data Engineering …

Posted in Howto, Learning Big Data, Spark | 6 Comments