-
Recent Posts
Recent Comments
- Mart Laurano on I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.
- Marcel-Jan Krijgsman on I tried Lion’s Mane as a cognitive enhancer. Here are my experiences with it.
- B on I tried Lion’s Mane as a cognitive enhancer. Here are my experiences with it.
- Suresh Vemuri on I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.
- Suresh Vemuri on I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.
Archives
- May 2022
- April 2022
- March 2022
- January 2022
- November 2021
- October 2021
- September 2021
- August 2021
- July 2021
- June 2021
- May 2021
- February 2021
- October 2020
- November 2019
- September 2019
- June 2019
- April 2019
- March 2019
- January 2019
- December 2018
- May 2018
- April 2018
- February 2018
- January 2018
- December 2017
- November 2017
- August 2017
- July 2017
- June 2017
- May 2017
- April 2017
- February 2017
Categories
Meta
-
Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
To find out more, including how to control cookies, see here: Cookie Policy
Tag Archives: docker-compose
My Github repo got 50 stars
I never imagined myself as a maintainer of a data engineering related open source thing. Yet. But when I was working on our data engineering course, I needed some kind of data lake software. At first I used the Cloudera … Continue reading
Posted in Apache Products for Outsiders, Data engineering, Learning Big Data
Tagged Docker, docker-compose, Github, Hadoop, stars
Leave a comment
Gaining insights on my workout data with Apache Superset
For a few years I’ve been gathering data on my workouts. In Excel. It’s not exactly state of the art data architecture, but it was fine for a while. But data alone doesn’t do much. I wanted some questions answered. … Continue reading
Posted in Apache Products for Outsiders, Howto
Tagged Apache Superset, DATETIME, Docker, docker-compose, health data, PostgreSQL
Leave a comment
I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.
TL;DR: I made a Docker compose that runs Hadoop, Spark and Hive in a multi-container environment. You can find the necessary files for it here: https://github.com/Marcel-Jan/docker-hadoop-spark [Update 2021-11-09: Since Docker Desktop turned “Expose daemon on tcp://localhost:2375 without TLS” off by … Continue reading
Posted in Howto, Learning Big Data, Spark
Tagged Apache Spark, Big Data Europe, DIKW, Docker, docker-compose, Hadoop, Hive
17 Comments