Recent Posts
- Showing a gift total on a Raspberry Pi with an e-ink display – how hard could it be?
- How I memorise my lines (and other things) with Anki
- Categorising text with ChatGPT. Results may be messy.
- A Strava dashboard on a Raspberry Pi (Part 3): The Strava API
- A Strava dashboard on a Raspberry Pi (Part 2): Installing software
Recent Comments
Monthly Archives: April 2018
Starting at Port of Rotterdam per 1 May 2018
Next week (1 May 2018) I will start as a Hadoop specialist/data steward/data custodian/data something something at the Advanced Analytics team at Port of Rotterdam. We haven’t worked out a fancy data something title yet. I’m already working at this … Continue reading
Dataworks Summit Berlin 2018, day two
Back for round two of keynotes, good technical sessions and discussing them with fellow data specialists in between. Keynotes First up was Frank Säuberlich from Teradata, who had an interesting example of machine learning for fraud detection at Danske Bank. … Continue reading
Posted in Conferences, Events
Tagged Apache Atlas, Apache Metron, Apache Ranger, Data Steward Studio, Dataworks Summit, Docker, GDPR, Personal data, Roaring Elephant podcast, Spark, Synerscope, TPC-H
Leave a comment
Building HDP 2.6 on AWS, Part 3: the worker nodes
This is part 3 in a series on how to build a Hortonworks Data Platform 2.6 cluster on AWS. By now we have an edge node to run Ambari Server, three master nodes for Hadoop name nodes and such. Now … Continue reading
Posted in Howto
Tagged Amazon Web Services, AWS, cloning nodes, Hadoop, HDP, Hortonworks Data Platform, Ubuntu Server, worker nodes
Leave a comment