Recent Posts
- Showing a gift total on a Raspberry Pi with an e-ink display – how hard could it be?
- How I memorise my lines (and other things) with Anki
- Categorising text with ChatGPT. Results may be messy.
- A Strava dashboard on a Raspberry Pi (Part 3): The Strava API
- A Strava dashboard on a Raspberry Pi (Part 2): Installing software
Recent Comments
Monthly Archives: September 2019
Tech dossier: pandas
I’m keeping tech dossiers in Evernote on open source products I want to keep track of. And I decided to put them on my blog. My previous ones were on Kubernetes and Elasticsearch. This one is on the Python data … Continue reading
Posted in Data engineering, Python, Tech dossier
Tagged data manipulation, multiindex, pandas, programming, Python, Tech dossier
Leave a comment
Book review: Spark in Action, 2nd edition
There are lots of books on Spark, but not a lot that aimed at the data engineer. Data engineers use Spark to ingest and transform data, which is different from what data scientists use it for. On the Roaring Elephant … Continue reading
Posted in Data engineering, Spark
Tagged Apache Spark, Jean-Georges Perrin, Roaring Elephant podcast, Spark
2 Comments