Recent Comments
Author Archives: Marcel-Jan Krijgsman
Masterclass Machine Learning in Cycling
Last Tuesday Paul van Herpt and I traveled to Lille for a special Machine Learning and Cycling Masterclass. As data partner of Soudal Quick-Step Pro Cycling Team, these are exactly the applications that touch where we as Transfer Solutions can … Continue reading
Posted in Active Learning, Conferences
Tagged computer vision, cycling, Machine learning, Python
Leave a comment
How to use data to find the best spot for a sponsor event
As you might know I’m currently doing sponsor events for Tour for Life, to collect funds for the Daniel den Hoed Foundation, for cancer research. Aniel and me have been doing this for the 3rd year now. And we noticed … Continue reading
Visiting PyGrunn 2025
Conferences are a great way to learn diverse topics in your field. That’s why I like to go to events like Pycons and last Friday, PyGrunn. PyGrunn is a Python event in Groningen, the Netherlands. I submitted two talks for … Continue reading
Posted in Conferences, Python
Tagged Model Context Protocols, PyGrunn, Python, repositories, typing
Leave a comment
My experiences with agentic AI
Originally I wanted to write a blogpost about what data engineers are going to do with AI writing their code. But before I can write that, I need to share my experiences so far. Because from this you’ll get an … Continue reading
Profiling data with ydata in PySpark
When you got a dataset to explore, there are several ways to do that in PySpark. You can do a describe or a summary. But if you want something a little more advanced, and if you want to get a … Continue reading
Posted in Data management, Python, Spark
Tagged data profiling, data quality, PySpark, Python
Leave a comment
My experiences with Azure Purview
At my last customer I have extensively worked with Ataccama, a data management product. It has a data catalog to store metadata on datasets, and it can do data quality checks. In Azure Microsoft has a data management product too. … Continue reading
Posted in Azure, Data engineering, Data management
Tagged azure, data catalog, data management, data quality, purview
Leave a comment
Things I learned about Azure Data Fabric
Currently I’m helping colleagues to read open data in Azure Data Fabric. Here are some of my experiences with it. I don’t want to do an extensive description of what Data Fabric is. In short, if you have an organisational … Continue reading
Posted in Azure, Cloud, Things I Learned
Tagged azure, Data Fabric, Data Factory, Lakehouse, Notebook, Python, Things I Learned
Leave a comment
A great time at PyCon Ireland 2024
I think it was last year when I announced that I wanted to go back to conferences again. Preferably as a speaker. But what conference is the best for data engineers? I couldn’t quite figure it out. Then the call … Continue reading
Using OCR to get data from my Robi scale
How it started For several years I kept track of my weight and fat with a Soehnle Body Balance, which I bought in 2018. That worked quite well until I saw more and more these weird deviations. Take a look … Continue reading
Showing a gift total on a Raspberry Pi with an e-ink display – how hard could it be?
TL;DR: These Python and Raspberry Pi projects. They are fun aren’t they? And often they look deceptively simple. But you don’t see all the projects that failed and usually not where they struggled. This project got stuck (and almost failed) … Continue reading
Posted in Howto, Python
Tagged chromium browser, e-ink display, failure, Python, Raspberry Pi, requests-html, scraping
Leave a comment