Author Archives: Marcel-Jan Krijgsman
What I learned from using OCR to get data from my weighing scale
A bit more than a year ago I wrote about the Robi S11 personal weighing scale and how it would not share its data with me, except as a JPEG file (from the Fitdays app). Recently I got my Python code … Continue reading
Posted in Python, Things I Learned
inkystarmap – an always up to date starmap on the wall
Last year I gave a talk at PyCon Ireland 2024 about e-ink displays, in which I showed several ways you can program e-ink displays on a Raspberry Pi with Python. For this talk I developed one extra application: … Continue reading
Posted in Uncategorized
Masterclass Machine Learning in Cycling
Last Tuesday Paul van Herpt and I traveled to Lille for a special Machine Learning and Cycling Masterclass. As data partner of Soudal Quick-Step Pro Cycling Team, these are exactly the applications that touch where we as Transfer Solutions can … Continue reading
Posted in Active Learning, Conferences
Tagged computer vision, cycling, Machine learning, Python
How to use data to find the best spot for a sponsor event
As you might know, I’m currently doing sponsor events for Tour for Life to collect funds for the Daniel den Hoed Foundation, for cancer research. Aniel and I have been doing this for the third year now. And we noticed … Continue reading
Visiting PyGrunn 2025
Conferences are a great way to learn about diverse topics in your field. That’s why I like to go to events like PyCons and, last Friday, PyGrunn. PyGrunn is a Python event in Groningen, the Netherlands. I submitted two talks for … Continue reading
Posted in Conferences, Python
Tagged Model Context Protocols, PyGrunn, Python, repositories, typing
My experiences with agentic AI
Originally I wanted to write a blog post about what data engineers are going to do now that AI is writing their code. But before I can write that, I need to share my experiences so far. Because from this you’ll get an … Continue reading
Profiling data with ydata in PySpark
When you have a dataset to explore, there are several ways to do that in PySpark. You can do a describe or a summary. But if you want something a little more advanced, and if you want to get a … Continue reading
Posted in Data management, Python, Spark
Tagged data profiling, data quality, PySpark, Python
My experiences with Azure Purview
At my last customer I worked extensively with Ataccama, a data management product. It has a data catalog to store metadata on datasets, and it can do data quality checks. In Azure, Microsoft has a data management product too. … Continue reading
Posted in Azure, Data engineering, Data management
Tagged azure, data catalog, data management, data quality, purview
Things I learned about Azure Data Fabric
Currently I’m helping colleagues read open data in Azure Data Fabric. Here are some of my experiences with it. I don’t want to give an extensive description of what Data Fabric is. In short, if you have an organisational … Continue reading
Posted in Azure, Cloud, Things I Learned
Tagged azure, Data Fabric, Data Factory, Lakehouse, Notebook, Python, Things I Learned
A great time at PyCon Ireland 2024
I think it was last year when I announced that I wanted to go back to conferences again. Preferably as a speaker. But what conference is the best for data engineers? I couldn’t quite figure it out. Then the call … Continue reading