-
Recent Posts
Recent Comments
- Marcel-Jan Krijgsman on I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.
- Chris on I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.
- admin_r0g1nuq9 on I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.
- LJ on I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.
- admin_r0g1nuq9 on I built a working Hadoop-Spark-Hive cluster on Docker. Here is how.
Archives
Categories
Meta
-
Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
To find out more, including how to control cookies, see here: Cookie Policy
Tag Archives: metadata
Don’t do data management just because you have to
Lately more and more organizations are doing data management. Suddenly there are data owners, data stewards and metadata repositories (in whatever form) everywhere. We all seem to do this mainly because we have to. Because of the GDPR or the … Continue reading →
Posted in Data engineering
|
Tagged Amundsen, Apache Atlas, data lineage, GDPR, Lyft, metadata
|
Leave a comment
Things I’ve learned about metadata for a data lake
I’ve been thinking of writing a blogpost about Apache Atlas. For one and a half years I’ve gained a unique experience with this product that I would like to share with the world. But first we need to talk about … Continue reading →
Posted in Data engineering
|
Tagged Apache Atlas, data catalog, data lake, data ownership, data source, data visibility, datasets, metadata
|
Leave a comment