Recent Comments
Tag Archives: data profiling
Profiling data with ydata in PySpark
When you got a dataset to explore, there are several ways to do that in PySpark. You can do a describe or a summary. But if you want something a little more advanced, and if you want to get a … Continue reading
Posted in Data management, Python, Spark
Tagged data profiling, data quality, PySpark, Python
Leave a comment