This post is about missing data and it’s part of the Data Cleaning series that we started here. To teach you some good practices on how to deal with missing data, I solved a practical problem. In the solution of the problem, you’ll find:
- The correct procedure to deal with missing data.
- A warning about possible data leakage situations.
- Details on how to build pipelines and why you need them.
You can also access this example on GitHub.