ML-learning-path

Self-learning guide for machine learning

View the Project on GitHub elephantscale/ML-learning-path

Index/ Python Data Analysis


Data Cleansing

Objective

Learn how to clean up messy data.

Reference

Essential Reading

Data Cleansing

Handling Missing Data

Extra Reading

Checklist

After completing the exercises below, you should be comfortable with handling missing data and cleaning up messy datasets.

Exercises

Difficulty Level

★☆☆ - Easy
★★☆ - Medium
★★★ - Challenging
★★★★ - Bonus

A - Handling Missing Data

A1 - Handling Missing Data

We have the following data. What would be a good way to handle missing data? Please discuss the following choices:

year     month   rainfall
2019     Jan     10
2019     Feb     12
2019     Mar     ?
2019     Apr     20
2019     May     ?
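
As a starting point for the discussion, here is a minimal sketch of two common options for ordered data like monthly rainfall: linear interpolation between known neighbours, and carrying the last observation forward. It uses plain Python so it runs without extra libraries. The helper names `interpolate_linear` and `forward_fill` are made up for this illustration. In pandas, `Series.interpolate()` and `Series.ffill()` cover similar ground. Whether either option is appropriate depends on the data, so treat this as one possibility rather than the answer.

```python
from typing import List, Optional

def interpolate_linear(values: List[Optional[float]]) -> List[Optional[float]]:
    """Fill gaps that lie between two known values by linear interpolation.

    Gaps at the very start or end have only one known neighbour,
    so they are left as None.
    """
    filled = list(values)
    known = [i for i, v in enumerate(values) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (values[right] - values[left]) / (right - left)
        for i in range(left + 1, right):
            filled[i] = values[left] + step * (i - left)
    return filled

def forward_fill(values: List[Optional[float]]) -> List[Optional[float]]:
    """Replace each gap with the most recent known value before it."""
    filled = []
    last = None
    for v in values:
        if v is not None:
            last = v
        filled.append(last)
    return filled

months = ["Jan", "Feb", "Mar", "Apr", "May"]
rainfall = [10, 12, None, 20, None]  # None marks the '?' entries

# Mar sits between Feb (12) and Apr (20), so it can be interpolated.
interpolated = interpolate_linear(rainfall)

# May has no later value, so interpolation leaves it empty;
# carrying Apr forward is one (debatable) fallback.
carried = forward_fill(interpolated)

for month, value in zip(months, carried):
    print(month, value)
```

Note that the two gaps behave differently: March has known values on both sides, while May is the last row, so any value filled in there is an extrapolation and deserves more caution.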

A2 - Handling Missing Data

How will you handle missing data in this dataset?

Person   Height_cm
A         180
B         ?
C         172
D         155
E         160
F         ?
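
Unlike the rainfall table, these rows have no natural order, so interpolating between neighbours makes little sense. A minimal sketch of two alternatives follows: dropping the incomplete rows, or imputing a typical value such as the mean or median. It uses only the standard library, and the variable names are chosen for this illustration. In pandas, `DataFrame.dropna()` and `Series.fillna()` serve the same purpose. Which option is better depends on how much data is missing and why.

```python
import statistics

# None marks the '?' entries from the table above.
heights = {"A": 180, "B": None, "C": 172, "D": 155, "E": 160, "F": None}

known = [h for h in heights.values() if h is not None]

# Two candidate fill values computed from the known heights only.
mean_height = statistics.mean(known)
median_height = statistics.median(known)

# Option 1: drop people with a missing height.
dropped = {person: h for person, h in heights.items() if h is not None}

# Option 2: fill missing heights with the median,
# which is less sensitive to outliers than the mean.
imputed = {
    person: (h if h is not None else median_height)
    for person, h in heights.items()
}

print("mean:", mean_height, "median:", median_height)
print("dropped:", dropped)
print("imputed:", imputed)
```

With two of six values missing, dropping rows discards a third of the data, while imputing keeps every row but makes the heights look less varied than they really are.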

Exercises

EX-1 : Clean up data (college admission) (★☆☆)

EX-2 : Cleaning up House Sales Data (★★☆)

More Exercises

