Log normalization in Python
Now that we know that the Proline column in our wine dataset has a large amount of variance, let's log normalize it.
numpy has been imported as np.
This exercise is part of the course
Preprocessing for Machine Learning in Python
Exercise instructions
- Print out the variance of the `Proline` column for reference.
- Use the `np.log()` function on the `Proline` column to create a new, log-normalized column named `Proline_log`.
- Print out the variance of the `Proline_log` column to see the difference.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Print out the variance of the Proline column
print(____)
# Apply the log normalization function to the Proline column
wine[____] = np.____(____)
# Check the variance of the normalized Proline column
print(____)
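
For reference, the three steps can be sketched on a small stand-in DataFrame. This is only an illustration, not the course's solution: the `wine` DataFrame built here is hypothetical, its values are made up, and the variables `proline_var` and `proline_log_var` are introduced just for this example. It assumes pandas is available alongside numpy.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the course's wine dataset (made-up values)
wine = pd.DataFrame({"Proline": [1065.0, 1050.0, 735.0, 520.0, 1280.0, 680.0]})

# Variance of the raw Proline column, for reference
proline_var = wine["Proline"].var()
print(proline_var)

# Apply the natural log and store the result in a new column
wine["Proline_log"] = np.log(wine["Proline"])

# Variance of the log-normalized column
proline_log_var = wine["Proline_log"].var()
print(proline_log_var)
```

Because the natural log compresses large values far more than small ones, the variance of `Proline_log` comes out much smaller than that of `Proline`. Note that `np.log()` is only defined for positive inputs, so this approach assumes the column contains no zeros or negative values.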