ML | Handle Missing Data with Simple Imputer

0 0 1 minute read

SimpleImputer is a scikit-learn class which is helpful in handling the missing data in the predictive model dataset. It replaces the NaN values with a specified placeholder.
It is implemented by the use of the SimpleImputer() method which takes the following arguments :

missing_values : The missing_values placeholder which has to be imputed. By default is NaN
strategy : The data which will replace the NaN values from the dataset. The strategy argument can take the values – ‘mean'(default), ‘median’, ‘most_frequent’ and ‘constant’.
fill_value : The constant value to be given to the NaN data using the constant strategy.

Code: Python code illustrating the use of SimpleImputer class.

Python3

import numpy as np
 
# Importing the SimpleImputer class
from sklearn.impute import SimpleImputer
 
# Imputer object using the mean strategy and
# missing_values type for imputation
imputer = SimpleImputer(missing_values = np.nan,
                        strategy ='mean')
 
data = [[12, np.nan, 34], [10, 32, np.nan],
        [np.nan, 11, 20]]
 
print("Original Data : \n", data)
# Fitting the data to the imputer object
imputer = imputer.fit(data)
 
# Imputing the data    
data = imputer.transform(data)
 
print("Imputed Data : \n", data)

Output

Original Data : 

[[12, nan, 34]
[10, 32, nan]
[nan, 11, 20]]


Imputed Data : 

[[12, 21.5, 34]
[10, 32, 27]
[11, 11, 20]]

Remember: The mean or median is taken along the column of the matrix

Namachila 2 minutes ago

0 0 1 minute read

ML | Handle Missing Data with Simple Imputer

Python3

Namachila

Leave a Reply Cancel reply

Hibs and Ross County fans on final

Signs of life for Apple stock as Wall St

The ultimate guide to herbal teas

After all is said and done, more is said than done

Knowledge is power

The Future Of Possible

Xbox boss talks Project Scorpio price

Hibs and Ross County fans on final

Tip of the day: That man again

Python3

Namachila

Subscribe to our mailing list to get the new updates!

Synchronization by using Semaphore in Python

try-except vs If in Python

Related Articles

Python – find_element() method in Selenium

Python – Import CSV into PostgreSQL

numpy.empty() in Python

numpy.sin() in Python

Leave a Reply Cancel reply

Hibs and Ross County fans on final

Signs of life for Apple stock as Wall St

The ultimate guide to herbal teas

After all is said and done, more is said than done

Knowledge is power

The Future Of Possible

Xbox boss talks Project Scorpio price

Hibs and Ross County fans on final

Tip of the day: That man again