Encoding is a required preprocessing step when working with categorical data for machine learning algorithms. In this tutorial we will learn to convert our input or output sequence data to a one-hot encoding for use in sequence classification, using pd.get_dummies from pandas and the one-hot encoder from scikit-learn. In this blog post I also explain how to create a DataGenerator with a one-hot encoder to encode your labels in the same way for every batch.

The basic strategy is to convert each category value into a new column and assign a 1 or 0 (True/False) value to the column. In more technical terms, one-hot encoding is the process of converting each categorical value into a one-dimensional numerical vector. The scikit-learn method and the pandas method generate the same result.

One-hot encoding a NumPy array: the array a is the input before one-hot encoding, and the printed one_hot is the output after it.

import pandas as pd
import numpy as np
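# Input: four samples drawn from three categories (1, 2 and 3)
a = np.array([1, 2, 3, 1])
# One indicator column per distinct value; each row has exactly one hot entry
one_hot = pd.get_dummies(a)
print(one_hot)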
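
The claim that the scikit-learn and pandas methods agree can be checked with a short sketch. It is only an illustration under some assumptions: the variable names (pandas_one_hot, sklearn_one_hot, numpy_one_hot) are made up here, scikit-learn is assumed to be installed, and the plain NumPy variant (indexing an identity matrix) is an extra technique added for comparison rather than something the text above describes.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import OneHotEncoder

a = np.array([1, 2, 3, 1])

# pandas: one indicator column per distinct value, converted to a 0/1 integer array
pandas_one_hot = pd.get_dummies(a).to_numpy(dtype=int)

# scikit-learn: the encoder expects a 2-D array of shape (n_samples, n_features),
# and returns a sparse matrix by default, so it is densified with toarray()
encoder = OneHotEncoder()
sklearn_one_hot = encoder.fit_transform(a.reshape(-1, 1)).toarray().astype(int)

# plain NumPy (assumed alternative): map each value to its position among the
# sorted distinct values, then pick the matching rows of an identity matrix
categories, positions = np.unique(a, return_inverse=True)
numpy_one_hot = np.eye(len(categories), dtype=int)[positions]

print(np.array_equal(pandas_one_hot, sklearn_one_hot))  # True
print(np.array_equal(pandas_one_hot, numpy_one_hot))    # True
print(numpy_one_hot)
```

All three approaches order the columns by sorted category value, which is why the arrays match here. For the batch-wise use case mentioned above, a fitted scikit-learn encoder can be reused with encoder.transform so that every batch gets the same column layout.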