How to perform the frequent category imputation in machine learning?

by Amrita Mitra | Nov 16, 2022 | Data Preprocessing, Machine Learning Using Python

What is the frequent category imputation in machine learning?

If a column contains only numerical data, then we can use mean or median imputation to fill in the missing values of the column. But if a column contains categorical values, then we mostly use most frequent category imputation for filling in the missing values of the column.

Please note that the most frequent value in a column is also the mode of the values of the column. Hence, to fill in the missing categorical values, we can calculate the mode of the data and then, use the mode to fill in the missing values.

How to perform the frequent category imputation in machine learning?

Let’s read the titanic dataset. If we print the percentage of missing values in each column of the dataset, we will see some values are missing from the embark town column.

import seaborn

df = seaborn.load_dataset("titanic")
print(df.isnull().mean()*100)

The output shows the following:

survived        0.000000
pclass          0.000000
sex             0.000000
age            19.865320
sibsp           0.000000
parch           0.000000
fare            0.000000
embarked        0.224467
class           0.000000
who             0.000000
adult_male      0.000000
deck           77.216611
embark_town     0.224467
alive           0.000000
alone           0.000000
dtype: float64

So, there are almost 0.224467% missing values in the embark town column. So, let’s find out the most frequent value of the …

Pages: 1 2 3

Calculate the pseudoinverse of a matrix using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

What is the pseudoinverse of a matrix? We know that if A is a square matrix with full rank, then A-1 is said to be the inverse of A if the following condition holds: $latex AA^{-1}=A^{-1}A=I $ The pseudoinverse or the Moore-Penrose inverse of a matrix is a...

Cholesky decomposition using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

What is Cholesky decomposition? A square matrix A is said to have Cholesky decomposition if it can be written as a product of a lower triangular matrix and its conjugate transpose. $latex A=LL^{*} $ If all the entries of A are real numbers, then the conjugate...

Tensor Hadamard Product using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

In one of our previous articles, we already discussed what the Hadamard product in linear algebra is. We discussed that if A and B are two matrices of size mxn, then the Hadamard product of A and B is another mxn matrix C such that: $latex H_{i,j}=A_{i,j} \times...

Perform tensor addition and subtraction using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

We can use numpy nd-array to create a tensor in Python. We can use the following Python code to perform tensor addition and subtraction. import numpy A = numpy.random.randint(low=1, high=10, size=(3, 3, 3)) B = numpy.random.randint(low=1, high=10, size=(3, 3, 3)) C =...

How to create a tensor using Python?

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

What is a tensor? A tensor is a generalization of vectors and matrices. It is easily understood as a multidimensional array. For example, in machine learning, we can organize data in an m-way array and refer it as a data tensor. Data related to images, sounds, movies,...

How to combine NumPy arrays using horizontal stack?

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

We can use the hstack() function from the numpy module to combine two or more NumPy arrays horizontally. For example, we can use the following Python code to combine three NumPy arrays horizontally. import numpy A = numpy.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]]) B =...

How to combine NumPy arrays using vertical stack?

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

Let’s say we have two or more NumPy arrays. We can combine these NumPy arrays vertically using the vstack() function from the numpy module. For example, we can use the following Python code to combine three NumPy arrays vertically. import numpy A = numpy.array([[1, 2,...

Singular Value Decomposition (SVD) using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

What is Singular Value Decomposition (SVD)? Let A be an mxn rectangular matrix. Using Singular Value Decomposition (SVD), we can decompose the matrix A in the following way: $latex A_{m \times n}=U_{m \times m}S_{m \times n}V_{n \times n}^T $ Here, U is an mxm matrix....

Eigen decomposition of a square matrix using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

Let A be a square matrix. Let’s say A has k eigenvalues λ1, λ2, ... λk. And the corresponding eigenvectors are X1, X2, ... Xk. $latex X_1=\begin{bmatrix} x_{11} \\ x_{21} \\ x_{31} \\ ... \\ x_{k1} \end{bmatrix} \\ X_2=\begin{bmatrix} x_{12} \\ x_{22} \\ x_{32} \\ ......

How to calculate eigenvalues and eigenvectors using Python?

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

In our previous article, we discussed what eigen values and eigenvectors of a square matrix are and how we can calculate the eigenvalues and eigenvectors of a square matrix mathematically. We discussed that if A is a square matrix, then $latex (A- \lambda I) \vec{u}=0...

Amrita Mitra

Author

Ms. Amrita Mitra is an author, who has authored the books “Cryptography And Public Key Infrastructure“, “Web Application Vulnerabilities And Prevention“, “A Guide To Cyber Security” and “Phishing: Detection, Analysis And Prevention“. She is also the founder of Asigosec Technologies, the company that owns The Security Buddy.

0 Comments

Submit a Comment Cancel reply

You must be logged in to post a comment.

Continue with Google

Continue with LinkedIn

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Not a premium member yet?

Please follow the link below to buy The Security Buddy Premium Membership.

Buy Premium Membership

How to perform the frequent category imputation in machine learning?

What is the frequent category imputation in machine learning?

How to perform the frequent category imputation in machine learning?

Calculate the pseudoinverse of a matrix using Python

Cholesky decomposition using Python

Tensor Hadamard Product using Python

Perform tensor addition and subtraction using Python

How to create a tensor using Python?

How to combine NumPy arrays using horizontal stack?

How to combine NumPy arrays using vertical stack?

Singular Value Decomposition (SVD) using Python

Eigen decomposition of a square matrix using Python

How to calculate eigenvalues and eigenvectors using Python?

Amrita Mitra

0 Comments

Submit a Comment Cancel reply

Not a premium member yet?

Featured Posts

Recent Posts

Calculate the pseudoinverse of a matrix using Python

Cholesky decomposition using Python

Tensor Hadamard Product using Python

Perform tensor addition and subtraction using Python

How to create a tensor using Python?

How to combine NumPy arrays using horizontal stack?

How to combine NumPy arrays using vertical stack?

Singular Value Decomposition (SVD) using Python

Eigen decomposition of a square matrix using Python

How to calculate eigenvalues and eigenvectors using Python?

Categories

Not A Premium Member Yet?

How to perform the frequent category imputation in machine learning?

What is the frequent category imputation in machine learning?

How to perform the frequent category imputation in machine learning?

Related posts:

Amrita Mitra

0 Comments

Submit a Comment Cancel reply

Not a premium member yet?

Log In

Featured Posts

Recent Posts

Categories