How to provide custom headers to a CSV file using the pandas Python library?

by Amrita Mitra | Nov 12, 2022 | Machine Learning Using Python, Python Pandas

Sometimes a CSV file does not contain any DataFrame header. In such cases, the pandas Python library treats the first row of the DataFrame as a header. To address the problem, we can specify a custom header in the read_csv() function while creating a DataFrame.

For example, let’s say we are reading a CSV file iris2.csv and creating a DataFrame df.

import pandas

df = pandas.read_csv("iris2.csv")

print(df.head())

When we print the first few rows of the DataFrame, it shows the following:

   5.1  3.5  1.4  0.2  setosa
0  4.9  3.0  1.4  0.2  setosa
1  4.7  3.2  1.3  0.2  setosa
2  4.6  3.1  1.5  0.2  setosa
3  5.0  3.6  1.4  0.2  setosa
4  5.4  3.9  1.7  0.4  setosa

As we can see the DataFrame header is missing in the DataFrame. And pandas treats the first row of the DataFrame as the header.

So, we can provide the name of the columns as a header while reading the CSV file.

import pandas

df = pandas.read_csv("iris2.csv", names=["Sepal Length", "Sepal Width", "Petal Length", "Petal Width", "Species"])

print(df.head())

The five columns of the DataFrame are now named “Sepal Length”, “Sepal Width”, “Petal Length”, “Petal Width”, and “Species”, respectively. So, at this point, if we print the first few lines of the DataFrame, the output will be:

   Sepal Length  Sepal Width  Petal Length  Petal Width Species
0           5.1          3.5           1.4          0.2  setosa
1           4.9          3.0           1.4          0.2  setosa
2           4.7          3.2           1.3          0.2  setosa
3           4.6          3.1           1.5          0.2  setosa
4           5.0          3.6           1.4          0.2  setosa

As we can see the DataFrame header is successfully added while reading the CSV file using the read_csv() function.

Calculate the pseudoinverse of a matrix using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

What is the pseudoinverse of a matrix? We know that if A is a square matrix with full rank, then A-1 is said to be the inverse of A if the following condition holds: $latex AA^{-1}=A^{-1}A=I $ The pseudoinverse or the Moore-Penrose inverse of a matrix is a...

Cholesky decomposition using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

What is Cholesky decomposition? A square matrix A is said to have Cholesky decomposition if it can be written as a product of a lower triangular matrix and its conjugate transpose. $latex A=LL^{*} $ If all the entries of A are real numbers, then the conjugate...

Tensor Hadamard Product using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

In one of our previous articles, we already discussed what the Hadamard product in linear algebra is. We discussed that if A and B are two matrices of size mxn, then the Hadamard product of A and B is another mxn matrix C such that: $latex H_{i,j}=A_{i,j} \times...

Perform tensor addition and subtraction using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

We can use numpy nd-array to create a tensor in Python. We can use the following Python code to perform tensor addition and subtraction. import numpy A = numpy.random.randint(low=1, high=10, size=(3, 3, 3)) B = numpy.random.randint(low=1, high=10, size=(3, 3, 3)) C =...

How to create a tensor using Python?

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

What is a tensor? A tensor is a generalization of vectors and matrices. It is easily understood as a multidimensional array. For example, in machine learning, we can organize data in an m-way array and refer it as a data tensor. Data related to images, sounds, movies,...

How to combine NumPy arrays using horizontal stack?

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

We can use the hstack() function from the numpy module to combine two or more NumPy arrays horizontally. For example, we can use the following Python code to combine three NumPy arrays horizontally. import numpy A = numpy.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]]) B =...

How to combine NumPy arrays using vertical stack?

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

Let’s say we have two or more NumPy arrays. We can combine these NumPy arrays vertically using the vstack() function from the numpy module. For example, we can use the following Python code to combine three NumPy arrays vertically. import numpy A = numpy.array([[1, 2,...

Singular Value Decomposition (SVD) using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

What is Singular Value Decomposition (SVD)? Let A be an mxn rectangular matrix. Using Singular Value Decomposition (SVD), we can decompose the matrix A in the following way: $latex A_{m \times n}=U_{m \times m}S_{m \times n}V_{n \times n}^T $ Here, U is an mxm matrix....

Eigen decomposition of a square matrix using Python

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

Let A be a square matrix. Let’s say A has k eigenvalues λ1, λ2, ... λk. And the corresponding eigenvectors are X1, X2, ... Xk. $latex X_1=\begin{bmatrix} x_{11} \\ x_{21} \\ x_{31} \\ ... \\ x_{k1} \end{bmatrix} \\ X_2=\begin{bmatrix} x_{12} \\ x_{22} \\ x_{32} \\ ......

How to calculate eigenvalues and eigenvectors using Python?

by Amrita Mitra | October 3, 2023 | Featured, Linear Algebra | 0 Comments

In our previous article, we discussed what eigen values and eigenvectors of a square matrix are and how we can calculate the eigenvalues and eigenvectors of a square matrix mathematically. We discussed that if A is a square matrix, then $latex (A- \lambda I) \vec{u}=0...