Pandas

Pandas Overview:

Pandas is an open-source, high-level data manipulation tool used for data cleansing, data wrangling, and data analysis. It covers common data analysis tasks such as data extraction and loading, data manipulation, and handling missing values.

Introduction to Pandas Data Structures:

Pandas deals with the following types of data structures:

  • Series
  • DataFrame
  • Panel (deprecated in pandas 0.20 and removed in 0.25, so it is not available in current releases)

Series:

A Series is a one-dimensional labeled array capable of holding data of any type (integers, strings, floats, and so on).

Creating Series:

Syntax:
import pandas as pd
import numpy as np
s = pd.Series(data)  # data can be a list, a dictionary, a scalar, or a NumPy ndarray
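
The syntax above can be tried with a few concrete inputs. The following is a minimal sketch; the variable names and sample values are made up for illustration and do not come from any particular dataset.

```python
import numpy as np
import pandas as pd

# From a Python list: the index defaults to 0, 1, 2, ...
s_list = pd.Series([10, 20, 30])

# From a NumPy ndarray, with explicit index labels
s_array = pd.Series(np.array([1.5, 2.5, 3.5]), index=["a", "b", "c"])

# From a dictionary: the keys become the index labels
s_dict = pd.Series({"x": 1, "y": 2, "z": 3})

# From a scalar: the value is repeated once per index label
s_scalar = pd.Series(5, index=["p", "q"])

# Values can be looked up by label
print(s_array["b"])
```

Passing index is optional for lists and arrays; when it is omitted, pandas assigns integer labels starting at 0.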

DataFrame:

A DataFrame is a two-dimensional labeled data structure whose columns can hold different types. A spreadsheet is a familiar example of this kind of 2-D layout.

A pandas DataFrame can be created from various inputs, such as:
  • Lists
  • Dictionaries
  • Series
  • NumPy ndarrays
  • Another DataFrame
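
Each of the input types listed above can be passed to the pd.DataFrame constructor. The following is a rough sketch of one way to do it for each input; the column names and sample values are illustrative assumptions, not part of the original material.

```python
import numpy as np
import pandas as pd

# From a list of lists, naming the columns explicitly
df_list = pd.DataFrame([["Ann", 28], ["Bob", 35]], columns=["name", "age"])

# From a dictionary of lists: the keys become column names
df_dict = pd.DataFrame({"name": ["Ann", "Bob"], "age": [28, 35]})

# From a dictionary of Series: rows are aligned on the Series index
df_series = pd.DataFrame({
    "height": pd.Series([160, 175], index=["ann", "bob"]),
    "weight": pd.Series([55, 70], index=["ann", "bob"]),
})

# From a 2-D NumPy ndarray
df_array = pd.DataFrame(np.arange(6).reshape(3, 2), columns=["a", "b"])

# From another DataFrame (copy=True so the new frame does not share data)
df_copy = pd.DataFrame(df_dict, copy=True)

# Each column keeps its own data type
print(df_dict.dtypes)
```

Printing dtypes shows that the name and age columns have different types within the same DataFrame.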


