0% found this document useful (0 votes)

5 views9 pages

Pandas Assignment Version-2

The document provides an introduction to the Pandas library in Python, covering installation, data structures (Series and DataFrame), and various methods for creating these structures. It explains how Series can be used like a NumPy array and a dictionary, while DataFrames can be constructed from different data sources, including lists, dictionaries, and NumPy arrays. The document includes code examples to illustrate the concepts discussed.

Uploaded by

muhammadramzansial77

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views9 pages

Pandas Assignment Version-2

Uploaded by

muhammadramzansial77

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Introduction to Pandas: Data Manipulation in Python

1. Installing Pandas
Pandas is a powerful Python library specifically designed for handling structured data. It
simplifies tasks like data cleaning, transformation, and analysis by providing user-friendly data
structures and functions. To begin using it, you first need to install the library.

Example:

You can easily install Pandas using pip, Python's package manager.

pip install pandas

After installation, you can import it and check the version to confirm it's ready to use.

import pandas as pd

print(pd.__version__)

Output:

Pandas version: 2.1.4

2. Understanding Pandas Data Structures

The two fundamental data types in Pandas are the Series and the DataFrame. A Series is a
one-dimensional array-like object capable of holding various data types, while a DataFrame is
a two-dimensional, table-like structure. Think of a DataFrame as a spreadsheet or SQL table.

Example:

A Series can represent a single column of data.

A DataFrame is a collection of Series objects, where each Series represents a column.

import pandas as pd

# Creating a Series for a list of daily temperatures
temperatures = pd.Series([25, 27, 24, 26])
print(temperatures)
# Creating a DataFrame for student data
student_data = pd.DataFrame({
'Student_ID': [101, 102],
'Score': [85, 92]
})
print(student_data)

Output:

0 25
1 27
2 24
3 26
dtype: int64
Student_ID Score
0 101 85
1 102 92

3. Different Ways to Create a Series Object

A Pandas Series can be constructed from several different types of data sources, making it
highly versatile.

Example:

import pandas as pd

import numpy as np

# From a simple list
fruits = pd.Series(["apple", "banana", "orange"])

# From a NumPy array
np_array = np.array([10, 20, 30])
numbers = pd.Series(np_array)

# From a Python dictionary
# Keys become the index labels, and values become the data
product_prices = pd.Series({"Laptop": 1200, "Mouse": 25, "Keyboard": 75})

# From a single scalar value, repeated for a given index
# The value '50' is assigned to each index label
single_value_series = pd.Series(50, index=["item1", "item2", "item3"])

print(fruits)
print(numbers)
print(product_prices)
print(single_value_series)

Output:

0 apple
1 banana
2 orange
dtype: object
0 10
1 20
2 30
dtype: int64
Laptop 1200
Mouse 25
Keyboard 75
dtype: int64
item1 50
item2 50
item3 50
dtype: int64

4. Series as a Specialized NumPy Array

A Series can be seen as an enhanced version of a NumPy array. While it shares core features
like vectorized operations, it adds the crucial element of a labeled index, which allows for
more intuitive data access and alignment.

Example:

The .values attribute of a Series provides access to the underlying NumPy array, while the
.index attribute reveals the added labels.

import pandas as pd

gpa_scores = pd.Series([3.8, 3.5, 4.0], index=["A-1", "A-2", "A-3"])

# The core values (like a NumPy array)
print(gpa_scores.values)

# The labeled index (the extra feature)
print(f"Series index: {gpa_scores.index}")

Output:

Series values: [3.8 3.5 4. ]

Series index: Index(['A-1', 'A-2', 'A-3'], dtype='object')

5. Series as a Specialized Dictionary

A Series acts similarly to a Python dictionary, where the index labels serve as keys and the
data values are the associated values. This allows for quick and efficient data retrieval using
familiar dictionary-style syntax.

Example:

You can access data points in a Series using their index label, just as you would use a key to
look up a value in a dictionary.

import pandas as pd

city_populations = pd.Series([1000000, 250000, 500000], index=["Tokyo", "London", "Paris"])

# Accessing the population of "London"
print(f"Population of London: {city_populations['London']}")

Output:

Population of London: 250000

6. Understanding DataFrame Objects

A Pandas DataFrame is the most widely used data structure in Pandas. It’s a two-dimensional,
mutable table of data with labeled axes (rows and columns). It’s essentially a container for
multiple Series objects that share the same index.

Example:
import pandas as pd
# Creating a DataFrame from a dictionary of lists
# Each list becomes a column in the table
employee_data = {
'Employee_ID': [1, 2, 3],
'Department': ['IT', 'HR', 'Finance']
}

employee_df = pd.DataFrame(employee_data)
print(employee_df)

Output:

Employee_ID Department
0 1 IT
1 2 HR
2 3 Finance

7. DataFrame as a Specialized NumPy Array

Just as a Series extends a NumPy array, a DataFrame can be viewed as an extended
two-dimensional NumPy array. It not only contains a grid of data but also provides labels for
both rows and columns, making it much easier to work with.

Example:

You can create a DataFrame from a NumPy array and then add meaningful labels for the
columns and rows.

import numpy as np

import pandas as pd

# A 2x3 NumPy array
np_matrix = np.array([[10, 20, 30], [40, 50, 60]])

# Creating a DataFrame with column and row labels
df_from_array = pd.DataFrame(np_matrix, columns=["Col A", "Col B", "Col C"], index=["Row 1",
"Row 2"])
print(df_from_array)

Output:

Col A Col B Col C
Row 1 10 20 30
Row 2 40 50 60

8. DataFrame as a Specialized Dictionary

A DataFrame can also be understood as a dictionary where the keys are the column names
and the values are the corresponding Series objects. This means you can access a column
using dictionary-like syntax.

Example:

Accessing a specific column from a DataFrame is straightforward using bracket notation.

import pandas as pd

dataset = pd.DataFrame({
'Product': ['Phone', 'Tablet'],
'Price': [800, 450]
})

# Accessing the 'Price' column
prices = dataset['Price']
print(f"The prices are: \n{prices}")

Output:
The prices are:
0 800
1 450
Name: Price, dtype: int64

9. Constructing DataFrame Objects (Multiple

Methods)
DataFrames are incredibly flexible and can be created from a wide variety of data sources.
Here are some of the most common methods.
(a) From a Single Series
A single Series can be directly converted into a DataFrame. The
Series' index becomes the DataFrame's row index, and its values
become a single column.
import pandas as pd

scores_series = pd.Series([95, 88, 72], name="Exam_Scores")
scores_df = pd.DataFrame(scores_series)
print(scores_df)

Output:

Exam_Scores
0 95
1 88
2 72

(b) From a List of Dictionaries

This is a very common method, where each dictionary in the list represents a single row, and
the dictionary keys become the column names.

import pandas as pd

project_members = [
{"Name": "Alex", "Role": "Developer"},
{"Name": "Ben", "Role": "Designer"},
{"Name": "Chris", "Role": "Manager"}
]
project_df = pd.DataFrame(project_members)
print(project_df)

Output:

Name Role
0 Alex Developer
1 Ben Designer
2 Chris Manager
(c) From a Dictionary of Series Objects
By using a dictionary where the keys are column names and the values are Series objects, you
can build a DataFrame with aligned columns.

import pandas as pd

# Creating two Series with a shared index
units = pd.Series([150, 200], index=["Q1", "Q2"])
revenue = pd.Series([5000, 7500], index=["Q1", "Q2"])

sales_report = pd.DataFrame({"Units_Sold": units, "Total_Revenue": revenue})
print(sales_report)

Output

Units_Sold Total_Revenue
Q1 150 5000
Q2 200 7500

(d) From a Two-Dimensional NumPy Array

A 2D NumPy array can be used as the foundation for a DataFrame. You can then add column
and row labels for better readability.

import numpy as np

import pandas as pd

data_array = np.array([[1, 2, 3], [4, 5, 6]])
dataset_df = pd.DataFrame(data_array, columns=["A", "B", "C"])
print(dataset_df)

Output:

A B C
0 1 2 3
1 4 5 6

(e) From a NumPy Structured Array

This method is useful when you have data with a mix of data types (e.g., numbers and strings)
that you want to organize into a DataFrame.

import numpy as np

import pandas as pd

# A structured array with a defined data type for each field
employee_info = np.array([
(101, "John", 60000),
(102, "Jane", 75000)
], dtype=[("ID", "i4"), ("Name", "U10"), ("Salary", "i4")])

employee_info_df = pd.DataFrame(employee_info)
print(employee_info_df)

Output:

ID Name Salary
0 101 John 60000
1 102 Jane 75000

Subject IP
No ratings yet
Subject IP
9 pages
The Pandas Library
No ratings yet
The Pandas Library
39 pages
Introduction To Pandas For Data Analysis
No ratings yet
Introduction To Pandas For Data Analysis
6 pages
Lab-3 Pandas Library
No ratings yet
Lab-3 Pandas Library
18 pages
Pandas 2
No ratings yet
Pandas 2
36 pages
Pandas Shan Ver2
No ratings yet
Pandas Shan Ver2
25 pages
FDS Module 2 Notes
No ratings yet
FDS Module 2 Notes
24 pages
Week 4.1
No ratings yet
Week 4.1
16 pages
Data Handling Using Pandas-1
No ratings yet
Data Handling Using Pandas-1
60 pages
Pandas
No ratings yet
Pandas
163 pages
14 Pandas
No ratings yet
14 Pandas
25 pages
Unit 4
No ratings yet
Unit 4
36 pages
Python Data Processing
No ratings yet
Python Data Processing
36 pages
Pandas
No ratings yet
Pandas
82 pages
Unit I: Data Handling Using Pandas and Data Visualization: Marks:30
No ratings yet
Unit I: Data Handling Using Pandas and Data Visualization: Marks:30
75 pages
Unit III Part 2 1725700061785
No ratings yet
Unit III Part 2 1725700061785
85 pages
ML Unit-2 Notes
No ratings yet
ML Unit-2 Notes
17 pages
Pandas - Ipynb - Colab
No ratings yet
Pandas - Ipynb - Colab
8 pages
The Pandas Series Object-Print
No ratings yet
The Pandas Series Object-Print
16 pages
Class 12 Panda Project
No ratings yet
Class 12 Panda Project
13 pages
Pandas Series and DataFrame Guide
No ratings yet
Pandas Series and DataFrame Guide
10 pages
Data Manipulation With Pandas
No ratings yet
Data Manipulation With Pandas
38 pages
Pandas Assignment 3
No ratings yet
Pandas Assignment 3
5 pages
Pandas Series - Notes For PA3
No ratings yet
Pandas Series - Notes For PA3
9 pages
Pandas
No ratings yet
Pandas
12 pages
Data Manipulation With Pandas
No ratings yet
Data Manipulation With Pandas
138 pages
Class Notes: Class: XII Date: 7-Apr-2020 Subject: Informatics Practices Topic: 2. Python Pandas
No ratings yet
Class Notes: Class: XII Date: 7-Apr-2020 Subject: Informatics Practices Topic: 2. Python Pandas
4 pages
Python Pandas Module - Introduction-07-11-2023
No ratings yet
Python Pandas Module - Introduction-07-11-2023
84 pages
Python Pandas
No ratings yet
Python Pandas
177 pages
UNIT II Notes
No ratings yet
UNIT II Notes
23 pages
Python Pandas New Sylabus
No ratings yet
Python Pandas New Sylabus
53 pages
Cheat Sheet: The Pandas Dataframe Object: Column Index (DF - Columns)
No ratings yet
Cheat Sheet: The Pandas Dataframe Object: Column Index (DF - Columns)
6 pages
Lab-3 Pandas Library
No ratings yet
Lab-3 Pandas Library
14 pages
XII - Ip - Panda - I - Part - I - 2023 (1) 1 1
No ratings yet
XII - Ip - Panda - I - Part - I - 2023 (1) 1 1
25 pages
Data Science - Unit-3-Part-2
No ratings yet
Data Science - Unit-3-Part-2
32 pages
Pandas Research
No ratings yet
Pandas Research
14 pages
Pandas Notes
No ratings yet
Pandas Notes
19 pages
Pandas & Numpy
No ratings yet
Pandas & Numpy
32 pages
Python Pandas ch-2
No ratings yet
Python Pandas ch-2
56 pages
Introduction To Pandas & Data Structures
No ratings yet
Introduction To Pandas & Data Structures
11 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
16 pages
Pandas
No ratings yet
Pandas
13 pages
UNIT - 3 Pandas
No ratings yet
UNIT - 3 Pandas
21 pages
Python Pandas
No ratings yet
Python Pandas
21 pages
Pandas DataFrame Basics Guide
No ratings yet
Pandas DataFrame Basics Guide
41 pages
Unit I: Data Handling Using Pandas and Data Visualization: Marks:25
No ratings yet
Unit I: Data Handling Using Pandas and Data Visualization: Marks:25
135 pages
Unit 3 (FODS)
No ratings yet
Unit 3 (FODS)
34 pages
Unit - 1 - Python Pandas
No ratings yet
Unit - 1 - Python Pandas
176 pages
Pandas
No ratings yet
Pandas
57 pages
Pandas
No ratings yet
Pandas
12 pages
Block 1-Data Handling Using Pandas DataFrame
No ratings yet
Block 1-Data Handling Using Pandas DataFrame
17 pages
All Document Reader 1715619870900
No ratings yet
All Document Reader 1715619870900
6 pages
Data Handling Using Pandas - 1-2-1
No ratings yet
Data Handling Using Pandas - 1-2-1
10 pages
Pandas Notes
No ratings yet
Pandas Notes
44 pages
Class 12 Practical File
No ratings yet
Class 12 Practical File
29 pages
Data Manipulation With Pandas
No ratings yet
Data Manipulation With Pandas
38 pages
Mdad - Numpy ML
No ratings yet
Mdad - Numpy ML
85 pages
Python Programming For Data Science
No ratings yet
Python Programming For Data Science
36 pages
Final Formatted After Iloc Loc
No ratings yet
Final Formatted After Iloc Loc
34 pages
Fluid Lecture 6
No ratings yet
Fluid Lecture 6
14 pages
Partial Fraction Solution
100% (1)
Partial Fraction Solution
2 pages
Past Papers of Bs Math 6th Semester PDF
100% (1)
Past Papers of Bs Math 6th Semester PDF
8 pages
Summary 21 - 25 Sem. 4
No ratings yet
Summary 21 - 25 Sem. 4
3 pages
MRS
100% (1)
MRS
7 pages
Reading - 3 DSS
No ratings yet
Reading - 3 DSS
4 pages
Image To PDF 20250419 05.51.35
100% (1)
Image To PDF 20250419 05.51.35
4 pages
Complex Analysis Solved Mid Final
100% (1)
Complex Analysis Solved Mid Final
3 pages
Change of Basis
100% (1)
Change of Basis
13 pages
Reding - 4 DSS
No ratings yet
Reding - 4 DSS
3 pages
Whistleblower Solutions for Firms
No ratings yet
Whistleblower Solutions for Firms
4 pages
Train Tours to Sapa: Top 5 Companies
No ratings yet
Train Tours to Sapa: Top 5 Companies
5 pages
Serrano v. Central Bank of The Philippines
No ratings yet
Serrano v. Central Bank of The Philippines
2 pages
Techbotzlogo Rebrand
No ratings yet
Techbotzlogo Rebrand
22 pages
Yeast - WPS Office
No ratings yet
Yeast - WPS Office
4 pages
Listado de Partes
No ratings yet
Listado de Partes
299 pages
Video Teaching Boosts Patient Safety
No ratings yet
Video Teaching Boosts Patient Safety
7 pages
Libro Ingles ID 3 Profesores
No ratings yet
Libro Ingles ID 3 Profesores
192 pages
Disassembly and Assembly: Automatic Transmission
No ratings yet
Disassembly and Assembly: Automatic Transmission
1 page
Case Analysis - PACADI
0% (1)
Case Analysis - PACADI
12 pages
Regional and Community Techniques in Food Preparation
No ratings yet
Regional and Community Techniques in Food Preparation
5 pages
Data Analytics
100% (3)
Data Analytics
190 pages
MLA Style
No ratings yet
MLA Style
4 pages
Electrical Layout and Estimate 2nd Edition by Max B. Fajardo JR., Leo R. Fajardo
92% (141)
Electrical Layout and Estimate 2nd Edition by Max B. Fajardo JR., Leo R. Fajardo
349 pages
Reisetter, Matt - Reisetter For Iowa House - 1631 - DR2 - Summary
No ratings yet
Reisetter, Matt - Reisetter For Iowa House - 1631 - DR2 - Summary
1 page
Customer Service Management Guide
No ratings yet
Customer Service Management Guide
24 pages
Assessment Task 2 Instructions: Answer
No ratings yet
Assessment Task 2 Instructions: Answer
8 pages
Product Inspection
No ratings yet
Product Inspection
16 pages
BoardingCard 345631232 VNO BVA
No ratings yet
BoardingCard 345631232 VNO BVA
1 page
Module 6 Stoichiometry 1
No ratings yet
Module 6 Stoichiometry 1
37 pages
Solving Problems Involving Loans
No ratings yet
Solving Problems Involving Loans
13 pages
LLM Dissertation Handbook Edinburgh
100% (2)
LLM Dissertation Handbook Edinburgh
6 pages
IS208 PROFESSIONAL ISSUES IN INFORMATION SYSTEMS Revised
67% (3)
IS208 PROFESSIONAL ISSUES IN INFORMATION SYSTEMS Revised
2 pages
History of English Puppet Theater PDF
100% (5)
History of English Puppet Theater PDF
362 pages
Future Literacy 30-1 Vocab Practice
No ratings yet
Future Literacy 30-1 Vocab Practice
128 pages
Gr-10 - Unit 1 - Communication Skills
No ratings yet
Gr-10 - Unit 1 - Communication Skills
8 pages
Cot DLP Oral Com q1 2019
No ratings yet
Cot DLP Oral Com q1 2019
1 page
Manual DA5
No ratings yet
Manual DA5
71 pages
International Retailing
100% (1)
International Retailing
29 pages
All Netapp2
No ratings yet
All Netapp2
167 pages

Pandas Assignment Version-2

Uploaded by

Pandas Assignment Version-2

Uploaded by

Introduction to Pandas: Data Manipulation in Python

pip install pandas​

import pandas as pd​

Pandas version: 2.1.4​

2. Understanding Pandas Data Structures

A Series can represent a single column of data.

3. Different Ways to Create a Series Object

import pandas as pd​

4. Series as a Specialized NumPy Array

import pandas as pd​

Series values: [3.8 3.5 4. ]​

5. Series as a Specialized Dictionary

import pandas as pd​

Population of London: 250000​

6. Understanding DataFrame Objects

7. DataFrame as a Specialized NumPy Array

import numpy as np​

8. DataFrame as a Specialized Dictionary

Accessing a specific column from a DataFrame is straightforward using bracket notation.

import pandas as pd​

9. Constructing DataFrame Objects (Multiple

(b) From a List of Dictionaries

import pandas as pd​

import pandas as pd​

(d) From a Two-Dimensional NumPy Array

import numpy as np​

(e) From a NumPy Structured Array

import numpy as np​

You might also like

pip install pandas

import pandas as pd

Pandas version: 2.1.4

import pandas as pd

import pandas as pd

Series values: [3.8 3.5 4. ]

import pandas as pd

Population of London: 250000

import numpy as np

import pandas as pd

import pandas as pd

import pandas as pd

import numpy as np

import numpy as np