KEMBAR78
AirBnB Data Analysis - Architecture | PDF
0% found this document useful (0 votes)
139 views7 pages

AirBnB Data Analysis - Architecture

This document provides an architecture design for analyzing Airbnb listing and review data. It describes importing CSV datasets, performing exploratory data analysis to understand the data, cleaning and preprocessing the data, defining use cases for analysis, analyzing the data, and visualizing insights. The goal is to gain meaningful insights from the data to help inform business decisions.

Uploaded by

maqbool bhai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
139 views7 pages

AirBnB Data Analysis - Architecture

This document provides an architecture design for analyzing Airbnb listing and review data. It describes importing CSV datasets, performing exploratory data analysis to understand the data, cleaning and preprocessing the data, defining use cases for analysis, analyzing the data, and visualizing insights. The goal is to gain meaningful insights from the data to help inform business decisions.

Uploaded by

maqbool bhai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

Architecture Design

MERGEFOR
MAT 2

Architecture
Travel Data Analysis
(AirBnB Data Analysis)

Written By / Author Lokesh Attarde


Document Version LLD-V1.0
Last Revised Date 15/10/2021

AIRBNB DATA ANALYSIS 1


Architecture Design
MERGEFOR
MAT 2

Document Version Control:


Date Version Author Comments

15/10/2021 V1.0 Lokesh Attarde First Draft

Approval Status:
Version Review Data Reviewed By Approved By Comments
V1.0

AIRBNB DATA ANALYSIS 2


Architecture Design
MERGEFOR
MAT 2

Contents
Document Version Control 2
1 Introduction 4
1.1 Why this Architecture design document? 4
1.2 Scope 4
2 Architecture 5
2.1 Architecture Description 5
2.1.1 Data Description 5
2.1.2 Define the Use Cases 5
2.1.3 Import the Dataset 5
2.1.4 Exploratory Data Analysis (EDA) 6
2.1.5 Data Pre-processing, Data Cleaning & Imputation (Handling the Categorical &
Numerical Variables) 6
2.1.6 Analyse the Data 7
2.1.7 Visualize & Share Meaningful Insights 7

AIRBNB DATA ANALYSIS 3


Architecture Design
MERGEFOR
MAT 2

1 Introduction

1.1 Why this Architecture design document?


The purpose of this document is to provide a detailed architecture design of the Airbnb Data Analysis
Project by focusing on each of the attributes of our architecture.
This document will address the background of this project, and the architecturally significant function
requirements. The intension of this document is to help the development team to determine how the
system will be structured at the highest level.

1.2 Scope
Architecture Design Document (ADD) is an architecture design process that follows a step-by-step
refinement process. The process can be used for designing data structures, required software
architecture, source code and ultimately, performance algorithms. Overall, the design principles may
be defined during requirement analysis and then refined during architectural design work.

AIRBNB DATA ANALYSIS 4


Architecture Design
MERGEFOR
MAT 2

2 Architecture

2.1 Architecture Description –


2.1.1 Data Description –
In this analysis project, our listings dataset have around 1.19 Lacs of records with 20 different features.
Features are distributed as 10 Continuous features and 10 Categorical features and in our reviews
dataset, we have around 3.44 Lacs of records with 6 different features among them there are 3
Continuous features and 3 Categorical features. These datasets are given in the form of Comma
Separated Value (.csv) format.

2.1.2 Define the Use Cases –


At this stage, based on the given dataset and business problems we have defined the several Use
Cases to perform the analysis on and this will definitely help out get the key insights from this data
based on which business decisions will be taken. Furthermore, It helps in not only understanding the
meaningful relationships between attributes but it also allows us to do our own research and
come-up with our findings.

2.1.3 Import the Dataset –


As we have received the dataset in the form of Comma Separated Value (.csv) format, therefore we
can import the same using Pandas read_csv( ) function.

AIRBNB DATA ANALYSIS 5


Architecture Design
MERGEFOR
MAT 2
2.1.4 Exploratory Data Analysis (EDA) –
 "Exploratory Data Analysis" (EDA) is a "Data Exploration" step in the Data Analysis Process,
where a number of techniques are used to better understand the dataset being used.
 Understanding the Dataset can refer to a number of things including but not limited to…
 Extracting Important "Variables".
 Identifying "Outliers", "Missing Values", or "Human Error".
 Understanding the Relationships between variables.
 Ultimately, maximizing our insights of a dataset and minimizing potential "Error" that
may occur later in the process.
 In other words, it will gives you a better Understanding of the "Variables" and
the "Relationships" between them.
 Here, we make use of dataprep module to automate our EDA process.
 It provides the following information:
 Overview: detect the types of columns in a DataFrame.
 Variables: variable type, unique values, distinct count, missing values
 Quartile statistics like minimum value, Q1, median, Q3, maximum, range, interquartile
range
 Descriptive statistics like mean, mode, standard deviation, sum, median absolute
deviation, coefficient of variation, kurtosis, skewness.
 Correlations: highlighting of highly correlated variables, Spearman, Pearson and
Kendall matrices
 Missing Values: Bar Chart, Heatmap and spectrum of missing values.

2.1.5 Data Pre-processing, Data Cleaning & Imputation (Handling the


Categorical & Numerical Variables) –
Data pre-processing is a process of preparing the raw data and making it suitable for our analysis
purpose, where we have to do lot of Data Cleaning, handle the missing values by using appropriate
imputation techniques and based on that variable nature i.e. either of Categorical & Numerical
variable. Here, in this project, we have done the substitution/imputation of missing values using either
mean, median or mode according to the nature of those variables. Moreover, we also removed the
columns which are does not participate in our analysis.

AIRBNB DATA ANALYSIS 6


Architecture Design
MERGEFOR
MAT 2
2.1.6 Analyse the Data –
Once the pre-processing is done, we are good to go with our actual analysis where we write lines of
codes and logics to prepare our data as per the defined use cases.

2.1.7 Visualize & Share Meaningful Insights –


Finally, it’s time to turn our data into some sort of visual representation. In short, Data visualization
is the process of translating large data sets and metrics into charts, graphs and other visuals such as
Bar Plot, Pie Chart, Heat map, Box Plot, Scatter Plot, and many more. The resulting visual
representation of data makes it easier to identify and share insights about the information
represented in the data.

Here is the beautiful glimpse of one of our visuals are –

All those different analysis help out to make better business decisions and help analyse customer
trends and satisfaction, which can lead to new and better products and services.

AIRBNB DATA ANALYSIS 7

You might also like