0% found this document useful (0 votes)

16 views65 pages

Data Analytics Using WEKA

Uploaded by

Joyce Wm Wong

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views65 pages

Data Analytics Using WEKA

Uploaded by

Joyce Wm Wong

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 65

NTW: Data analytics for Layman using WEKA

Introduction

1
Introduction

“…..we are actually living in the data age” (Han et al., 2012)

2
Data
• Data are any facts, numbers, or text that can be processed by a
computer. This includes:
• Operational or transactional data – sales, cost, inventory, payroll,
accounting, etc.
• Non-operational data – industry sales, forecast data, remote sensors on a
satellite, microarrays generating gene expression data, microeconomic
data, etc.
• Metadata: data about the data itself such as logical database design or
data dictionary definitions – “data about data”

3
Metadata

4
Data
• Modern ICT technologies make it
very easy to generate large
volumes of data, and because
storage is quite cheap, there is a
tendency to keep that data
regardless of whether it has a
point.
• Every organization benefits from
collecting and analysing its data.
• Analysing data is crucial for
knowledge-driven decision
making.
• The problem then becomes how
to analyse these data.

5
Data Analysis
• Data analysis (analysis of data or data analytics) – is a
process of inspecting, cleansing, transforming, and
modeling data with the goal of discovering useful information,
suggesting conclusions, and supporting decision-making.

• Data analysis method includes simple query and reporting,

statistical analysis, more complex multidimensional analysis,
and data mining.

6
Definition of Data Mining
• Data mining is the process of findings interesting structure
in data (Roiger, 2017).

• Data mining is the process of automatically discovering

useful information in large data repositories (Tan et al.,
2006).

• Data mining is the application of specific algorithms for

extracting patterns from data (Fayyad, 1996).

7
Definition of Data Mining

“Data mining is a process of discovering

various models, summaries and derived values
from a given collection of data”.
(Kantardzic, 2020)

8
Data Mining Process

(Kantardzic, 2020)

9
Data Mining Process

Knowledge Discovery in Databases (KDD)

10
Data Mining Techniques
Unsupervised Learning Supervised Learning
Independent Variable (x) Independent Variable (x) Dependent Variable (y)
• Clustering • Simple Linear Regression • Simple Linear Regression
Numerical

▪ K-means • Multiple Regression • Multiple Regression

• Logistic Regression
• Decision Trees

• Association rules • Logistic Regression • Logistic Regression

Categorical

▪ Apriori • Decision Trees • Decision Trees

▪ FP-growth
Data Mining Tools

12
Data Mining Tools

13
Data Mining Applications
Area Application
Finance/ Banking Credit card analysis, loyal customers
Insurance Claims, fraud, churn analysis
Telecommunication Call record analysis
Transport Logistics management
Consumer goods Promotion analysis
Scientific research Image, video, speech pattern analysis
Utilities Power usage analysis
Law enforcement Crime analysis

14
NTW: Basic Data analytics using WEKA
Data Pre-Processing

15
Data, Objects and Attributes
Attributes
• Data is a collection of objects and
their attributes ID Age Height Height Books
(m) (ft) Category
• An attribute is a property or
101 20 1.75 5.74 SDA321
characteristic of an object
• Examples: eye color of a 102 18 1.80 5.91 ENG364
person, temperature, etc.
103 21 1.53 5.20 IT543
• Attribute is also known as variable,
field, characteristic, or feature Objects 104 55 1.65 5.41 SDQ735

• A collection of attributes describe 105 20 1.58 5.18 IT954

an object
106 21 1.63 5.35 ENG735
• Object is also known as record,
point, case, sample, entity, or 107 19 1.72 5.64 SDM628
instance
Types of Attribute
Attribute
Types

Categorical Numerical

Nominal Ordinal Interval Ratio

Qualitative types Quantitative types

Properties of Attributes Values
• The type of an attribute depends on which of the
following properties it possesses:
✓ Distinctness: = ≠
✓ Order: < >
✓ Addition/subtraction: + -
✓ Multiplication/division: * /

❖ Nominal attribute: distinctness

❖ Ordinal attribute: distinctness and order
❖ Interval attribute: distinctness, order and addition/subtraction
❖ Ratio attribute: all 4 properties
Why pre-processing the data?
• Some data pre-processing is needed for all mining
tools.

• The purpose of pre-processing is to transform data

sets so that their information content is best exposed
to the mining tools.

• Pre-processing data also prepares the miner so that

when using prepared data, the miner produces better
models.
Major tasks in data pre-processing
1. Data Cleaning

2. Data Integration

3. Data Transformation

4. Data Reduction

20
1. Data Cleaning
• Data in the real world is dirty: Lots of potentially incorrect data, e.g., instrument
faulty, human or computer error, transmission error, etc.
– Incomplete (missing values): lacking attribute values, lacking certain
attributes of interest, or containing only aggregate data
❖ e.g., Occupation = “ ” (missing values)
– Noisy: containing noise, errors, or outliers
❖ e.g., Salary = “−10” (an error)
– Inconsistent: containing discrepancies in codes or names, e.g.,
❖ Age = “42”, Birthday = “03/07/2010”
❖ Was rating “1, 2, 3”, now rating “A, B, C”
❖ discrepancy between duplicate records
– Intentional (e.g., disguised missing data)
21 ❖ Jan. 1 as everyone’s birthday?
NTW: Data analytics for Layman using WEKA
Data Pre-Processing (HANDS-ON)

22
Preparing .csv file
• Open an excel file
(Data2.xlsx)
• Save as Data2.csv
• Close the Data2.csv
file.

23
Data Pre-Processing in WEKA
• Open the
Data2.csv file in
WEKA.
• Check the
attribute
names.
• All attribute
types were set
as numeric.
• Is these
correct?
24
Data Pre-Processing in WEKA
Click edit to
view all the
dataset.

How to
change the
NoMatrik
attribute
as nominal
type?

25
Data Pre-Processing in WEKA
Choose>Filter>Unsupervised>Attribute>NumericToNominal

Right-click

Change first-last
to first only

26
Data Pre-Processing in WEKA
Choose>Add>Right-Click

Change to last

Change to
Grade and
Nominal
attribute

Type A+, A, B+,

B, C, D, E, F

27
Data Pre-Processing in WEKA
Set the Grade
attribute as A+, A,
B+, B, C, D, E or F

90 – 100 A+
80 – 89 A
70 – 79 B+
60 – 69 B
50 – 59 C
40 – 49 D
30 – 39 E
20 – 29 F
10 – 19 F
0–9 F

28
Data Pre-Processing in WEKA

29
Data Pre-Processing in WEKA

Save the file

30
NTW: Data analytics for Layman using WEKA

Data Pre-processing and Knowledge Discovery

Using Weka to Generate Apriori Algorithms

31
Data Pre-Processing in WEKA
• Waikato Environment for Knowledge Analysis
(WEKA)
• https://www.cs.waikato.ac.nz/ml/weka/index.
html
WEKA
• Weka is a collection of machine learning
algorithms for data mining tasks.
• The algorithms can either be applied directly to a
dataset or called from your own Java code.
• Weka contains tools for data pre-processing,
classification, regression, clustering, association
rules and visualization.
WEKA
The Explorer
• Gives access to all facilities of Weka using menu selection
and form filling
• Prepare the data, open the Explorer and load the data
• Flip back and forth between results, evaluate models built
on different datasets and visualize graphically both and
models and datasets, including classification errors

35
Preparing the data
• Data can be imported from a file in various
formats: e.g
– CSV (comma-separated values)
– ARFF (attribute-relation file format)
– Binary serialized instances
– Matlab ASCII files

36
Preparing the data
• ARFF files have 2 sections HEADER and DATA
• HEADER:
@RELATION dataset name

@ATTRIBUTE attribute name and type

• DATA:
@DATA
list of dataset

37
Attributes in WEKA
• Nominal: one of a predefined list of values
- e.g. red, green, blue
• Numeric: A real or integer number
• String: Enclosed in “double quotes”
• Date
• Relational

38
Apriori Algorithm in Weka

39
Using Weka to Generate Apriori Algorithm
Open an Excel file name
Using Weka to Generate Apriori Algorithm

Save as csv file>

click Save> click Yes
Using Weka to Generate Apriori Algorithm
Since most of the
attribute types of the
data is not numeric,
the file need to be
converted to .arff first
before it can be
opened in Weka.

Open the csv

file in MS Word
and you will get
the data just
like this
Using Weka to Generate Apriori Algorithm
@RELATION

@ATTRIBUTE string

@DATA

Remove , ,
Using Weka to Generate Apriori Algorithm

Save As >
click Save
> click OK
> close the
file
Using Weka to Generate Apriori Algorithm
Open the MS
Word of
Women_clothing
in the Notepad

Save as the file as

Women_clothing.arff
Using Weka to Generate Apriori Algorithm
Then, you will get the ARFF type of Women_clothing file which can be opened in Weka
Using Weka to Generate Apriori Algorithm
Open the
Women_clothing.arff
file in Weka.

Inspect the attribute

type for each of
attributes

To do an Apriori
algorithm, you need
to transform the
string type to
nominal type.
Using Weka to Generate Apriori Algorithm
1. Select> filters>
unsupervised >
attributes >
StringToNominal>
right-click > Show
properties > write 1-7

2. Click Apply
Using Weka to Generate Apriori Algorithm
Inspect all the
attribute types

Remove Gender
and Age as we
want to remain
the nominal
attributes in
order to do an
Apriori
algorithm
Using Weka to Generate Apriori Algorithm
Select Associate
> right-click >
Show properties

Minimum
support
Using Weka to Generate Apriori Algorithm

Evaluate and
interpret the
results

Save As the file as

Women_clothing1.arff
NTW: Data analytics for Layman using WEKA
Pattern and Knowledge Discovery
Using WEKA for Clustering

52
Clustering
Open Iris data in
WEKA database
file

Check the
attributes

Choose No class
Clustering

Cluster tab >

Clusterer >
Choose
Clustering

SimpleKMeans
Clustering
Right-click >
Show
properties…
Clustering

Change the
numClusters
into 3

Click OK
Clustering

Click Start
Clustering

Analyze
the result
Clustering
Clustering

Right-click >
Visualize
cluster
assignments
Clustering
Choose:
X: TotalsalesQuantity(Num)
Y: TotalSalesValue(Num)

Drag Jitter to the

right side

Examine the
result of cluster
analysis
Wants to dig more??
Hope we have more time!
Neural
Decision Tree
Network
(with source
code!)
References
• Data mining and KDD (SIGKDD: CDROM)
– Conferences: ACM-SIGKDD, IEEE-ICDM, SIAM-DM, PKDD, PAKDD, etc.
– Journal: Data Mining and Knowledge Discovery, KDD Explorations
• Database systems (SIGMOD: CD ROM)
– Conferences: ACM-SIGMOD, ACM-PODS, VLDB, IEEE-ICDE, EDBT, ICDT, DASFAA
– Journals: ACM-TODS, IEEE-TKDE, JIIS, J. ACM, etc.
• AI & Machine Learning
– Conferences: Machine learning (ML), AAAI, IJCAI, COLT (Learning Theory), etc.
– Journals: Machine Learning, Artificial Intelligence, etc.
• Statistics
– Conferences: Joint Stat. Meeting, etc.
– Journals: Annals of statistics, etc.
• Visualization
– Conference proceedings: CHI, ACM-SIGGraph, etc.
– Journals: IEEE Trans. visualization and computer graphics, etc.
• Website:
– http://www.kdnuggets.com/
Thank you and see you again next time

Ask me personally or interested to become my post-graduate

student?
nurdiyana@upnm.edu.my

+0176962946

Weka Data Mining Lab Guide
No ratings yet
Weka Data Mining Lab Guide
20 pages
Why Data Mining
No ratings yet
Why Data Mining
12 pages
BI - Experiment - No - 1
No ratings yet
BI - Experiment - No - 1
7 pages
DMDV 210
No ratings yet
DMDV 210
63 pages
Big Data & Weka Tool Guide
No ratings yet
Big Data & Weka Tool Guide
32 pages
Weka: A Tool For Data Preprocessing, Classification, Ensemble, Clustering and Association Rule Mining
No ratings yet
Weka: A Tool For Data Preprocessing, Classification, Ensemble, Clustering and Association Rule Mining
4 pages
WEKA Data Mining Lab Manual
100% (1)
WEKA Data Mining Lab Manual
8 pages
Dataminingg
No ratings yet
Dataminingg
22 pages
Experiment 1: Installation of WEKA Tool Aim
No ratings yet
Experiment 1: Installation of WEKA Tool Aim
19 pages
Data Mining in Bioinformatics
No ratings yet
Data Mining in Bioinformatics
21 pages
Data Mining Lab Report
No ratings yet
Data Mining Lab Report
36 pages
WEKA Intro
No ratings yet
WEKA Intro
17 pages
DWDM Lab Manual
No ratings yet
DWDM Lab Manual
47 pages
DWBI Lab Manual 2023-24 Final
No ratings yet
DWBI Lab Manual 2023-24 Final
40 pages
DM Lab Manualiii I 1 Mrits
No ratings yet
DM Lab Manualiii I 1 Mrits
39 pages
OS Journal
No ratings yet
OS Journal
28 pages
Printing 1-3
No ratings yet
Printing 1-3
36 pages
DMW LabFile 0901CS243D11 Swastik
No ratings yet
DMW LabFile 0901CS243D11 Swastik
25 pages
DMDV 210
No ratings yet
DMDV 210
61 pages
Lecture 12 - Weka Tutorial
No ratings yet
Lecture 12 - Weka Tutorial
84 pages
WEKA Data Mining Practical Guide
No ratings yet
WEKA Data Mining Practical Guide
18 pages
Workshop 1
No ratings yet
Workshop 1
16 pages
An Introduction To WEKA: Contributed by Yizhou Sun 2008
No ratings yet
An Introduction To WEKA: Contributed by Yizhou Sun 2008
85 pages
Data Mining Term Project Machine Learning With WEKA: Weka Explorer Tutorial For Version 3.4.3
No ratings yet
Data Mining Term Project Machine Learning With WEKA: Weka Explorer Tutorial For Version 3.4.3
42 pages
DMDV Main Manual
No ratings yet
DMDV Main Manual
35 pages
Exp 6
No ratings yet
Exp 6
9 pages
An Introduction To WEKA Explorer: in Part From: Yizhou Sun 2008
No ratings yet
An Introduction To WEKA Explorer: in Part From: Yizhou Sun 2008
104 pages
DW 9 Exp 1
No ratings yet
DW 9 Exp 1
43 pages
WEKA Lab Session
No ratings yet
WEKA Lab Session
88 pages
Introduction To Weka
No ratings yet
Introduction To Weka
39 pages
DM Lab Material
No ratings yet
DM Lab Material
88 pages
An Introduction To WEKA: Contributed by Yizhou Sun 2008
No ratings yet
An Introduction To WEKA: Contributed by Yizhou Sun 2008
85 pages
Experiment No: 01 Data Exploration & Data Preprocessing
No ratings yet
Experiment No: 01 Data Exploration & Data Preprocessing
54 pages
WEKA Data Mining Tool Guide
No ratings yet
WEKA Data Mining Tool Guide
19 pages
DMDV
No ratings yet
DMDV
22 pages
WEKA Data Mining Techniques Guide
No ratings yet
WEKA Data Mining Techniques Guide
17 pages
WEKA Guide for ML Practitioners
No ratings yet
WEKA Guide for ML Practitioners
58 pages
WEKA Data Mining Course Overview
No ratings yet
WEKA Data Mining Course Overview
5 pages
DWDM Lab Manual 7th Sem
No ratings yet
DWDM Lab Manual 7th Sem
45 pages
Implementation of Apriori Algorithm Using Weka: Ajay Kumar Shrivastava R. N. Panda
No ratings yet
Implementation of Apriori Algorithm Using Weka: Ajay Kumar Shrivastava R. N. Panda
4 pages
Appendix Weka
No ratings yet
Appendix Weka
17 pages
DWM1 Riya
No ratings yet
DWM1 Riya
16 pages
Data Warehousing Lab Exp 1-3
No ratings yet
Data Warehousing Lab Exp 1-3
24 pages
Lab Updated - Merged
No ratings yet
Lab Updated - Merged
49 pages
DWDM Lab Manual 2024-2025
No ratings yet
DWDM Lab Manual 2024-2025
96 pages
DMW Lab Print
No ratings yet
DMW Lab Print
21 pages
WEKA: ML Tool for Data Scientists
No ratings yet
WEKA: ML Tool for Data Scientists
23 pages
Data Warehousing Lab Course Guide
0% (1)
Data Warehousing Lab Course Guide
28 pages
Data Mining File
No ratings yet
Data Mining File
87 pages
Priyadarshini J. L. College of Engineering, Nagpur: Session 2022-23 Semester-V
No ratings yet
Priyadarshini J. L. College of Engineering, Nagpur: Session 2022-23 Semester-V
31 pages
01 DM HAI Class1 2019 09 05
No ratings yet
01 DM HAI Class1 2019 09 05
77 pages
Lab Manual
No ratings yet
Lab Manual
16 pages
Ccs341 Datawarehousing
No ratings yet
Ccs341 Datawarehousing
66 pages
More Data Mining With Weka: Ian H. Witten
No ratings yet
More Data Mining With Weka: Ian H. Witten
61 pages
Data-Mining-Lab-Manual Cs 703b
No ratings yet
Data-Mining-Lab-Manual Cs 703b
41 pages
The Logic Behind Fault Trees - An Explanation of Fault Tree Gates & Events
No ratings yet
The Logic Behind Fault Trees - An Explanation of Fault Tree Gates & Events
13 pages
Industrial Ecology Theory Controversies
No ratings yet
Industrial Ecology Theory Controversies
31 pages
Phylogeny of Dictyoptera Re-Examined
No ratings yet
Phylogeny of Dictyoptera Re-Examined
24 pages
SW101 - Arsenic Treatment Technologies For Soil, Water, Waste
100% (1)
SW101 - Arsenic Treatment Technologies For Soil, Water, Waste
237 pages
Phylogeny of Dictyoptera - Dating The Origin of Cockroaches, Praying Mantises and Termites With Molecular Data and Controlled Fossil Evidence
No ratings yet
Phylogeny of Dictyoptera - Dating The Origin of Cockroaches, Praying Mantises and Termites With Molecular Data and Controlled Fossil Evidence
27 pages
Alienoptera - A New Insect Order in The Roach-Mantodean Twilight Zone
No ratings yet
Alienoptera - A New Insect Order in The Roach-Mantodean Twilight Zone
11 pages
Malaysia Hazardous Waste Management
No ratings yet
Malaysia Hazardous Waste Management
7 pages
Hazardous Waste in New Trend of Circular Economy
No ratings yet
Hazardous Waste in New Trend of Circular Economy
8 pages
Comparative Life Cycle Assesment of Traditional and Emerging Oily Sludge Treatment Approaches
No ratings yet
Comparative Life Cycle Assesment of Traditional and Emerging Oily Sludge Treatment Approaches
47 pages
The BATINTREC Process For Reclaiming Used Batteries
No ratings yet
The BATINTREC Process For Reclaiming Used Batteries
5 pages
Right Hand Fingering: Group 1 Scales: Same Fingering, Starting From A White Key
No ratings yet
Right Hand Fingering: Group 1 Scales: Same Fingering, Starting From A White Key
1 page
Waste Battery Recycling Methods
No ratings yet
Waste Battery Recycling Methods
11 pages
Bioleaching of Battery Metals
No ratings yet
Bioleaching of Battery Metals
6 pages
Lithium-Ion Battery Recycling Trends
No ratings yet
Lithium-Ion Battery Recycling Trends
28 pages
Recovery and Recycling Processes of Scheduled Waste in Malaysia
No ratings yet
Recovery and Recycling Processes of Scheduled Waste in Malaysia
134 pages
China-Malaysia Industrial Park Guide
No ratings yet
China-Malaysia Industrial Park Guide
10 pages
Strengthening National Unity
No ratings yet
Strengthening National Unity
3 pages
Flyrock Risk Prediction in Quarry Blasting
No ratings yet
Flyrock Risk Prediction in Quarry Blasting
11 pages
EIA Guideline in Malaysia 2016 (Scanned 190318)
No ratings yet
EIA Guideline in Malaysia 2016 (Scanned 190318)
228 pages
Going Trenchless History
100% (2)
Going Trenchless History
3 pages
Power Generation
No ratings yet
Power Generation
4 pages
Scheduled Waste Management
No ratings yet
Scheduled Waste Management
9 pages
Wastewater Treatment Quiz Template
No ratings yet
Wastewater Treatment Quiz Template
1 page
Strengthening National Unity
No ratings yet
Strengthening National Unity
3 pages
Scheduled Waste Management
No ratings yet
Scheduled Waste Management
9 pages
6 March SET 2 AGENTFORCE BRAND
No ratings yet
6 March SET 2 AGENTFORCE BRAND
61 pages
Wisc Report
No ratings yet
Wisc Report
4 pages
Detecting Low-Rate DoS/DDoS with Deep Learning
No ratings yet
Detecting Low-Rate DoS/DDoS with Deep Learning
7 pages
Introduction To Socio Cultural and Anthropological Concepts
No ratings yet
Introduction To Socio Cultural and Anthropological Concepts
17 pages
WAEC Literature in English Syllabus
No ratings yet
WAEC Literature in English Syllabus
2 pages
Z. Int
No ratings yet
Z. Int
168 pages
January 2014 (IAL) MS - F1 Edexcel
No ratings yet
January 2014 (IAL) MS - F1 Edexcel
18 pages
Supervisory Plan for Modular Learning
No ratings yet
Supervisory Plan for Modular Learning
2 pages
Avasars For STD 7 - Week 1 - Aug 2020
No ratings yet
Avasars For STD 7 - Week 1 - Aug 2020
22 pages
Camping Paper
No ratings yet
Camping Paper
2 pages
Self-Discipline Guide for Teens
No ratings yet
Self-Discipline Guide for Teens
11 pages
MAINSTREAM
No ratings yet
MAINSTREAM
8 pages
Module 1 Comfort and Hygiene Measure
No ratings yet
Module 1 Comfort and Hygiene Measure
70 pages
Nutraceuticals Efficacy Safety and Toxicity
No ratings yet
Nutraceuticals Efficacy Safety and Toxicity
295 pages
Mark Scheme (Results) January 2023
No ratings yet
Mark Scheme (Results) January 2023
17 pages
1000 Most Common English Phrases
No ratings yet
1000 Most Common English Phrases
2 pages
Midterm Exam in Hbo
No ratings yet
Midterm Exam in Hbo
5 pages
Classroom Teacher Attitudes Toward Inclusion
100% (1)
Classroom Teacher Attitudes Toward Inclusion
68 pages
Ss119-1.g (Assessment and Evaluation)
No ratings yet
Ss119-1.g (Assessment and Evaluation)
3 pages
Project Report ON Employee Welfare OF Fertilizer Industry
No ratings yet
Project Report ON Employee Welfare OF Fertilizer Industry
4 pages
USMLE Step 1 Prep for Med Students
No ratings yet
USMLE Step 1 Prep for Med Students
4 pages
How Could I Hide My Face
No ratings yet
How Could I Hide My Face
5 pages
Detailed Lesson Plan in Math V I. Objectives: ST ST
100% (1)
Detailed Lesson Plan in Math V I. Objectives: ST ST
6 pages
Synchronous Class Monitoring Form: Abing, Jolena MAE
No ratings yet
Synchronous Class Monitoring Form: Abing, Jolena MAE
6 pages
Strategy Formulation: Business Strategy
No ratings yet
Strategy Formulation: Business Strategy
29 pages
Biomimicry in Energy-Efficient Architecture
No ratings yet
Biomimicry in Energy-Efficient Architecture
8 pages
Checklist
No ratings yet
Checklist
3 pages
PRAVEEN SENANAYAKE Personal Statement
No ratings yet
PRAVEEN SENANAYAKE Personal Statement
2 pages
EFL Lesson Planning for Secondary Schools
No ratings yet
EFL Lesson Planning for Secondary Schools
19 pages
Andrea Redinger Keynote at KDP Event
No ratings yet
Andrea Redinger Keynote at KDP Event
2 pages

Data Analytics Using WEKA

Uploaded by

Data Analytics Using WEKA

Uploaded by

NTW: Data analytics for Layman using WEKA

• Data analysis method includes simple query and reporting,

• Data mining is the process of automatically discovering

• Data mining is the application of specific algorithms for

“Data mining is a process of discovering

Knowledge Discovery in Databases (KDD)

▪ K-means • Multiple Regression • Multiple Regression

• Association rules • Logistic Regression • Logistic Regression

▪ Apriori • Decision Trees • Decision Trees

• A collection of attributes describe 105 20 1.58 5.18 IT954

Nominal Ordinal Interval Ratio

Qualitative types Quantitative types

❖ Nominal attribute: distinctness

• The purpose of pre-processing is to transform data

• Pre-processing data also prepares the miner so that

Type A+, A, B+,

Save the file

Data Pre-processing and Knowledge Discovery

@ATTRIBUTE attribute name and type

Save as csv file>

Open the csv

Save as the file as

Inspect the attribute

Save As the file as

Cluster tab >

Drag Jitter to the

Ask me personally or interested to become my post-graduate

You might also like