List of Experiments:
1. Creation of a Data Warehouse.
Build a Data Warehouse/Data Mart (using open-source tools like Pentaho Data Integration
Tool, Pentaho Business Analytics; or other data warehouse tools like Microsoft SSIS,
Informatica, Business Objects, etc.)
Design multi-dimensional data models, namely Star, Snowflake and Fact Constellation
schemas, for any one enterprise (e.g., banking, insurance, finance, healthcare,
manufacturing, automobiles, sales, etc.).
Write ETL scripts and implement using data warehouse tools.
Perform various OLAP operations such as slice, dice, roll-up, drill-down and pivot (illustrated in the sketch below).
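These OLAP operations are normally carried out inside the warehouse tool itself; as a language-neutral illustration, here is a minimal Python/pandas sketch of slice, dice, roll-up and pivot on a hypothetical sales fact table (the column names and values are assumptions, not part of the experiment data):

```python
import pandas as pd

# Hypothetical sales fact table (dimensions: year, region, product; measure: amount)
sales = pd.DataFrame({
    "year":    [2022, 2022, 2023, 2023, 2023],
    "region":  ["North", "South", "North", "South", "North"],
    "product": ["Loan", "Deposit", "Loan", "Loan", "Deposit"],
    "amount":  [120, 80, 150, 90, 60],
})

# Slice: fix a single dimension (year = 2023)
slice_2023 = sales[sales["year"] == 2023]

# Dice: select a sub-cube on two or more dimensions
dice = sales[(sales["year"] == 2023) & (sales["region"] == "North")]

# Roll-up: aggregate away the product dimension (drill-down is the reverse)
rollup = sales.groupby(["year", "region"], as_index=False)["amount"].sum()

# Pivot: rotate the cube so regions become columns
pivot = sales.pivot_table(index="year", columns="region",
                          values="amount", aggfunc="sum")

print(rollup, pivot, sep="\n\n")
```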
2. Explore the machine learning tool “WEKA”
Explore WEKA Data Mining/Machine Learning Toolkit.
Downloading and/or installation of WEKA data mining toolkit.
Understand the features of the WEKA toolkit such as the Explorer, Knowledge Flow interface,
Experimenter, and command-line interface.
Navigate the options available in WEKA (e.g., Select attributes panel, Preprocess
panel, Classify panel, Cluster panel, Associate panel and Visualize panel).
Study the ARFF file format. Explore the available data sets in WEKA. Load a data set (e.g.,
the Weather dataset, Iris dataset, etc.)
Load each dataset and observe the following:
1. List the attribute names and their types
2. Number of records in each dataset
3. Identify the class attribute (if any)
4. Plot Histogram
5. Determine the number of records for each class.
6. Visualize the data in various dimensions
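The same observations can be reproduced outside the WEKA GUI. A minimal Python sketch, assuming a local copy of the iris.arff file that ships in WEKA's data directory (the attribute names sepallength and class follow that file):

```python
from scipy.io import arff
import pandas as pd
import matplotlib.pyplot as plt

# Load an ARFF file (the path is an assumption; iris.arff ships with WEKA)
data, meta = arff.loadarff("iris.arff")
df = pd.DataFrame(data)

# 1. Attribute names and their types
print(dict(zip(meta.names(), meta.types())))

# 2. Number of records in the dataset
print("records:", len(df))

# 3. Class attribute and 5. number of records per class
df["class"] = df["class"].str.decode("utf-8")   # nominal values are read as bytes
print(df["class"].value_counts())

# 4. Histogram of one numeric attribute
df["sepallength"].plot(kind="hist", title="sepallength")
plt.show()
```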
3. Perform data preprocessing tasks and demonstrate association rule mining on data
sets
Explore various options available in Weka for preprocessing data and apply
unsupervised filters (e.g., the Discretize and Resample filters) on each dataset.
Load the weather.nominal, Iris and Glass datasets into Weka and run the Apriori
algorithm with different support and confidence values.
Study the rules generated. Apply different discretization filters on numerical attributes
and run the Apriori association rule algorithm. Study the rules generated.
Derive interesting insights and observe the effect of discretization on the rule-generation
process (a Python discretization sketch follows).
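Outside Weka, the discretization step can be reproduced in Python before running Apriori. A minimal sketch of unsupervised equal-width binning with scikit-learn, analogous to Weka's unsupervised Discretize filter (the Iris data and the bin count of 3 are assumptions):

```python
from sklearn.datasets import load_iris
from sklearn.preprocessing import KBinsDiscretizer

iris = load_iris(as_frame=True)
X = iris.data

# Unsupervised equal-width discretization into 3 bins per numeric attribute
disc = KBinsDiscretizer(n_bins=3, encode="ordinal", strategy="uniform")
X_binned = disc.fit_transform(X)

print("bin edges for", X.columns[0], "->", disc.bin_edges_[0])
print(X_binned[:5])   # each value replaced by its bin index
```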
4. Demonstrate performing classification on data sets using Weka/R
Load each dataset and run the ID3 and J48 classification algorithms. Study the classifier output.
Compute the entropy values and the Kappa statistic.
Extract if-then rules from the decision tree generated by the classifier and observe the
confusion matrix.
Load each dataset into Weka/R and perform Naïve Bayes classification and k-Nearest
Neighbour classification. Interpret the results obtained.
Plot ROC curves.
Compare the classification results of the ID3, J48, Naïve Bayes and k-NN classifiers for each
dataset, deduce which classifier performs best and which performs worst on each dataset, and
justify your conclusion (a scikit-learn sketch follows).
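The same metrics can be computed in Python with scikit-learn. A minimal sketch, assuming the Iris dataset and a decision tree with the entropy criterion standing in for ID3/J48 (J48 is Weka's C4.5 implementation); the 70/30 split is an assumption:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import confusion_matrix, cohen_kappa_score, accuracy_score

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=1)

# Entropy-based decision tree (stand-in for ID3/J48)
tree = DecisionTreeClassifier(criterion="entropy", random_state=1).fit(X_tr, y_tr)
print(export_text(tree))                        # if-then style rules from the tree
pred = tree.predict(X_te)
print(confusion_matrix(y_te, pred))             # confusion matrix
print("kappa:", cohen_kappa_score(y_te, pred))  # Kappa statistic

# Compare against Naïve Bayes and k-NN on the same split
for name, clf in [("NaiveBayes", GaussianNB()), ("kNN", KNeighborsClassifier(3))]:
    acc = accuracy_score(y_te, clf.fit(X_tr, y_tr).predict(X_te))
    print(name, "accuracy:", round(acc, 3))
```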
5. Demonstrate performing clustering of data sets
Load each dataset into Weka/R and run the simple k-means clustering algorithm with
different values of k (the number of desired clusters).
Study the clusters formed. Observe the sum of squared errors and centroids, and derive
insights.
Explore other clustering techniques available in Weka/R.
Explore visualization features of Weka/R to visualize the clusters. Derive interesting
insights and explain.
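The same study can be run in Python. A minimal scikit-learn sketch, assuming the Iris data and the range of k values shown:

```python
from sklearn.datasets import load_iris
from sklearn.cluster import KMeans

X, _ = load_iris(return_X_y=True)

# Run k-means for several values of k; inspect the SSE (inertia) and centroids
for k in (2, 3, 4, 5):
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    print(f"k={k}  SSE={km.inertia_:.2f}")
    if k == 3:
        print("centroids:\n", km.cluster_centers_)
```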
6. Demonstrate the Knowledge Flow application on data sets in Weka
Develop a knowledge flow layout for finding strong association rules by using the Apriori and
FP-Growth algorithms.
Set up the knowledge flow to load an ARFF file (batch mode) and perform cross-validation
using the J48 algorithm.
Demonstrate plotting multiple ROC curves in the same plot window by using the J48 and
Random Forest classifiers.
7. Demonstrate the ZeroR technique on the Iris dataset (applying any necessary preprocessing
technique(s)) and share your observations.
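ZeroR simply predicts the majority class and ignores all attributes, so it serves as a baseline against which other classifiers are judged. A minimal Python sketch of the same idea using scikit-learn's DummyClassifier on Iris (the 10-fold evaluation is an assumption):

```python
from sklearn.datasets import load_iris
from sklearn.dummy import DummyClassifier
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

# ZeroR equivalent: always predict the most frequent class
zero_r = DummyClassifier(strategy="most_frequent")
scores = cross_val_score(zero_r, X, y, cv=10)
print("10-fold accuracy:", scores.mean())   # roughly 0.33 on Iris (3 balanced classes)
```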
8. Write a Java program to prepare a simulated data set with unique instances.
9. Write a Python program to generate frequent item sets / association rules using the Apriori
algorithm.
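A minimal Python sketch, assuming the third-party mlxtend library is installed and using a small hand-made transaction list (the items and thresholds are assumptions):

```python
import pandas as pd
from mlxtend.preprocessing import TransactionEncoder
from mlxtend.frequent_patterns import apriori, association_rules

# Toy transactions (assumed data)
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
    ["bread", "butter"],
    ["bread", "butter", "milk"],
]

# One-hot encode the transactions into a boolean DataFrame
te = TransactionEncoder()
onehot = pd.DataFrame(te.fit(transactions).transform(transactions),
                      columns=te.columns_)

# Frequent itemsets and the association rules derived from them
itemsets = apriori(onehot, min_support=0.4, use_colnames=True)
rules = association_rules(itemsets, metric="confidence", min_threshold=0.7)
print(itemsets)
print(rules[["antecedents", "consequents", "support", "confidence"]])
```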
10. Write a program to calculate the chi-square value using Python/R. Report your observations.
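A minimal Python sketch using scipy on an assumed 2x2 contingency table:

```python
from scipy.stats import chi2_contingency

# Assumed contingency table: rows = group, columns = outcome
observed = [[30, 10],
            [20, 40]]

chi2, p, dof, expected = chi2_contingency(observed)
print("chi-square:", round(chi2, 3))
print("p-value:", round(p, 4), " degrees of freedom:", dof)
print("expected counts:\n", expected)
```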
11. Write a program for Naïve Bayes classification using the Python/R programming language.
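A minimal Python sketch using scikit-learn's Gaussian Naïve Bayes on Iris (the 70/30 split is an assumption):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import classification_report

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

# Fit Gaussian Naïve Bayes and report per-class precision/recall/F1
model = GaussianNB().fit(X_tr, y_tr)
print(classification_report(y_te, model.predict(X_te)))
```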
12. Implement a Java/R program to perform the Apriori algorithm.
13. Write an R program to cluster a dataset of your choice using the simple k-means algorithm
(or implement the same in Java using the JDK).
14. Write a program for cluster analysis using the simple k-means algorithm in the Python/R
programming language.
15. Write a program to compute/display the dissimilarity matrix (for your own dataset containing at
least four instances with two attributes) using Python.
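A minimal Python sketch, assuming Euclidean distance and a tiny hand-made dataset of four instances with two attributes:

```python
import numpy as np
from scipy.spatial.distance import pdist, squareform

# Own dataset: four instances, two numeric attributes (assumed values)
X = np.array([[1.0, 2.0],
              [2.0, 0.0],
              [4.0, 5.0],
              [0.0, 1.0]])

# Pairwise Euclidean distances arranged as a symmetric dissimilarity matrix
dissimilarity = squareform(pdist(X, metric="euclidean"))
print(np.round(dissimilarity, 2))
```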
16. Visualize the datasets using matplotlib in Python/R (histogram, box plot, bar chart, pie
chart, etc.).
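A minimal Python/matplotlib sketch covering the four chart types on the Iris data (the column and category choices are assumptions):

```python
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris

iris = load_iris(as_frame=True)
df = iris.frame                          # feature columns plus a numeric 'target' column
counts = df["target"].value_counts().sort_index()

fig, axes = plt.subplots(2, 2, figsize=(8, 6))

axes[0, 0].hist(df["sepal length (cm)"], bins=15)                 # histogram
axes[0, 0].set_title("Histogram")

axes[0, 1].boxplot([df["sepal length (cm)"], df["petal length (cm)"]])  # box plot
axes[0, 1].set_xticks([1, 2])
axes[0, 1].set_xticklabels(["sepal", "petal"])
axes[0, 1].set_title("Box plot")

axes[1, 0].bar(iris.target_names, counts.values)                  # bar chart
axes[1, 0].set_title("Bar chart")

axes[1, 1].pie(counts.values, labels=iris.target_names,
               autopct="%1.0f%%")                                 # pie chart
axes[1, 1].set_title("Pie chart")

plt.tight_layout()
plt.show()
```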