DA Unitwise Notes With Diagrams

The document outlines a comprehensive introduction to data analytics, covering types of data, sources, applications, and the data analytics lifecycle. It details various data analysis techniques, including regression, classification, and neural networks, as well as exploratory data analysis methods and data mining techniques like clustering and association rules. Additionally, it discusses frameworks and visualization tools such as Hadoop, Spark, R, Tableau, and Power BI for handling big data and enhancing data storytelling.

Uploaded by saumya2213215

Unit 1: Introduction to Data Analytics

This unit introduces the fundamentals of data analytics:

- Types of Data: Structured, Semi-structured, Unstructured.

- Sources of Data: Internal, External, Primary, Secondary, Real-time.

- Applications: Business, Healthcare, E-commerce, Social media.

- Data Analytics Lifecycle: Data Collection, Cleaning, Exploration, Modeling, Evaluation, Deployment.

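
As a rough illustration of the three data types listed in this unit, the sketch below reads a structured CSV snippet, a semi-structured JSON record, and an unstructured sentence using only the Python standard library. The sample values and field names are invented for illustration and do not come from the notes.

```python
import csv
import io
import json

# Structured: fixed columns, every row has the same fields (hypothetical sample).
csv_text = "id,name,amount\n1,Asha,250\n2,Ravi,400\n"
rows = list(csv.DictReader(io.StringIO(csv_text)))

# Semi-structured: self-describing keys; fields may be nested or optional.
json_text = '{"id": 3, "name": "Meera", "tags": ["new", "mobile"]}'
record = json.loads(json_text)

# Unstructured: free text with no predefined schema, so we can only tokenize it.
review = "Delivery was quick but the packaging was damaged."
words = review.lower().rstrip(".").split()

print(rows[0]["name"], record["tags"], len(words))
```

Note that the CSV reader returns every field as a string, so numeric columns such as `amount` would still need type conversion during the cleaning stage of the lifecycle.
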
Unit 2: Data Analysis Techniques

This unit focuses on core data analysis algorithms:

- Regression (Linear, Logistic): Linear regression predicts continuous outcomes; logistic regression predicts binary outcomes.

- Classification (SVM, Decision Trees): Assign labels to data.

- Bayesian Modeling: Probabilistic models for predictions.

- Neural Networks: Layered models; deep networks are used for image, speech, and text data.

- Fuzzy Logic: Handling uncertain or imprecise data.
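
To make the regression bullet concrete, here is a minimal sketch in plain Python, assuming a single input feature. `fit_line` uses the standard closed-form least-squares solution for a straight line, and `sigmoid` shows how logistic regression turns a linear score into a probability. The function names, the toy data, and the weight `w` and bias `b` are illustrative assumptions, not part of the notes.

```python
import math
from statistics import mean

def fit_line(xs, ys):
    """Return (slope, intercept) minimizing squared error for y = slope*x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept

def sigmoid(z):
    """Squash a linear score into a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))

# Linear regression: toy data lying exactly on y = 2x + 1.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)

# Logistic regression applies the sigmoid to a linear score. The weight and
# bias below are made-up values, not fitted parameters.
w, b = 1.5, -3.0
prob = sigmoid(w * 2.0 + b)      # the score is 0, so the probability is 0.5
label = 1 if prob >= 0.5 else 0  # threshold the probability to get a class
```

In practice the logistic weights would be fitted by an optimizer such as gradient descent; the sketch only shows the prediction step.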


Unit 3: Exploratory Data Analysis (EDA)

Exploratory Data Analysis (EDA) includes:

- Data Cleaning: Removing null values and duplicate records.

- Data Transformation: Normalization, Encoding.

- Statistical Summaries: Mean, median, standard deviation, and outlier detection.

- Visualization Tools: Bar charts, Box plots, Heatmaps.
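
The cleaning, summary, and transformation steps above can be sketched with the Python standard library alone. The `(id, value)` records are hypothetical, and the 1.5 × IQR rule is one common convention for flagging outliers rather than something the notes prescribe.

```python
from statistics import mean, median, stdev, quantiles

# Hypothetical (id, value) records with one null and one duplicate row.
records = [(1, 12), (2, 15), (3, None), (4, 14), (2, 15), (5, 13), (6, 90), (7, 14)]

# Cleaning: drop rows with a null value, then drop exact duplicate rows
# (dict.fromkeys keeps the first occurrence and preserves order).
cleaned = list(dict.fromkeys(r for r in records if r[1] is not None))
values = [v for _, v in cleaned]

# Statistical summaries.
summary = {"mean": mean(values), "median": median(values), "std_dev": stdev(values)}

# Outliers via the 1.5 * IQR rule on the quartiles.
q1, _, q3 = quantiles(values, n=4)
iqr = q3 - q1
outliers = [v for v in values if v < q1 - 1.5 * iqr or v > q3 + 1.5 * iqr]

# Transformation: min-max normalization rescales values to the range [0, 1].
lo, hi = min(values), max(values)
normalized = [(v - lo) / (hi - lo) for v in values]

print(summary, outliers)
```

Comparing the mean with the median in `summary` is a quick way to see how strongly a single extreme value skews the data, which a box plot would show visually.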


Unit 4: Frequent Itemsets and Clustering

This unit explains data mining techniques:

- Apriori Algorithm: Level-wise candidate generation plus support counting to find frequent itemsets.

- FP-Growth: Tree-based frequent pattern mining.

- K-Means: Partition data into K clusters based on distance.

- Hierarchical Clustering: Bottom-up (agglomerative) or top-down (divisive) grouping, visualized as a dendrogram.

- Association Rules: Derive "if X then Y" rules from frequent itemsets, ranked by support and confidence.
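
A minimal sketch of the Apriori idea, assuming a small in-memory list of transactions: each pass counts support for the current candidates, keeps those meeting the threshold, and joins them into larger candidates. The basket data and the helper names `support`, `apriori`, and `confidence` are made up for illustration; real implementations count support far more efficiently.

```python
from itertools import combinations

# Hypothetical market baskets.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "butter"},
    {"bread", "milk", "butter"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)

def apriori(transactions, min_support):
    """Return {frozenset: support} for all frequent itemsets."""
    items = sorted({item for t in transactions for item in t})
    current = [frozenset([i]) for i in items]
    frequent = {}
    k = 1
    while current:
        # Support counting: keep candidates that meet the threshold.
        level = {}
        for c in current:
            s = support(c, transactions)
            if s >= min_support:
                level[c] = s
        frequent.update(level)
        # Candidate generation: join frequent k-itemsets into (k+1)-itemsets,
        # pruning any candidate that has an infrequent k-subset.
        k += 1
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        current = [
            c for c in candidates
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        ]
    return frequent

def confidence(antecedent, consequent, transactions):
    """Confidence of the rule antecedent -> consequent."""
    both = support(antecedent | consequent, transactions)
    return both / support(antecedent, transactions)

frequent = apriori(transactions, min_support=0.6)
rule_conf = confidence(frozenset({"bread"}), frozenset({"milk"}), transactions)
print(len(frequent), round(rule_conf, 2))
```

Here the rule "if bread then milk" is read off a frequent pair: its confidence is the support of {bread, milk} divided by the support of {bread}.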


Unit 5: Frameworks and Visualization Tools

Big data and visualization:

- Hadoop: HDFS for distributed storage plus MapReduce for batch processing of big data.

- Spark: Often faster than Hadoop MapReduce, especially for iterative workloads, because it keeps intermediate data in memory.

- R Language: Used for statistical computing and graphics.

- Tableau & Power BI: Tools for business intelligence and data storytelling.
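
The MapReduce model behind Hadoop can be sketched in plain Python as a word count. This single-process toy only mimics the map, shuffle, and reduce phases; it does not use any Hadoop or Spark API, and the function names and input lines are assumptions made for illustration.

```python
from collections import defaultdict

def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle(pairs):
    """Shuffle: group all values by key, as the framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single count."""
    return key, sum(values)

# Hypothetical input; on a real cluster each line could live on a different node.
lines = ["big data needs big tools", "spark and hadoop process big data"]

mapped = [pair for line in lines for pair in map_phase(line)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())
print(counts)
```

In Hadoop the intermediate pairs are written to disk between phases, whereas Spark can keep such intermediate results in memory, which is the main source of its speed advantage.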
