Course Curriculum
DATA ANALYTICS
JOB READY PROGRAM
DATICTION
Dhaka, Bangladesh
Module 01: Excel for Data Analytics
Class 01: Introduction to Excel and Data Basics
❑ Overview of Excel
• Key features and applications in analytics.
❑ Navigating the Interface
• Ribbon, toolbar, and worksheet navigation.
❑ Basic Operations
• Creating and saving workbooks.
• Entering, editing, and formatting data.
❑ Descriptive Statistics
• Mean, median, mode, variance, and standard deviation.
❑ Data Analysis ToolPak
• Using built-in tools for statistical calculations.
❑ Introduction to Basic Charts
• How visualization enhances understanding.
Class 02: Formulas and Functions in Excel
❑ Basic Formulas and Functions
• Creating formulas and using functions (SUM, AVERAGE, COUNT).
• AutoSum and relative/absolute referencing.
❑ Text Functions
• String manipulation, text formatting, and searching.
❑ Date and Time Functions
• Calculating differences, extracting components.
❑ Logical Functions
• IF, AND, OR, and nested functions.
❑ Excel Productivity Features
• AutoFill, Flash Fill, and importing data.
Class 03: Data Cleaning and Preprocessing
❑ Data Cleaning Techniques
• Removing duplicates, handling missing data.
• Converting text to columns and changing data types.
❑ Using Filters and Advanced Filters
• Filtering large datasets effectively.
❑ Power Query
• Importing and transforming data.
❑ Error Checking and Troubleshooting
• Identifying and resolving formula errors.
Class 04: Data Manipulation and Management
❑ Basic Data Manipulation
• Sorting, grouping, and transposing data.
• Pivoting and naming columns/rows.
❑ Working with Tables
• Creating, formatting, and managing tables.
• Using structured references.
❑ Using Conditional Formatting
• Highlighting trends, duplicates, and outliers dynamically.
Class 05: Data Visualization Techniques
❑ Creating Charts
• Bar, column, line, and pie charts.
• Customizing charts with labels, titles, and legends.
❑ Advanced Charting
• Combo charts, sparklines, and dynamic charts with named ranges.
❑ Interactive Dashboards
• Using slicers, timelines, and linking tables for dynamic updates.
Class 06: Advanced Excel Techniques
❑ PivotTables and PivotCharts
• Creating, customizing, and grouping data.
• Using calculated fields and items.
❑ Goal Seek and What-If Analysis
• Setting up scenarios and performing sensitivity analysis.
❑ Introduction to Power Pivot
• Handling large datasets and creating data models.
Class 07: Advanced Visualization Techniques
❑ Creating Advanced Charts
• Waterfall charts for financial data.
• Funnel charts for sales and conversion analysis.
• Radar/spider charts for performance metrics.
❑ Interactive Visualizations
• Dynamic charts with dropdowns and slicers.
• Using form controls for interactivity.
• Animated visualizations for trend analysis.
❑ Advanced Formatting for Visual Impact
• Using custom themes and templates.
• Incorporating shapes, icons, and smart art for storytelling.
• Applying advanced conditional formatting for visuals.
❑ Real-World Visualization Project
• Build a complex visualization showcasing multi-variable data.
• Integrate advanced charts and interactive components.
Project on Excel
Project 01: Business Data Analytics Dashboard
Objective:
Create an interactive dashboard to analyze and present sales, inventory, and customer
data for a retail company.
Requirements:
1. Data Preparation:
• Clean and validate the dataset to ensure data integrity.
• Apply conditional formatting to highlight inconsistencies.
2. Data Analysis:
• Use PivotTables to summarize sales by product, region, and customer segment.
• Calculate key metrics like profit margins and average order value.
3. Visualizations:
• Include:
o Sales Trend Chart: Monthly sales trends (line chart).
o Top Products Chart: Top products by revenue (bar chart).
o Customer Distribution Chart: Regional distribution (pie chart).
o Inventory Chart: Stock levels vs reorder thresholds (combo chart).
4. Dashboard Design:
• Add slicers and dropdowns for filtering data dynamically.
• Apply professional formatting and dynamic titles.
Submission:
• Submit the Excel file with a functional dashboard.
• Include a brief summary of key insights (2-3 sentences).
Evaluation Criteria:
1. Data Cleaning and Preparation (20%)
2. Analysis and Formula Usage (30%)
3. Visualization Quality (30%)
4. Dashboard Design and Interactivity (20%)
Module 02: Power BI for Data Analytics
Class 08: Introduction to Power BI
❑ Introduction to Power BI
• What is Power BI? Overview and components.
• Installing and setting up Power BI Desktop.
• Navigating the Power BI interface.
❑ Data Sources and Collection Methods
• Identifying relevant data sources and methods.
❑ Importing Data
• Connecting to various data sources.
• Overview of connectors and initial transformations.
❑ Basic Data Exploration
• Previewing Imported Data
• Detecting Data Quality Issues
Class 09: Data Cleaning and Preprocessing
❑ Data Cleaning Techniques
• Using Power Query Editor for cleaning data.
• Handling missing values, duplicates, and errors.
• Transforming data types and formatting.
❑ Advanced Data Transformations
• Splitting and merging columns.
• Pivoting and unpivoting data.
• Grouping and summarizing data.
Class 10: Data Modeling and Relationships
❑ Building a Data Model
• Importance of data modeling and schema design (Star vs. Snowflake).
❑ Establishing Relationships
• Creating and managing relationships between tables.
• Role-playing dimensions.
❑ Advanced Modeling Concepts
• Calculated tables and columns.
• Managing relationship properties.
❑ Introduction to DAX Basics
• Understanding DAX syntax and creating simple measures.
Class 11: Intermediate DAX and Analytics
❑ Intermediate DAX Functions
• Aggregation, logical, and date-time functions.
❑ Advanced DAX Techniques
• Using CALCULATE and FILTER.
• Creating time intelligence measures (e.g., YOY, YTD).
❑ Building Analytics with DAX
• Context in DAX: Row vs. filter context.
• Creating dynamic measures for analysis.
Class 12: Data Visualization Techniques
❑ Creating Effective Visuals
• Overview of chart types (bar, line, scatter, etc.).
• Best practices for clear and engaging visuals.
❑ Advanced Visualizations
• Using slicers, filters, and drill-throughs.
• Creating interactive visuals and tooltips.
❑ Custom Visualizations
• Importing visuals from the Power BI marketplace.
• Customizing visuals with themes and formatting.
Class 13: Advanced Reporting and Collaboration
❑ Creating Reports and Dashboards
• Assembling visuals into cohesive reports.
• Designing interactive dashboards with slicers and bookmarks.
❑ Collaboration with Power BI Service
• Publishing reports and setting up refresh schedules.
• Sharing dashboards with permissions and access levels.
❑ Mobile Optimization
• Creating mobile-optimized dashboards.
Class 14: Performance Optimization in Power BI
❑ Optimizing Data Models
• Best practices for efficient data models.
• Using aggregations and reducing data size.
❑ Query Performance Techniques
• Understanding query folding.
• Optimizing Power Query steps.
❑ DAX Optimization
• Writing efficient DAX formulas.
• Reducing computational overhead.
Class 15: Advanced Analytics and AI in Power BI
❑ Using AI Visuals
• Key Influencers and Decomposition Tree.
❑ Integrating with Advanced Tools
• Connecting with Azure ML for predictive analysis.
• Leveraging Python or R for custom analytics and visuals.
❑ Time-Series Analysis and Forecasting
• Analyzing trends and creating forecasts.
Project on Power BI
Project 02: Interactive Business Dashboard
Objective:
Create a fully functional and interactive Power BI dashboard showcasing insights and
actionable recommendations from a real-world dataset.
Requirements:
1. Dataset:
• Use the provided dataset or select a relevant one (e.g., sales, finance, or
operations).
• Ensure the dataset has multiple tables to practice data modeling.
2. Deliverables:
• Dashboard Components:
o Overview Page: Key performance indicators (KPIs).
o Detail Pages: At least 3 pages focusing on trends, segmentation,
and performance.
o Insights Page: Actionable recommendations using DAX and
visuals.
• Features to Include:
o Data Cleaning: Handle missing values, duplicates, and
transformations in Power Query.
o Data Modeling: Create relationships, calculated columns, and
measures.
o Visualizations: Diverse and advanced visuals like slicers, drill-
throughs, and custom charts.
o DAX Measures: At least 5 (e.g., YOY growth, retention rate).
o Publish: Share report via Power BI Service with refresh schedules.
Submission:
• Power BI file (*.pbix).
• A brief summary of the process and key insights.
• Optional: Short recorded walkthrough of the dashboard.
Evaluation Criteria:
• Data Preparation & Modeling (40%).
• Visual Design & Interactivity (30%).
• Insights & Recommendations (30%).
Module 03: SQL for Analytics
Class 16: Introduction to Databases and SQL Basics
❑ Overview of Databases
• What are databases? Importance in data management.
• Relational databases: Components and structure (tables, rows, columns).
• Types of databases: Relational vs. Non-relational.
❑ Introduction to SQL
• What is SQL? History and standards.
• SQL dialects (MySQL, PostgreSQL, SQLite).
• Setting up a database environment.
❑ Basic SQL Commands
• Connecting to a database.
• SELECT, FROM, WHERE: Retrieving data.
• Filtering data using comparison and logical operators.
• Sorting and limiting results (ORDER BY, LIMIT).
Class 17: Data Manipulation and Aggregation
❑ Data Manipulation Commands
• INSERT: Adding data to tables.
• UPDATE: Modifying existing data.
• DELETE: Removing data.
❑ Aggregating Data
• Aggregate functions: COUNT, SUM, AVG, MIN, MAX.
• GROUP BY for data grouping.
• HAVING for filtering aggregated results.
❑ Window Functions
• Introduction to window functions.
• Ranking (ROW_NUMBER, RANK).
• Aggregate and value-based window functions.
Class 18: Joining Tables and Advanced Query Techniques
❑ Understanding Joins
• Inner join, left join, right join, full outer join.
• Cross join and self join with use cases.
❑ Advanced Joins
• Combining joins with aggregation.
• Joining multiple tables effectively.
❑ Subqueries
• Subqueries in SELECT, FROM, WHERE, and HAVING clauses.
• Correlated vs. non-correlated subqueries.
❑ Common Table Expressions (CTEs)
• Writing and using CTEs for complex queries.
Class 19: Advanced SQL Techniques and Performance Optimization
❑ Advanced SQL Functions
• String functions (CONCAT, SUBSTRING).
• Date and time functions (NOW, DATEDIFF).
• Conditional expressions (CASE, COALESCE).
❑ Views and Indexes
• Creating and using views for reusable queries.
• Introduction to indexes and their impact on query performance.
❑ Transactions and Data Integrity
• Transactions and ACID properties.
• Data integrity constraints: Primary key, foreign key, unique, not null.
❑ SQL Performance Tuning
• Basics of query optimization using EXPLAIN/EXPLAIN PLAN for
performance analysis.
Project on SQL
Project 03: Inventory and Sales Management
Objective:
Create a database to manage an online store's inventory, suppliers, and sales orders.
Requirements:
1. Database Setup:
• Create the following tables:
o Products (ProductID, ProductName, Category, Price,
StockQuantity)
o Suppliers (SupplierID, SupplierName, ContactName, Phone)
o Orders (OrderID, ProductID, QuantityOrdered, OrderDate,
SupplierID)
• Insert data into these tables (at least 5 rows for each).
2. Data Manipulation:
• Insert new products into the Products table (at least 3 products).
• Update the stock quantity of a product (e.g., add 10 more units).
• Delete an order from the Orders table.
3. Join and Aggregation:
• Join the Products and Orders tables to list orders with product names and
quantities.
• Calculate total value of products ordered (Price * QuantityOrdered).
• Show total stock quantity for each product and the number of orders.
4. Advanced SQL Functions:
• Use aggregate functions to calculate the average price of products in
each category.
• Create a view showing total sales per product (Price * QuantityOrdered).
• Find products ordered more than 10 times using conditional
expressions.
Submission:
1. SQL Script:
Submit a script containing:
• SQL code for table creation and data insertion.
• SQL queries for each of the tasks above.
2. (Optional) Report:
A short summary explaining:
• The database structure.
• How the SQL queries were used.
Evaluation Criteria:
1. Database Setup & Data Insertion (20%)
2. Data Manipulation (25%)
3. Joins & Aggregation (25%)
4. Advanced SQL Functions (20%)
5. Code Organization (10%)
6. Report (Optional, 5%)
Module 04: Python for Data Analytics
Class 20: Fundamental Python
❑ Introduction to Python
• What is Python and why it’s popular for data analytics
• Setting up the Python environment
• Writing and running Python code
❑ Python Syntax and Structure
• Understanding indentation and code blocks
• Commenting code for clarity
❑ Variables, Data Types, and Basic Operators
• Defining variables and understanding different data types
• Working with numbers, strings, and booleans
• Using basic operators (arithmetic, comparison, logical)
❑ Control Flow
• Conditional Statements: if, elif, else
• Loops: for, while, and break, continue
❑ Functions
• Defining and calling functions
• Function arguments, return values, and scope
• Lambda functions and anonymous functions
❑ Working with Collections
• Lists, Tuples, Dictionaries, Sets
• Indexing, slicing, and iterating over collections
Class 21: Intermediate Python
❑ File Handling
• Reading and writing text and CSV files
• Working with file paths and directories
• Using context managers (with statement)
❑ Error Handling
• Understanding exceptions and handling errors using try, except, else,
finally
• Raising custom exceptions
❑ Working with Libraries
• Importing libraries and modules
• Installing external packages with pip
• Introduction to Python's standard library (e.g., math, datetime)
❑ Basic Object-Oriented Programming (OOP)
• Introduction to classes and objects
• Methods, attributes, and the self keyword
• Basic inheritance, class hierarchies, and method overriding
• Understanding encapsulation and the importance of OOP in data analytics
Class 22: NumPy and Pandas for Data Manipulation
❑ Introduction to NumPy
• Understanding arrays and how they differ from lists
• Array slicing, indexing, and reshaping
• Mathematical operations and broadcasting
• Common NumPy functions (np.sum, np.mean, np.dot, etc.)
❑ Introduction to Pandas
• Creating and exploring Pandas Series and DataFrames
• Loading data from CSV, Excel, and other sources
• Data cleaning: handling missing data, duplicates, and data types
• Data manipulation: filtering, sorting, and grouping
❑ Advanced Pandas Operations
• Merging, joining, and concatenating DataFrames
• Reshaping and pivoting data
• Applying functions to columns/rows (apply, map, lambda)
• Time series data manipulation and indexing
Class 23: Data Visualization with Matplotlib & Seaborn
❑ Introduction to Matplotlib
• Creating basic plots: line charts, bar charts, histograms, scatter plots
• Customizing plots: titles, labels, gridlines, colors
• Saving and exporting plots to files (e.g., PNG, PDF)
❑ Introduction to Seaborn
• Creating advanced visualizations: boxplots, pair plots, heatmaps
• Customizing Seaborn plots with themes, color palettes, and styling
• Visualizing categorical and continuous data together
❑ Interactive Visualizations (Optional)
• Introduction to Plotly for interactive charts
• Creating interactive visualizations (e.g., scatter plots with hover data)
❑ Best Practices for Visualizing Data
• Choosing the right chart for the data
• Effective use of color and labels
• Avoiding misleading visualizations
Class 24: Data Exploration and Analysis (EDA)
❑ Exploratory Data Analysis (EDA)
• What is EDA and why it is crucial for data analysis
• Summarizing data: mean, median, mode, standard deviation, percentiles
• Using describe() and other summary functions in Pandas
❑ Visualization for EDA
• Visualizing distributions with histograms, boxplots, and density plots
• Correlation analysis: pair plots, heatmaps, and scatter matrix
❑ Handling Outliers and Missing Data
• Identifying and treating outliers
• Filling or removing missing data: strategies like mean imputation, forward
fill, and interpolation
• Using dropna() and fillna() in Pandas
❑ Practical EDA with a Real Dataset
• Applying EDA techniques to a real-world dataset (e.g., sales data, weather
data)
• Presenting findings with clear visualizations and insights
• Case study: Exploring a dataset end-to-end using EDA methods
Project on Python
Project 04: Data Analysis and Visualization with Python
Objective:
The goal of this project is to apply the concepts learned throughout the Python for Data
Analytics module to clean, manipulate, analyze, and visualize a real-world dataset.
Requirements:
1. Dataset Selection:
• Choose a dataset from sources like Kaggle or UCI (e.g., sales, weather, or
student performance data).
• The dataset should have at least 5 columns, with both numerical and
categorical data.
2. Data Cleaning:
• Import the dataset using Pandas.
• Handle missing values and remove duplicates.
• Convert data types where necessary (e.g., dates, categorical data).
3. Data Analysis:
• Perform basic data manipulation (filter, sort, group) using Pandas.
• Use descriptive statistics (mean, median, etc.).
• Apply simple aggregations (e.g., total sales by month).
4. Data Visualization:
• Visualize the data using Matplotlib and Seaborn:
▪ Create histograms, boxplots, bar charts, or line plots.
▪ Show trends or correlations using heatmaps or pairplots.
5. Report:
• Provide a brief report (1-2 pages) summarizing your findings,
visualizations, and conclusions.
Submission:
1. Python Code: Jupyter notebook or Python script with data cleaning,
manipulation, and analysis.
2. Report: 1-2 page report summarizing insights and visualizations.
3. Dataset: The dataset used for analysis.
Evaluation Criteria:
1. Data Cleaning (25%) – Handling missing data and duplicates.
2. Analysis (30%) – Proper use of Pandas for data manipulation.
3. Visualization (25%) – Clear and appropriate visualizations.
4. Report (20%) – Clarity and conciseness in summarizing findings.
Module 05: Capstone Project
Project 05: Data Analytics Capstone
Objective: This project integrates skills learned in Excel, Power BI, SQL, and Python.
You'll analyze a real-world dataset, applying data cleaning, analysis, and visualization
techniques to provide actionable insights.
Project Requirements:
1. Data Collection & Import:
• Choose a dataset (minimum 1000+ rows and 7+ columns).
• Import the dataset into Excel, Power BI, or Python.
2. Data Cleaning & Preprocessing:
• Handle missing data, duplicates, and outliers.
• Apply necessary data transformations (use SQL for database operations
and Python or Excel for preprocessing).
3. Exploratory Data Analysis (EDA):
• Use Python (Pandas, NumPy) for data analysis.
• Visualize data using Matplotlib or Seaborn.
• Use Pivot Tables/Charts in Excel (if applicable).
4. Data Analysis & Visualization:
• Write SQL queries for data analysis.
• Use Power BI to create dashboards with interactive visualizations.
• Alternatively, create charts using Matplotlib and Seaborn in Python.
5. Final Reporting:
• Create a report summarizing your findings, visualizations, and insights.
• Use Excel/Power BI for the report, or Python if preferred.
Submission:
1. Final Report:
• A 3-4 page summary of the project with insights and visualizations.
2. Code/Workbook/Report:
• Python code (Jupyter/Script), Excel workbook, or Power BI report file.
3. Optional Presentation:
• PowerPoint presentation summarizing your project.
Evaluation Criteria:
1. Data Cleaning & Preprocessing (20%)
2. Exploratory Data Analysis (20%)
3. Data Analysis & Visualization (30%)
4. Reporting & Insights (20%)
5. Presentation (10%)
This capstone project encourages you to apply all the skills learned in a real-world
scenario, integrating SQL, Python, Excel, and Power BI.
Datiction!