Data analysis
Data
A collection of facts.
Data analysis
The collection, transformation, and organization of data in order to draw conclusions, make
predictions, and drive informed decision-making.
Two questions to ask: why and what.
Analysts use the following tools to make data-driven decisions:
Computer + your brain + your skills + your traits = job success
Data in everyday life
Steps of data analysis
       • ask
       • prepare
       • process
       • analyze
       • share
       • act
   a)   In the ask phase, you work to understand the challenge to be solved or the question
        to be answered.
   b)   In the prepare phase, you find and collect the data you need to answer your
        question.
   c)   In the process phase, you organize your data.
   d)   In the analyze phase, you analyze the data and uncover answers to the question.
   e)   In the share phase, you present your outcome to decision-makers through whatever mode
        is feasible for you.
   f)   In the act phase, you put the data into action.
A data analyst is an explorer, a detective, and an artist all rolled into one
Disciplines in data analysis
       • machine learning
       • statistics
       • analytics
   a)   statistics: making a few important decisions under uncertainty
   b)   machine learning (AI): making many decisions under uncertainty
   c)   analytics: if you do not know the decisions you have to make before you begin
The excellence of statistics is rigor. Statisticians are essentially philosophers and
epistemologists. They are very, very careful about protecting decision-makers from coming to
the wrong conclusion.
The excellence of an analyst is speed: how quickly you can surf through vast amounts of data to
explore it and discover the gems, the beautiful potential insights that are worth knowing about
and bringing to your decision-makers.
The excellence of the machine learning and AI engineer is performance.
    1. Ask: business challenge, objective, or question
    2. Prepare: data generation, collection, storage, and data management
    3. Process: data cleaning and data integrity
    4. Analyze: data exploration, visualization, and analysis
    5. Share: communicating and interpreting results
    6. Act: putting insights to work to solve the problem
Cloud
A place where you keep data online, rather than on a computer hard drive.
Data scientist versus data analyst
Data science
Creates new ways of modeling and understanding the unknown by using raw data.
Data scientist
Creates new questions by using data.
Data analyst
Answers questions by using data.
Data-driven decision-making
Using facts to guide business strategy.
Steps:
   a) What the business needs
Subject-matter experts
People who know the business area being analyzed well.
Skills for a data analyst
Analytical skills
Qualities and characteristics associated with solving problems using facts.
Five most relevant skills
     • Curiosity
     • Understanding context
     • Having a technical mindset
     • Data design
     • Data strategy
Context: the condition in which something exists or happens
Technical mindset: the ability to break things down into smaller steps or pieces and work with
them in an orderly and logical way.
Data design: how you organize information
Data strategy: the management of the people, processes, and tools used in data analysis.
Analytical thinking
Identifying and defining a problem and then solving it by using data in an organized, step-by-step
manner.
Five key aspects of analytical thinking
    1. Visualization
    2. Strategy
    3. Problem orientation
    4. Correlation
    5. Big-picture and detail-oriented thinking
Visualization: the graphical representation of data.
Strategy: strategic thinking increases focus and productivity.
Problem orientation: keeping the problem to be solved front and center.
Correlation: a relationship between pieces of data.
Big-picture thinking: looking widely at the whole picture of the problem.
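One common way to put a number on a correlation is the Pearson correlation coefficient, which ranges from -1 to 1. The sketch below is only an illustration: the function name pearson_r and the study-hours data are made up for this example and do not come from the course.

```python
import math


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of the same length, with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of the products of each pair's deviations from its mean
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one list is constant")
    return cov / math.sqrt(var_x * var_y)


# Made-up example data (assumption, for illustration only)
hours_studied = [1, 2, 3, 4, 5]
exam_scores = [52, 58, 65, 70, 79]

r = pearson_r(hours_studied, exam_scores)
print(f"r = {r:.3f}")
```

A value close to 1 means the two lists rise together, a value close to -1 means one falls as the other rises, and a value near 0 means there is little linear relationship.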
Core analytical thinking: creative and critical thinking go side by side with analytical thinking.
Root cause: the reason why a problem occurs
Gap analysis: a method for examining and evaluating how a process works currently in order to
get where you want to be in the future.
Terms and definitions for Course 1, Module 1
Analytical skills: Qualities and characteristics associated with using facts to solve problems
Analytical thinking: The process of identifying and defining a problem, then solving it by using
data in an organized, step-by-step manner
Context: The condition in which something exists or happens
Data: A collection of facts
Data analysis: The collection, transformation, and organization of data in order to draw
conclusions, make predictions, and drive informed decision-making
Data analyst: Someone who collects, transforms, and organizes data in order to draw
conclusions, make predictions, and drive informed decision-making
Data analytics: The science of data
Data design: How information is organized
Data-driven decision-making: Using facts to guide business strategy
Data ecosystem: The various elements that interact with one another in order to produce,
manage, store, organize, analyze, and share data
Data science: A field of study that uses raw data to create new ways of modeling and
understanding the unknown
Data strategy: The management of the people, processes, and tools used in data analysis
Data visualization: The graphical representation of data
Dataset: A collection of data that can be manipulated or analyzed as one unit
Gap analysis: A method for examining and evaluating the current state of a process in order to
identify opportunities for improvement in the future
Root cause: The reason why a problem occurs
Technical mindset: The ability to break things down into smaller steps or pieces and work with
them in an orderly and logical way
Visualization: (Refer to data visualization)
11/05/2024
Data life cycle
       • Plan
       • Capture
       • Manage
       • Analyze
       • Archive
       • Destroy
Plan: decide what kind of data is needed and how it will be managed.
Capture: collect the data.
Manage: care for and maintain the data.
Analyze: use the data to solve problems and make decisions.
Archive: store the data in a safe place when it no longer needs to be worked on.
Destroy: delete the data at the end of its life.
Detailed explanation of the data analysis steps
Ask: first, define the problem to be solved; second, understand the stakeholders' expectations.
Prepare: the data analyst collects and stores the data to be used.
Process: transform the data into a useful format.
Analyze: use tools to find the solution to the problem.
Share: the data analyst shares the results with stakeholders so they can make informed decisions.
Act: put the information into practice.
The two tools used most often to analyze data
      • Microsoft Excel
      • Google Sheets
Spreadsheets provide formulas and functions for analyzing data.
Formula: a set of instructions that performs a specific calculation using the data in a spreadsheet.
Function: a preset command that automatically performs a specific task using the data in a
spreadsheet.
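The difference between a formula and a function can be sketched outside a spreadsheet too. The Python below is only an analogy: the sales list is made-up data standing in for cells A1:A4, and the spreadsheet expressions in the comments show the rough equivalent.

```python
import statistics

# Made-up monthly sales figures, standing in for cells A1:A4 (assumption).
sales = [120, 95, 140, 110]

# Formula-style: spell out the calculation yourself, like =A1+A2+A3+A4.
total_by_formula = sales[0] + sales[1] + sales[2] + sales[3]

# Function-style: call a preset command, like =SUM(A1:A4) or =AVERAGE(A1:A4).
total_by_function = sum(sales)
average = statistics.mean(sales)

print(total_by_formula, total_by_function, average)
```

Both approaches give the same total; the function is shorter and keeps working if more values are added to the list.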
Query language: a computer programming language that allows you to retrieve and
manipulate data from a database.
Database: a collection of data stored in a computer system.
Data visualization: the graphical representation of information.
Tableau and Looker are two of the main visualization tools.
12/05/2024
Making a spreadsheet step by step
      • Sorting data
      • Formulas
      • Charts in Google Sheets
What SQL does with data
      • store
      • organize
      • analyze
Three parts of a SQL query
      • SELECT
      • FROM
      • WHERE
   Stages of the data life cycle
          • Plan: starts before the business activity begins; it involves decisions about the data.
          • Capture: where we get the data from.
          • Manage: how we care for the data.
          • Analyze: the stage where the data is analyzed.
          • Archive: the data is kept available but does not need to be used.
          • Destroy: the data is destroyed to protect privacy.
Stages of data analysis
          • Ask: define the problem to solve and understand the stakeholders' wishes and expectations.
          • Prepare: collect and store the data.
          • Process: clean the data, remove problems, and transform it into a useful format.
          • Analyze: use tools to transform the data and support decisions.
          • Share: interpret the results and share them with stakeholders so they can act;
            visualization helps them understand well.
          • Act
Data analysis tools
Attribute: a characteristic or quality of data used to label a column in a table.
To organize data from smallest to largest, use the advanced sorting options in the Data menu.
Formula: a set of instructions that performs a specific calculation using the data in a spreadsheet.
SQL: works like a super-sized spreadsheet.
Query: a request for data or information from a database.
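A basic SQL query is built from SELECT, FROM, and WHERE. The sketch below runs one against a throwaway in-memory SQLite database using Python's built-in sqlite3 module. The movies table, its columns, and its rows are made up for illustration, and the ORDER BY clause is an extra added only to make the output order predictable.

```python
import sqlite3

# In-memory database with a hypothetical "movies" table (made-up data).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE movies (title TEXT, genre TEXT, year INTEGER)")
conn.executemany(
    "INSERT INTO movies VALUES (?, ?, ?)",
    [
        ("Film A", "Action", 2019),
        ("Film B", "Comedy", 2021),
        ("Film C", "Action", 2022),
    ],
)

# SELECT: which columns to return
# FROM:   which table to read
# WHERE:  which rows to keep
query = """
    SELECT title, year
    FROM movies
    WHERE genre = 'Action'
    ORDER BY year
"""
rows = conn.execute(query).fetchall()
for title, year in rows:
    print(title, year)

conn.close()
```

Only the two Action rows come back, because the WHERE clause filters out the Comedy row before anything is returned.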
Structured thinking
The process of recognizing the current problem or situation, organizing the available information,
revealing gaps and opportunities, and identifying the options.
Course 2: Ask phase of data analysis, Module 1
Problem: determine which advertising method is best for reaching Anywhere Gaming Repair's target
audience.
Data analysts work with a variety of problems. This video focuses on six common
types:
       • Making predictions: using data to make an informed decision about how things may be
         in the future.
       • Categorizing things: assigning information to different groups or clusters based on
         common features (for example, least productive at cleaning or highly productive at cooking).
       • Spotting something unusual: identifying data that is different from the norm.
       • Identifying themes: grouping categorized information into broader concepts (for example,
         who was productive enough to be rewarded).
       • Discovering connections: finding similar challenges faced by different entities and
         combining data and insights to address them.
       • Finding patterns: using historical data to understand what happened in the past and is
         therefore likely to happen again.
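Of the six problem types, "spotting something unusual" is the easiest to sketch in code. One simple approach, an assumption here rather than anything the course prescribes, is to flag values that sit more than a chosen number of standard deviations from the mean. The function name find_unusual, the threshold of 2.0, and the step counts are all made up for illustration.

```python
import statistics


def find_unusual(values, threshold=2.0):
    """Return the values that lie more than `threshold` standard deviations from the mean."""
    mean = statistics.mean(values)
    stdev = statistics.pstdev(values)  # population standard deviation
    if stdev == 0:
        return []  # every value is identical, so nothing stands out
    return [v for v in values if abs(v - mean) / stdev > threshold]


# Made-up daily step counts; the last value is deliberately far from the rest.
daily_steps = [8000, 8200, 7900, 8100, 8050, 7950, 8150, 25000]
print(find_unusual(daily_steps))
```

The threshold is a judgment call: with small datasets a single extreme value also inflates the standard deviation, so a very strict threshold may never flag anything.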
28/07/2024
Follow the SMART methodology to ask effective questions:
SMART:
S = Specific: simple, significant, and focused on a single topic or a few closely related ideas
M = Measurable: the question can be quantified and assessed
A = Action-oriented: the question encourages change
R = Relevant: the question matters and has significance to the problem you are trying to solve
T = Time-bound: the question specifies the time period to be studied
Fairness: make sure that your questions do not create or reinforce bias.
Data analyst: analyzes data
Data engineer: applies data
Data scientist: converts data into statistical models
Course 2: Module 2
Businesses and other organizations use data to make better decisions. There are two ways to do this:
   1. Data-driven decision-making
   2. Data-inspired decision-making: exploring different data sources to find out what they have
      in common.
   Algorithm: a process or set of rules to be followed for a specific task.
   Quantitative data: specific and objective measures of numerical facts.
   This can be
           • The what
           • How many
           • How often
Use charts or graphs to visualize quantitative data; qualitative data then gives us more
understanding of why the numbers are the way they are.
Qualitative data: subjective or explanatory measures of qualities and characteristics.
Organizing data turns it into information that can be presented to stakeholders. There are many
tools for this; two of them are as follows.
   1. Reports: static collections of data given to stakeholders periodically
          • Give high-level historical data
          • Easy to design
          • Pre-cleaned and sorted
          • Less visually appealing
          • Continual maintenance
          • Static
   2. Dashboards: monitor live, incoming data
          • Dynamic, automatic, and interactive
          • More stakeholder access
          • Low maintenance
          • Labor-intensive design
          • Can be confusing
          • Potentially uncleaned data
To build a report, we can use a pivot table
       A data summarization tool used in data processing. Pivot tables are used to
       summarize, sort, reorganize, group, count, total, or average data stored in a dataset.
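What a pivot table does (group, count, total, average) can be sketched in a few lines of plain Python. This is only an illustration of the idea, not how a spreadsheet implements it; the pivot function and the sales records are made up for this example.

```python
from collections import defaultdict

# Made-up sales records: one dict per row of a dataset (assumption).
rows = [
    {"region": "North", "product": "Console", "revenue": 300},
    {"region": "North", "product": "Controller", "revenue": 50},
    {"region": "South", "product": "Console", "revenue": 280},
    {"region": "North", "product": "Console", "revenue": 320},
    {"region": "South", "product": "Controller", "revenue": 45},
]


def pivot(rows, group_by, value):
    """Group rows by one column and return count, total, and average of another."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_by]].append(row[value])
    # Sort the group keys so the summary always comes out in the same order.
    return {
        key: {"count": len(vals), "total": sum(vals), "average": sum(vals) / len(vals)}
        for key, vals in sorted(groups.items())
    }


summary = pivot(rows, group_by="region", value="revenue")
for region, stats in summary.items():
    print(region, stats)
```

Changing group_by to "product" regroups the same rows by product, which is the "reorganize" part of a pivot table.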
       Metric
       A single, quantifiable type of data that can be used for measurement.
Mathematical thinking
A powerful skill that helps you solve problems and see solutions.
Errors
#DIV/0!: a formula is trying to divide by zero or by an empty cell; it can be handled with IFERROR
Delimiter error (#ERROR!): the formula cannot be interpreted as input
#N/A: data in the formula cannot be found in the spreadsheet
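The idea behind IFERROR, "try the calculation and show a fallback if it fails", can be sketched in Python with try/except. This is a loose analogy only: if_error is a hypothetical helper written for this example, and the mapping of Python exceptions to spreadsheet errors (ZeroDivisionError for #DIV/0!, KeyError for #N/A) is an assumption, not an exact match.

```python
def if_error(calculation, fallback):
    """Run a zero-argument calculation; return `fallback` if it raises a handled error."""
    try:
        return calculation()
    except (ZeroDivisionError, KeyError, ValueError):
        return fallback


# Roughly like =IFERROR(A1/B1, "Not applicable") when B1 is 0 (made-up values).
revenue, units_sold = 500, 0
price_per_unit = if_error(lambda: revenue / units_sold, "Not applicable")

# Roughly like a lookup that would otherwise return #N/A (made-up data).
prices = {"console": 300}
controller_price = if_error(lambda: prices["controller"], "Not found")

print(price_per_unit, controller_price)
```

As with IFERROR, the fallback hides the error rather than fixing it, so it is worth checking why the error occurred before covering it up.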
 17/10/2024
Problem Domain
The specific area of analysis that encompasses every activity affecting or affected by the
problem.
Structured thinking
The process of recognizing the current problem or situation, organizing available
information, revealing gaps and opportunities, and identifying the options.
Scope of work
An agreed-upon outline of the work you are going to perform on a project.