
IX Part B Unit 2 Data Literacy Notes


Part B

UNIT 2 : Data Literacy


2.1 Basics of Data Literacy
❖ Define Data Literacy.
➢ Data literacy means knowing how to understand, work with, and talk about data. It's
about being able to collect, analyze, and show data in ways that make sense.
➢ A data-literate person is one who can interact with data to understand the world
around them.

❖ What is a Data Pyramid?


➢ It is made up of the different stages of working with data, following the DIKW model
➢ D - Data, I - Information, K - Knowledge, W - Wisdom
➢ Moving up from the bottom
● Data is available in a raw form. Data in this form is not very useful.
● Data is processed to give us information about the world.
● Information about the world leads to knowledge of how things are happening.
● Wisdom allows us to understand why things are happening in a particular way.
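
The climb up the pyramid can be sketched in a few lines of Python. This is only an illustration: the temperature readings are made-up values, and the names `readings`, `average`, `highest` and `rising` are assumptions, not part of the notes.

```python
from statistics import mean

# Data: raw readings (hypothetical daily temperatures in degrees Celsius).
# On their own they are just numbers and not very useful.
readings = [31, 33, 35, 36, 38]

# Information: processing the raw data gives a summary about the world.
average = mean(readings)
highest = max(readings)

# Knowledge: a pattern that tells us how things are happening.
rising = all(later > earlier for earlier, later in zip(readings, readings[1:]))

print(f"Information: average = {average}, highest = {highest}")
print(f"Knowledge: temperature rose every day? {rising}")

# Wisdom (why it is rising, and what to do about it) needs human judgement
# and context, so it is deliberately not computed here.
```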

❖ Define Data Literacy Process Framework.


➢ The data literacy framework provides guidance on using data efficiently and with all
levels of awareness.
➢ Data literacy framework is an iterative process.
❖ What is data privacy?
➢ Data privacy, also referred to as information privacy, is concerned with the proper
handling of sensitive data, including personal data and other confidential data, in
order to meet regulatory requirements and to protect the confidentiality and
immutability of the data.

❖ Why is data privacy important?


➢ A data breach at a government agency can put top secret information in the hands of
an enemy state.
➢ A breach at a hospital can put personal health information in the hands of those who
might misuse it.
➢ A breach at a corporation can put proprietary data in the hands of a competitor.

❖ What is data security?


➢ Data security is the practice of protecting digital information from unauthorized
access, corruption, or theft throughout its entire lifecycle.

❖ Why is data security important?


➢ Due to the rising amount of data in the cloud there is an increased risk of cyber threats.
➢ The main reasons why data security is more important now are:
- Cyber-attacks can affect everyone
- Rapid technological change is likely to increase the number of cyber-attacks
❖ What are the best practices for cyber security?

2.2 Acquiring Data, Processing, and Interpreting Data


❖ What are the different types of data?
➢ There are 2 types of data - Textual and Numeric data. The numeric data is further
classified into Continuous and Discrete data.

❖ Differentiate between textual data and numeric data.

❖ Differentiate between continuous and discrete data.


➢ Continuous data is numeric data that can take any value within a range, including
fractional values. E.g. height, weight, temperature, voltage
➢ Discrete data is numeric data that contains only whole numbers and cannot be
fractional. E.g. the number of students in the class – it can only be a whole number.
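
A rough way to tell the two apart in code is to check whether every value is a whole number. The sketch below is a simplification (a continuous quantity that happens to be recorded in whole numbers would be labelled discrete), and the function name `numeric_kind` and the sample values are assumptions made for illustration.

```python
def numeric_kind(values):
    """Return 'discrete' if every value is a whole number, else 'continuous'."""
    if all(float(v).is_integer() for v in values):
        return "discrete"
    return "continuous"


heights_cm = [150.5, 148.2, 162.0]   # height can be fractional
students_per_class = [32, 35, 30]    # a head count is always whole

print(numeric_kind(heights_cm))
print(numeric_kind(students_per_class))
```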

❖ What are the different types of data used in the 3 domains of AI?
➢ Natural Language Processing : Textual Data - Text, Audio
➢ Computer Vision : Visual data - Image, Video
➢ Data Science : Numeric data - Numbers, Tables, Excel sheets

❖ What is data acquisition?


➢ Data Acquisition, also known as acquiring data, refers to the procedure of gathering
data. This involves searching for datasets suitable for training AI models. The process
typically comprises three key steps:
■ Data Discovery - It refers to the process of searching for new
datasets
■ Data Augmentation - It refers to the process of adding more data to
the existing data
■ Data Generation - It refers to the process of generating data if it is
not available
➢ For example, if we need to make a CV model for a self-driving car
■ Data discovery - We require pictures of roads & the objects on
roads and for that we can search & download this data from the
internet
■ Data Augmentation - We can increase the amount of data by adding
copies of existing data with small changes. We get data by
changing different parameters like color and brightness.
■ Data Generation - We need to generate or record data using sensors
like recording temperature readings of a building and storing it in a
computer in a suitable form
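
Data augmentation by changing brightness can be sketched as follows. A tiny grayscale image is represented here as a list of rows of pixel values from 0 to 255. That representation, the helper name `change_brightness` and the pixel values are assumptions for illustration; real projects normally use an image library.

```python
def change_brightness(image, delta):
    """Return a copy of a grayscale image with every pixel shifted by delta.

    Values are clamped so that they stay within the valid 0-255 range.
    """
    return [[max(0, min(255, pixel + delta)) for pixel in row] for row in image]


# Hypothetical 2x2 grayscale image
original = [[100, 150],
            [200, 250]]

# One original picture becomes three training examples
augmented = [
    original,
    change_brightness(original, 30),    # brighter copy
    change_brightness(original, -30),   # darker copy
]

for picture in augmented:
    print(picture)
```

The original image is left unchanged, so every copy can be kept alongside it in the dataset.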

❖ What are the different types of data sources?


➢ Primary Data Sources - Some of the sources for primary data include surveys,
interviews, experiments, etc. The data generated from the experiment is an example of
primary data.
➢ Secondary Data Sources - Secondary data collection obtains information from external
sources, rather than generating it personally. Ex: Kaggle, Government databases,
Government dataset search, UCI etc.

❖ Mention the checklist of factors which determine whether data is good or bad.


❖ Mention the ethical concerns in data acquisition.

❖ What are the features of data and data pre-processing?


➢ There are three primary factors determining the usability of data:
■ Structure - It defines how the data is stored.
■ Cleanliness - Clean data is free from duplicates, missing values,
outliers, and other anomalies that may affect its reliability.
■ Accuracy - Accuracy indicates how well the data matches real-
world values, ensuring reliability.
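
The cleanliness checks above can be sketched as a small function that drops duplicates, missing values and implausible values. The student records, the function name `clean` and the allowed age range of 5 to 25 are assumptions chosen only for this example.

```python
records = [
    {"name": "Asha", "age": 14},
    {"name": "Ravi", "age": 15},
    {"name": "Asha", "age": 14},     # duplicate of the first record
    {"name": "Meena", "age": None},  # missing value
    {"name": "Kabir", "age": 140},   # implausible value (outlier)
]


def clean(rows, min_age=5, max_age=25):
    """Keep only unique rows whose age is present and within the allowed range."""
    seen = set()
    cleaned = []
    for row in rows:
        key = (row["name"], row["age"])
        if key in seen:
            continue  # duplicate
        if row["age"] is None:
            continue  # missing value
        if not min_age <= row["age"] <= max_age:
            continue  # outlier
        seen.add(key)
        cleaned.append(row)
    return cleaned


print(clean(records))
```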
❖ Define data features.
➢ Data features are the characteristics or properties of the data which describe each
piece of information in a dataset. These features help us understand and analyze the
data.
➢ For example, in a table of student records, features could include things like the
student's name, age, or grade. In a photo dataset, features might be the colors present
in each image.
➢ In AI models, we need two types of features: independent and dependent.
■ Independent features are the input to the model—they're the
information we provide to make predictions.
■ Dependent features, on the other hand, are the outputs or results of
the model—they're what we're trying to predict
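
In code, separating the two kinds of features usually means splitting each record into inputs and an output. The sketch below assumes a hypothetical table in which `hours_studied` and `attendance` are the independent features and `marks` is the dependent feature; all names and values are invented for illustration.

```python
students = [
    {"hours_studied": 2, "attendance": 80, "marks": 55},
    {"hours_studied": 4, "attendance": 90, "marks": 72},
    {"hours_studied": 6, "attendance": 95, "marks": 88},
]

INDEPENDENT = ["hours_studied", "attendance"]  # inputs given to the model
DEPENDENT = "marks"                            # output the model should predict

# X holds the independent features, y holds the dependent feature
X = [[row[name] for name in INDEPENDENT] for row in students]
y = [row[DEPENDENT] for row in students]

print(X)
print(y)
```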

❖ Differentiate between data processing and data interpretation.


➢ Data Processing - It helps computers understand raw data. Use of computers to
perform different operations on data is included under data processing.
➢ Data Interpretation - It is the process of making sense out of data that has been
processed. The interpretation of data helps us answer critical questions using data.

❖ Mention the steps involved in data processing and interpretation.


➢ Acquire Data - Acquiring data is to collect data from various data sources.
➢ Data Processing - After raw data is collected, data is processed to derive meaningful
information from it.
➢ Data Analysis - Data analysis is to examine each component of the data in order to
draw conclusions.
➢ Data Interpretation - It is to be able to explain what these findings/conclusions mean
in a given context.
➢ Data Presentation - In this step, you select, organize, and group ideas and evidence in
a logical way.
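
The five steps can be pictured as a chain of small functions, each feeding the next. This is a minimal sketch: the survey answers, the function names and the target score of 7 are assumptions, not part of the notes.

```python
from statistics import mean


def acquire():
    # Acquire: hypothetical survey ratings as collected (text, one left blank)
    return ["7", "8", "", "6", "9"]


def process(raw):
    # Process: drop blank answers and convert text to numbers
    return [int(answer) for answer in raw if answer.strip()]


def analyze(values):
    # Analyze: examine the data to draw a conclusion
    return {"count": len(values), "average": mean(values)}


def interpret(summary, target=7):
    # Interpret: explain what the finding means in context (assumed target)
    return "above target" if summary["average"] >= target else "below target"


def present(summary, meaning):
    # Present: organize the result in a readable form
    return f"{summary['count']} responses, average {summary['average']}: {meaning}"


summary = analyze(process(acquire()))
report = present(summary, interpret(summary))
print(report)
```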

❖ What are the different methods of data interpretation?


➢ Based on the two types of data, there are two ways to interpret data:
■ Qualitative Data Interpretation - Qualitative data tells us about the
emotions and feelings of people. It is focused on the insights and
motivations of people. Example: customer reviews or feedback
■ Quantitative Data Interpretation - Quantitative data interpretation is
made on numerical data. It helps us answer questions like “when,”
“how many,” and “how often”. Example: the number of likes (how
many) on an Instagram post
❖ Differentiate between the different methods of data interpretation.

❖ What are the data collection methods for qualitative data interpretation?
➢ Record keeping: This method uses existing reliable documents and other similar
sources of information as the data source. It is similar to going to a library.
➢ Observation: In this method, the participants' behavior and emotions are
observed carefully.
➢ Case Studies: In this method, data is collected from case studies.
➢ Focus groups: In this method, data is collected from a group discussion on relevant
topics.
➢ Longitudinal Studies: This data collection method is performed on the same data
source repeatedly over an extended period.
➢ One-to-One Interviews: In this method, data is collected using a one-to-one interview.

❖ What are the data collection methods for quantitative data interpretation?
➢ Interviews: Quantitative interviews play a key role in collecting information.
➢ Polls: A poll is a type of survey that asks simple questions to respondents. Polls are
usually limited to one question.
➢ Observations: Quantitative data can be collected through observations in a particular
time period
➢ Longitudinal Studies: A type of study conducted over a long time
➢ Survey: Surveys can be conducted for a large number of people to collect quantitative
data.

❖ What are the steps involved in qualitative data analysis?


1. Collect Data
2. Organize
3. Set a code to the Data Collected
4. Analyze your data
5. Reporting
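
Step 3, setting a code to the collected data, means tagging each response with a short label so that similar responses can be grouped and counted. One possible sketch uses keyword matching. The codebook, the reviews and the function name `assign_codes` are all assumptions for illustration, and in practice coding is often done by hand.

```python
from collections import Counter

# Hypothetical codebook: code -> keywords that signal it
CODEBOOK = {
    "taste": ["tasty", "delicious", "bland"],
    "service": ["waiter", "slow", "friendly"],
    "price": ["expensive", "cheap"],
}


def assign_codes(text):
    """Return the sorted list of codes whose keywords appear in the text."""
    words = text.lower().split()
    return sorted(
        code
        for code, keywords in CODEBOOK.items()
        if any(keyword in words for keyword in keywords)
    )


reviews = [
    "The pizza was delicious",
    "Service was slow and the waiter was rude",
    "Tasty but expensive",
]

coded = [assign_codes(review) for review in reviews]          # step 3: code
counts = Counter(code for codes in coded for code in codes)   # step 4: analyze

print(coded)
print(counts)
```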
❖ What are the steps involved in quantitative data analysis?
1. Relate measurement scales with variables
2. Connect descriptive statistics with data
3. Decide a measurement scale
4. Represent data in an appropriate format
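
Connecting descriptive statistics with data (step 2) can be illustrated with Python's built-in `statistics` module. The list of likes below is made-up data, and the choice of mean, median and mode as the statistics is an assumption for this example.

```python
from statistics import mean, median, mode

# Hypothetical number of likes on five Instagram posts
likes = [12, 15, 15, 20, 48]

summary = {
    "mean": mean(likes),      # arithmetic average
    "median": median(likes),  # middle value when sorted
    "mode": mode(likes),      # most frequent value
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

The mean is pulled upwards by the single post with 48 likes, while the median is not, which is why more than one statistic is usually reported.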

❖ What are the different types of data interpretation?

➢ Textual DI
▪ The data is mentioned in the text form, usually in a paragraph.
▪ Used when the data is not large and can be easily comprehended by reading.
▪ Textual presentation is not suitable for large data.
➢ Tabular DI
▪ Data is represented systematically in the form of rows and columns.
▪ The title of the table describes the content of the table.
▪ The column headings describe the information contained in the
columns.
➢ Graphical DI
▪ Visual representation of data
Ex: Bar graph, Line chart, Pie chart
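
A tabular presentation with a title, column headings and rows can be produced with a short helper. The function name `format_table` and the marks shown are assumptions made for this sketch.

```python
def format_table(title, headings, rows):
    """Return a plain-text table: title, column headings, a rule, then rows."""
    # Width of each column = longest cell in that column, heading included
    widths = [
        max(len(str(cell)) for cell in column)
        for column in zip(headings, *rows)
    ]

    def line(cells):
        return " | ".join(str(cell).ljust(width) for cell, width in zip(cells, widths))

    rule = "-+-".join("-" * width for width in widths)
    return "\n".join([title, line(headings), rule] + [line(row) for row in rows])


table = format_table(
    "Unit test marks",                 # title describes the table content
    ["Name", "Marks"],                 # column headings
    [["Asha", 82], ["Ravi", 7]],       # hypothetical rows
)
print(table)
```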

2.3 Project Interactive Data Dashboard & Presentation


❖ Mention the importance of data visualization.
➢ Informed Decision Making
■ A decision is only as good as the knowledge it is based on
■ Ex - Since the average height of students is known, the school can custom-design
the chairs and tables to suit the requirements of the class
➢ Reduced Cost
■ Identifying needs can lead to reduction in cost
■ Ex - Restaurant owner could decide to drop/modify some dishes of the menu
which aren’t popular or have got bad reviews
➢ Identifying Needs
■ We can identify needs of people by data interpretation
■ Ex - Veg Farmhouse Pizza is a popular choice among age group 8-10

❖ Mention a few data visualization platforms.


➢ Tableau
➢ Datawrapper
➢ MS Excel
