DATA PROCESSING
Umer Raza Khan
Computer Science Lecturer
Lahore UBAS
Table of Contents
Data Processing
The Data Processing Cycle
The Collection and Computing of data
Manual collection of data
The main methods of data input
Devices used to collect data
Data Verification
Data Validation
Output and Recording of data
Types of data processing systems
Types of Computer Operation
Batch Processing and Real-time Processing
Data Processing
Data processing is the series of
operations that are performed on data
to extract meaningful information,
convert it into a usable form, and
prepare it for storage, analysis, or
presentation.
Data processing can be done manually
or automatically using computer
systems and software applications.
The Data Processing Cycle
The Data Processing Cycle consists of a series of steps that
convert raw data into meaningful information. These steps
ensure that data is collected, processed, and presented in a
useful form. The main stages of the Data Processing Cycle are:
Data Collection
Gathering raw data from various sources such as surveys, sensors, transactions,
and online forms.
Ensuring data accuracy and relevance during the collection phase.
Data Preparation
Cleaning the collected data to remove errors, inconsistencies, and redundancies.
Organizing data into a structured format suitable for analysis.
This step often includes data transformation, normalization, and aggregation.
Data Input
Entering the prepared data into a computer system for further processing.
This can be done manually through data entry or automatically through data
import tools.
Data Processing
Applying algorithms, computations, and transformations to the input data.
This step includes sorting, filtering, summarizing, and performing
calculations.
The goal is to convert raw data into meaningful insights.
Data Output
Presenting the processed data in a usable form such as reports, graphs, tables, or
dashboards.
Ensuring the output is clear, accurate, and actionable for decision-making purposes.
Data Storage
Saving the processed data for future use or reference.
Data can be stored in databases, data warehouses, or other storage systems.
Ensuring data security and accessibility during the storage phase.
Data Retrieval
Accessing stored data when needed for analysis, reporting, or decision-making.
Utilizing query tools and data retrieval techniques to extract specific information.
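As a rough illustration, the stages above can be sketched in a few lines of Python. The sales records, field names, and function names are hypothetical and invented only for this sketch; a dictionary stands in for a real database.

```python
# A minimal sketch of the data processing cycle on made-up sales records.
# All names and values here are illustrative assumptions.

def collect():
    """Collection: raw records as they might arrive from a form or file."""
    return [
        {"item": "pen", "qty": "3", "price": "1.50"},
        {"item": "book", "qty": "1", "price": "12.00"},
        {"item": "pen", "qty": "", "price": "1.50"},   # missing quantity
        {"item": "bag", "qty": "2", "price": "20.00"},
    ]

def prepare(raw):
    """Preparation: drop incomplete rows and convert text to numbers."""
    cleaned = []
    for row in raw:
        if not row["qty"] or not row["price"]:
            continue
        cleaned.append({
            "item": row["item"],
            "qty": int(row["qty"]),
            "price": float(row["price"]),
        })
    return cleaned

def process(records):
    """Processing: calculate a total per row, then sort by that total."""
    for r in records:
        r["total"] = r["qty"] * r["price"]
    return sorted(records, key=lambda r: r["total"], reverse=True)

def output(records):
    """Output: format the results as simple report lines."""
    return [f"{r['item']}: {r['total']:.2f}" for r in records]

storage = {}  # Storage: a dictionary standing in for a database

def store(key, report):
    """Storage: save the report under a key for later use."""
    storage[key] = report

def retrieve(key):
    """Retrieval: look up a previously stored report."""
    return storage[key]

report = output(process(prepare(collect())))
store("daily_sales", report)
print(retrieve("daily_sales"))
```

Input is represented here by the call that passes prepared records into `process`; in a real system it would be a data entry screen or an import tool.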
The Collection and Computing of Data
Data Collection
Data collection is the process of gathering raw data from various
sources to be used for analysis, decision-making, and other purposes.
Effective data collection ensures that the data is accurate, relevant, and
complete. The main methods of data collection include:
Surveys and Questionnaires
Collecting data directly from individuals through structured questions.
Can be conducted online, via phone, or in person.
Sensors and IoT Devices
Using devices to automatically collect data from the environment.
Examples include temperature sensors, motion detectors, and smart
meters.
Transactional Data
Capturing data generated from business transactions such as sales,
purchases, and financial operations.
Often collected through point-of-sale systems and financial software.
Online Forms and Web Scraping
Gathering data from web-based forms filled out by users.
Extracting data from websites using automated tools.
Observations
Manually recording data based on observations of events, behaviors, or
conditions.
Common in fields like market research and social sciences.
Interviews and Focus Groups
Collecting qualitative data through direct interactions with individuals
or groups.
Used to gain deeper insights into opinions, attitudes, and
experiences.
Data Computing:
Data computing involves processing the collected data to
transform it into meaningful information. This process includes several steps:
Data Entry
Inputting collected data into computer systems for processing.
Can be done manually or through automated tools.
Data Cleaning
Removing errors, duplicates, and inconsistencies from the data.
Ensuring the data is accurate and ready for analysis.
Data Transformation
Converting data into a suitable format for analysis.
Includes normalization, aggregation, and encoding.
Data Integration
Combining data from different sources to create a unified dataset.
Ensures consistency and completeness of the data.
Data Visualization
Presenting data in graphical formats such as charts, graphs, and
dashboards.
Helps in understanding trends, patterns, and relationships in the data.
Data Storage
Saving processed data in databases, data warehouses, or cloud storage for future use.
Ensuring data is secure and easily accessible.
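The cleaning, transformation, and integration steps above can be sketched as follows. This is only one possible approach: the student records, the city lookup, and the choice of min-max scaling for normalization are assumptions made for illustration.

```python
# Illustrative sketch: cleaning, transforming, and integrating small datasets.
# The records and field names are invented for this example.

students = [
    {"id": 1, "name": "Ali", "score": 45},
    {"id": 2, "name": "Sara", "score": 90},
    {"id": 2, "name": "Sara", "score": 90},     # duplicate row
    {"id": 3, "name": "Bilal", "score": None},  # missing value
    {"id": 4, "name": "Hina", "score": 60},
]
cities = {1: "Lahore", 2: "Karachi", 4: "Multan"}  # a second data source

def clean(rows):
    """Data cleaning: remove duplicates and rows with missing scores."""
    seen, result = set(), []
    for row in rows:
        if row["score"] is None or row["id"] in seen:
            continue
        seen.add(row["id"])
        result.append(dict(row))  # copy so the original rows stay unchanged
    return result

def normalize(rows):
    """Data transformation: min-max scale scores into the range 0 to 1."""
    scores = [r["score"] for r in rows]
    low, high = min(scores), max(scores)
    for r in rows:
        r["score_norm"] = (r["score"] - low) / (high - low) if high != low else 0.0
    return rows

def integrate(rows, city_by_id):
    """Data integration: attach the city from the second source by id."""
    for r in rows:
        r["city"] = city_by_id.get(r["id"], "unknown")
    return rows

dataset = integrate(normalize(clean(students)), cities)
for row in dataset:
    print(row)
```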
Manual collection of data
Manual data collection involves gathering data by hand, without the use of
automated tools or devices. This method relies on human effort to capture
and record data, often requiring meticulous attention to detail and careful
documentation. Here are some common techniques and considerations for
manual data collection:
Surveys and Questionnaires
Paper-Based Surveys
Interviews
Observation
Direct Observation
Participant Observation
Field Notes
Logs and Diaries
Activity Logs
Diaries
Manual Data Entry
The main methods of data input
1. Manual Data Input
Keyboard Entry
Mouse Input
Touch Screen Input
Voice Input
Handwriting Recognition
2. Automated Data Input
Optical Character Recognition (OCR)
Barcode Scanning
Magnetic Stripe Readers
Radio Frequency Identification (RFID)
Sensors and IoT Devices
3. Specialized Data Input Devices
Digital Cameras and Scanners
Biometric Devices
Graphics Tablets
Game Controllers and Joysticks
4. Data Input from External Sources
Data Import Tools
Web Forms
APIs and Web Services
Devices used to collect data
1. Manual Data Collection Devices
Paper Forms and Notebooks
Clipboards
Manual Counting Devices
2. Digital Data Collection Devices
Smartphones
Tablets
Laptops
Desktops
Digital Cameras
Scanners
3. Sensors and IoT Devices
Environmental Sensors
Wearable Devices
Smart Meters
4. Automated Data Collection Devices
Barcode Scanners
RFID Readers
Magnetic Stripe Readers
5. Specialized Data Collection Devices
GPS Devices
Biometric Devices
Digital Weighing Scales
Data Verification
The process of checking data to ensure it has been accurately
and correctly entered or transferred from one source to
another.
Ensures the data is consistent, accurate, and complete.
Confirms that data matches the original source.
Examples:
Comparing data from a paper form to what was entered into a
digital system.
Checking if the data transferred from one database to another is
identical.
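Both examples above can be sketched in Python. The first function compares a typed entry against the source form field by field; the second compares two copies of a dataset through a SHA-256 digest, which is one common technique but not the only one. The records, field names, and function names are hypothetical.

```python
import hashlib

def find_mismatches(source, entered):
    """Return the field names whose values differ between source and entry."""
    fields = set(source) | set(entered)
    return sorted(f for f in fields if source.get(f) != entered.get(f))

def checksum(rows):
    """Compute a SHA-256 digest over rows so two copies can be compared."""
    digest = hashlib.sha256()
    for row in rows:
        digest.update(repr(row).encode("utf-8"))
    return digest.hexdigest()

# Example 1: paper form versus what was typed into the system.
paper_form = {"name": "Ayesha Khan", "roll_no": "1024", "marks": "78"}
typed_entry = {"name": "Ayesha Khan", "roll_no": "1042", "marks": "78"}
print(find_mismatches(paper_form, typed_entry))  # ['roll_no']

# Example 2: data copied from one store to another.
source_rows = [("1024", "Ayesha Khan", 78), ("1025", "Omar Ali", 64)]
copied_rows = list(source_rows)
print(checksum(source_rows) == checksum(copied_rows))  # True
```

Verification only confirms that the copy matches the source. It does not check whether the source values were sensible in the first place; that is the job of validation.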
Data Validation
The process of ensuring that data meets required formats,
criteria, and standards before it is processed.
Ensures the data is logical, reasonable, and suitable for further
processing and analysis.
Helps in maintaining data integrity and quality.
Examples:
Validating that a user’s age is a valid number within a realistic
range.
Ensuring that an email address entered has the correct structure
(e.g., username@domain.com).
Checking that a required field is not left empty.
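The three examples above (a range check, a format check, and a presence check) might look like this in Python. The field names, the 0 to 120 age range, and the simplified email pattern are assumptions for illustration; real email validation is considerably more involved.

```python
import re

# Deliberately simple pattern: text, "@", text, ".", text, with no spaces.
EMAIL_PATTERN = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")

def validate_record(record, min_age=0, max_age=120):
    """Return a list of validation error messages (empty if the record is valid)."""
    errors = []

    # Presence check: the name field must not be left empty.
    if not (record.get("name") or "").strip():
        errors.append("name is required")

    # Type and range check: age must be a whole number in a realistic range.
    age_text = str(record.get("age", "")).strip()
    if not age_text.isdecimal():
        errors.append("age must be a non-negative whole number")
    elif not min_age <= int(age_text) <= max_age:
        errors.append(f"age must be between {min_age} and {max_age}")

    # Format check: email must look like username@domain.com.
    if not EMAIL_PATTERN.match(record.get("email") or ""):
        errors.append("email has an invalid structure")

    return errors

good = {"name": "Sara", "age": "21", "email": "sara@example.com"}
bad = {"name": " ", "age": "200", "email": "sara@example"}
print(validate_record(good))  # []
print(validate_record(bad))
```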
Output and Recording of data
Output of Data:
The process of presenting processed data in a
usable format for users or systems.
Types of Output:
Visual Output:
Displayed on monitors, screens, or projectors.
Printed Output:
Produced on paper through printers or plotters.
Audio Output:
Sound signals or spoken words generated by devices.
Digital Output:
Data files stored electronically for future reference.
Recording of Data:
Storing processed or raw data for
future retrieval or analysis.
Methods of Recording:
Database Storage:
Organized storage within relational databases or data
warehouses.
File Storage:
Saving data in files on local drives or network storage.
Cloud Storage:
Storing data on remote servers accessed via the internet.
Backup Systems:
Ensuring data redundancy and disaster recovery.
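As a small illustration of database storage and later retrieval, the sketch below uses Python's built-in sqlite3 module with an in-memory database. The table name, columns, and sample rows are invented for this example; a real system would use a file or a database server rather than memory.

```python
import sqlite3

# An in-memory database keeps the example self-contained.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (id INTEGER PRIMARY KEY, item TEXT NOT NULL, amount REAL NOT NULL)"
)

# Recording: store processed records in the table.
rows = [("pen", 4.5), ("book", 12.0), ("bag", 40.0)]
conn.executemany("INSERT INTO sales (item, amount) VALUES (?, ?)", rows)
conn.commit()

def items_over(connection, threshold):
    """Retrieval: return (item, amount) rows with amount above the threshold."""
    cursor = connection.execute(
        "SELECT item, amount FROM sales WHERE amount > ? ORDER BY amount",
        (threshold,),
    )
    return cursor.fetchall()

print(items_over(conn, 10))  # [('book', 12.0), ('bag', 40.0)]
```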
Types of data processing systems
A data processing system is a combination of hardware, software, and
procedures designed to collect, store, process, and retrieve data. These
systems convert raw data into useful information for decision-making,
analysis, and other purposes.
Batch Processing System
Definition:
Processes data in large groups or batches at scheduled intervals.
Example:
Payroll systems that process all employee payments at the end of each
month.
Real-Time Processing System
Definition:
Processes data immediately as it is entered or received, providing instant
output.
Example:
An airline reservation system that updates seat availability in real-time as
bookings are made.
Online Processing System
Definition:
Continuously processes data as it is received, often through an online network.
Example:
E-commerce websites that process transactions as customers place orders online.
Distributed Processing System
Definition:
Spreads data processing tasks across multiple machines or locations to
enhance performance and reliability.
Example:
A distributed database system used by a multinational corporation to
manage data across various global offices.
Multiprocessing System
Definition:
Utilizes multiple CPUs to process data simultaneously, improving processing speed
and efficiency.
Example:
High-performance computing systems used for scientific simulations, where
multiple processors work together to perform complex calculations.
Types of Computer Operation
Types of computer operations refer to the different ways in which
computers can be used to process data and execute tasks,
depending on the nature of the tasks and the resources available.
Batch Processing
Definition:
Executing a series of non-interactive jobs all at once without manual
intervention.
Example:
Processing a batch of bank transactions overnight to update all accounts
simultaneously.
Real-Time Processing
Definition:
Processing data instantly as it is received, ensuring immediate output.
Example:
Air traffic control systems that monitor and manage aircraft in real-time.
Time-Sharing
Definition:
Multiple users share the same computer resources, with the
system allocating time slices to each user.
Example:
Multiple users working on a mainframe computer
simultaneously, such as in a university computer lab.
Multiprocessing
Definition:
A single computer uses multiple processors to execute
multiple tasks simultaneously.
Example:
A server running multiple virtual machines, each
performing different tasks concurrently.
Interactive Processing
Definition:
The user interacts directly with the computer during the
execution of a program.
Example:
Using a word processor to write a document, where the user
interacts with the software in real-time.
Distributed Processing
Definition:
Data processing tasks are divided among multiple
computers located in different places.
Example:
A cloud computing environment where tasks are distributed
across several servers in different locations.
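Of the operation types above, time-sharing is easy to simulate. The sketch below assumes a simple round-robin policy, where each user's job runs for at most one time slice before moving to the back of the queue. The user names and time units are made up, and real operating system schedulers are far more elaborate.

```python
from collections import deque

def round_robin(jobs, time_slice):
    """Simulate time-sharing with fixed time slices.

    jobs maps a user name to the total time units that user's job needs.
    Returns the sequence of (user, units_run) turns in the order they occur.
    """
    if time_slice <= 0:
        raise ValueError("time_slice must be positive")
    queue = deque(jobs.items())
    schedule = []
    while queue:
        user, remaining = queue.popleft()
        run = min(time_slice, remaining)
        schedule.append((user, run))
        if remaining > run:
            # Job not finished: it rejoins the back of the queue.
            queue.append((user, remaining - run))
    return schedule

print(round_robin({"user_a": 5, "user_b": 2, "user_c": 4}, time_slice=2))
```

Each user gets regular turns, so no single long job blocks the others.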
Batch Processing and Real-time Processing
Batch Processing
Definition:
Processing large volumes of data in groups or batches at specific intervals.
Use Cases:
End-of-day processing in banks: Updating all transactions and generating reports overnight.
Payroll systems: Calculating and distributing employee salaries at the end of the month.
Advantages:
Reduces operational costs as tasks are processed together.
Efficient for handling large volumes of data.
Disadvantages:
Not suitable for time-sensitive tasks, as processing is delayed until the batch is run.
Real-time Processing
Definition:
Processing data immediately as it is received, providing instant results.
Use Cases:
Airline reservation systems: Updating seat availability as bookings are made.
Stock trading platforms: Executing trades and updating stock prices in real-time.
Advantages:
Immediate processing and feedback.
Crucial for time-sensitive operations where delays are unacceptable.
Disadvantages:
Higher resource usage and operational costs.
More complex to implement and maintain.
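The difference between the two approaches can be shown with a small sketch. The account names, amounts, and the `BatchProcessor` class are hypothetical, written only to contrast when the work happens: immediately on arrival, or later when the batch is run.

```python
def apply_transaction(balances, txn):
    """Apply one (account, amount) transaction to the balances dictionary."""
    account, amount = txn
    balances[account] = balances.get(account, 0) + amount

def process_real_time(balances, txn):
    """Real-time: apply the transaction the moment it arrives."""
    apply_transaction(balances, txn)
    return balances[txn[0]]

class BatchProcessor:
    """Batch: collect transactions, then apply them all in one run."""

    def __init__(self):
        self.pending = []

    def submit(self, txn):
        self.pending.append(txn)  # nothing is processed yet

    def run(self, balances):
        count = len(self.pending)
        for txn in self.pending:
            apply_transaction(balances, txn)
        self.pending.clear()
        return count

transactions = [("acc1", 100), ("acc2", 50), ("acc1", -30)]

# Real-time: balances are up to date after every transaction.
rt_balances = {}
for txn in transactions:
    process_real_time(rt_balances, txn)

# Batch: balances stay unchanged until the scheduled run.
batch_balances = {}
batch = BatchProcessor()
for txn in transactions:
    batch.submit(txn)
balances_before_run = dict(batch_balances)
processed = batch.run(batch_balances)

print(rt_balances, balances_before_run, batch_balances, processed)
```

Both approaches reach the same final balances; they differ only in when the results become available.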
Any Questions?
Thank You