
Big Data Notes

Big data refers to large volumes of structured, unstructured, and semi-structured data that is difficult to process using traditional data processing applications. It is characterized by high volume, velocity, and variety. Businesses can analyze big data to gain insights that improve decision making, customer service, and operational efficiency. However, big data projects often fail in Pakistan due to a lack of quality data, unclear objectives, and an absence of skilled personnel to manage such projects.

Uploaded by

Vivek Kavta

BIG DATA

 Big data is a term that describes the large volume of data, both structured and
unstructured, that inundates a business on a day-to-day basis.
OR
 The term “big data” refers to data that is so large, fast or complex that it’s difficult
or impossible to process using traditional methods. 
OR
 “Big data is high-volume, high-velocity, and high-variety information assets that
demand cost-effective, innovative forms of information processing for enhanced
insight and decision making.”
 It includes data mining, data storage, data analysis, data sharing, and data
visualization.
 Big data can be analyzed for insights that lead to better decisions and strategic
business moves.
 Examples of big data sources include stock exchanges, social media sites, and jet
engines.

TYPES OF BIG DATA

 STRUCTURED
 Any data that can be stored, accessed, and processed in a fixed format is termed
'structured' data.
 Example: data stored in a relational database management system, such as an
employee table in a company database.

 UNSTRUCTURED
 Any data whose form or structure is unknown is classified as unstructured data.
 Examples: email; a heterogeneous data source containing a combination of simple
text files, images, videos, etc.; the output returned by a Google search.

 SEMI-STRUCTURED
 Semi-structured data combines both forms mentioned above, that is, structured
and unstructured data: it follows no fixed schema, but carries tags or markers
that organize its content.
 Example: Personal data stored in an XML file.
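
The three types above can be illustrated with a minimal Python sketch. The table name, field names, and sample values are hypothetical and chosen only for illustration; the sketch uses the standard-library sqlite3 and xml.etree.ElementTree modules.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Structured: rows in a relational table with a fixed schema (hypothetical data).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (id INTEGER PRIMARY KEY, name TEXT, salary REAL)")
conn.execute("INSERT INTO employee (id, name, salary) VALUES (1, 'Asha', 52000.0)")
structured_row = conn.execute("SELECT id, name, salary FROM employee").fetchone()
conn.close()

# Semi-structured: XML tags label the fields, but no fixed schema is enforced.
xml_text = "<person><name>Asha</name><phone>555-0100</phone></person>"
person = ET.fromstring(xml_text)
semi_structured = {child.tag: child.text for child in person}

# Unstructured: free text with no predefined fields; only generic operations apply.
email_body = "Hi team, the quarterly report is attached. Regards, Asha"
word_count = len(email_body.split())

print(structured_row)
print(semi_structured)
print(word_count)
```

The structured row can be queried by column, the XML record can be read by tag, and the email body offers nothing beyond generic text operations such as counting words.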

CHARACTERISTICS OF BIG DATA

 VOLUME
 Big data implies huge ‘volumes’ of data generated on a daily basis from various
sources such as social media platforms, business processes, machines, networks,
and human interactions.
 Such large amounts of data are stored in data warehouses.

 VARIETY
 Variety refers to the structured, unstructured, and semi-structured data that is
gathered from multiple sources.
 Nowadays, unstructured data such as emails, photos, videos, monitoring-device
output, PDFs, and audio is also considered in analysis applications.

 VELOCITY
 Velocity essentially refers to the speed at which data is being created in real
time.
 It deals with the speed at which data flows in from sources such as business
processes, application logs, networks, social media sites, sensors, and mobile
devices.

 VARIABILITY
 Variability refers to the inconsistency that the data can show at times.
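
As a rough illustration of the first three characteristics, the sketch below summarizes a small batch of events. The event tuples and the summarize function are assumptions made for this example, not part of any standard tool.

```python
from collections import Counter

# Hypothetical event log: (arrival time in seconds, source format, size in bytes).
events = [
    (0.0, "json", 512),
    (0.5, "image", 20480),
    (1.0, "text", 128),
    (1.5, "json", 640),
    (2.0, "video", 1048576),
]

def summarize(batch):
    """Return rough volume, variety, and velocity figures for a batch of events."""
    total_bytes = sum(size for _, _, size in batch)         # volume: how much data
    formats = Counter(fmt for _, fmt, _ in batch)           # variety: which kinds of data
    times = [t for t, _, _ in batch]
    span = max(times) - min(times)                          # seconds covered by the batch
    rate = len(batch) / span if span > 0 else float("nan")  # velocity: events per second
    return total_bytes, formats, rate

total_bytes, formats, rate = summarize(events)
print(total_bytes, dict(formats), rate)
```

Variability would show up here as fields that change meaning or format from one batch to the next, which a fixed summary like this one would not detect on its own.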

BENEFITS OF BIG DATA PROCESSING

 Businesses can utilize outside intelligence when making decisions
 Improved customer service
 Early identification of risk to the product/services, if any
 Better operational efficiency

WHY IS BIG DATA IMPORTANT?

 The importance of big data does not revolve around how much data a company
has but how a company utilizes the collected data.
 The company can take data from any source and analyze it to find answers
which will enable:
Cost reductions,
Time reductions,
New product development and optimized offerings, and
Smart decision making. 

 When you combine big data with high-powered analytics, you can accomplish
business-related tasks such as:
Determining root causes of failures, issues and defects in near-real time.
Generating coupons at the point of sale based on the customer’s buying
habits.
Recalculating entire risk portfolios in minutes.
Detecting fraudulent behavior before it affects your organization.
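
One of the tasks above, detecting fraudulent behavior, can be sketched with a very simple statistical rule. This is only an assumed toy approach (flagging amounts far above a customer's past average); real fraud detection systems are considerably more involved. The function name, threshold, and sample amounts are hypothetical.

```python
from statistics import mean, stdev

def is_suspicious(history, amount, threshold=3.0):
    """Flag an amount lying more than `threshold` standard deviations above the mean of past amounts."""
    if len(history) < 2:
        return False              # not enough history to estimate the spread
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        return amount > mu        # all past amounts identical: anything larger stands out
    return (amount - mu) / sigma > threshold

# Hypothetical purchase history for one customer.
history = [20.0, 25.0, 22.0, 30.0, 27.0]
print(is_suspicious(history, 26.0))
print(is_suspicious(history, 500.0))
```

A purchase close to the customer's usual spending passes, while one far outside it is flagged for review before it is processed.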

REASONS WHY BIG DATA IS NOT ACCEPTED IN PAKISTAN

 LACK OF DATA
 Poor data quality and accuracy are a major obstacle to the success of a company’s
analytics efforts.
 Most analysts feel that the data provided to them is inaccurate or incomplete.
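
Incompleteness of the kind described above can be measured before analysis begins. The following is a minimal sketch, assuming records are plain dictionaries; the completeness function and the sample records are hypothetical.

```python
def completeness(records, required_fields):
    """Return the fraction of records in which every required field is present and non-empty."""
    if not records:
        return 0.0
    complete = sum(
        all(rec.get(field) not in (None, "") for field in required_fields)
        for rec in records
    )
    return complete / len(records)

# Hypothetical customer records, two of which have a missing value.
records = [
    {"id": 1, "name": "Asha", "city": "Lahore"},
    {"id": 2, "name": "", "city": "Karachi"},
    {"id": 3, "name": "Bilal", "city": None},
    {"id": 4, "name": "Sana", "city": "Quetta"},
]
score = completeness(records, ["id", "name", "city"])
print(score)
```

A low score is an early warning that any analysis built on the data is likely to mislead.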

 THE STRUCTURE OF DATA
 Inaccurate, outdated, and incomplete data is bad for business, especially in terms
of profitability and competitive advantage.
 Hence the dire (urgent or serious) need for data to be structured so that it can
be put to best use.

 ASKING THE WRONG QUESTIONS OR LACK OF BUSINESS OBJECTIVES
 Companies often start with an overly ambitious project and fail to tackle it.
 But most importantly, companies lack objectives or are simply asking the wrong
questions. 

 POOR MANAGEMENT OVERVIEW
 Big data projects succeed when they’re not “isolated projects” but instead are the
core of how the company plans to utilize its data.

 MISSING TIMELINE
 Results delivered long after the desired time are also a reason for failure among
smaller companies.
 A need for discipline and better time management must be instilled (taught or
introduced) from the beginning.

 LACK OF SKILLED MANPOWER
 Too many big data projects fail due to the lack of skilled manpower.
 A critical element is to have a team which brings the right talent on board.

HOW BIG DATA WORKS

1) Set a big data strategy.
2) Identify big data sources.
3) Access, manage and store the big data.
4) Analyze the big data.
5) Make intelligent, data-driven decisions.
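
The five steps above can be sketched as a tiny pipeline. Everything here is an assumption for illustration: the business question, the two in-memory feeds standing in for real sources, and the function names.

```python
from statistics import mean

# Step 1: strategy, expressed here as the question the analysis must answer.
QUESTION = "Which store has the highest average sale?"

# Step 2: identify sources. Two in-memory lists stand in for real feeds.
def load_sources():
    pos_feed = [{"store": "A", "amount": 120.0}, {"store": "B", "amount": 80.0}]
    web_feed = [
        {"store": "A", "amount": 60.0},
        {"store": "B", "amount": 110.0},
        {"store": "B", "amount": None},  # an unusable record
    ]
    return pos_feed + web_feed

# Step 3: access, manage, and store. Drop unusable records and group amounts by store.
def store_records(records):
    table = {}
    for rec in records:
        if rec["amount"] is not None:
            table.setdefault(rec["store"], []).append(rec["amount"])
    return table

# Step 4: analyze. Compute the average sale per store.
def analyze(table):
    return {name: mean(amounts) for name, amounts in table.items()}

# Step 5: decide. Pick the store with the highest average.
def decide(averages):
    return max(averages, key=averages.get)

averages = analyze(store_records(load_sources()))
best_store = decide(averages)
print(QUESTION, "->", best_store, averages)
```

In practice each step would be backed by dedicated storage and analytics systems; the sketch only shows how the output of one step feeds the next.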

REAL-TIME BIG DATA ANALYTICS TOOLS

 Apache Storm (open-sourced by Twitter)
 GridGain (made for Java)
 SpaceCurve
 Cloudera
