KEMBAR78
Big Data Overview | PDF | Big Data | Data Management
0% found this document useful (0 votes)
47 views23 pages

Big Data Overview

Uploaded by

Srijan Pramanik
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
47 views23 pages

Big Data Overview

Uploaded by

Srijan Pramanik
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 23

A BIG DATA CHEAT SHEET:

THE BIG PHARMA EDITION


TAMARA DULL, DIRECTOR OF EMERGING TECHNOLOGIES

C o p yrig ht © 2 0 1 2 , S A S I ns titute I nc . A ll rig hts re s e rve d .


PATIENT HOSPITAL FINANCIAL INSURANCE
20%
SCHEDULING
Big data RECORDS ADMISSIONS DATA DATA DATA
is not
new. WORD
SPREAD-
EMAIL PDF FILES PROCESSING RFID TAGS
SHEETS DOCUMENTS

SOCIAL
WEB LOG SATELLITE
GPS PHOTOS MEDIA
DATA IMAGES
DATA
80%
RESEARCH LAB CLINICAL
FORUMS VIDEOS
DATA RESULTS TRIALS

MOBILE WEBSITE MARKETING AUDIO


OPEN DATA
DATA CONTENT DATA FILES

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


HERE’S OUR 3-4-5 PLAN:

 3 Definitions

 4 Trends

 5 Questions

C o p yrig ht © 2 0 1 2 , S A S I ns titute I nc . A ll rig hts re s e rve d .


3 DEFINITIONS

C o p yrig ht © 2 0 1 2 , S A S I ns titute I nc . A ll rig hts re s e rve d .


THE DEFINITIONS BIG DATA

―Big Data refers to electronic health data


sets so large and complex that they are
difficult (or impossible) to manage with
traditional software and/or hardware; nor
can they be easily managed with
traditional or common data management ―That amount of data
tools and methods… or complexity which
puts you out of your
Volume, Velocity, and Variety—often comfort zone.‖
referred to as the three V’s of Big Data— Paul Kent
capture the true meaning of Big Data.‖ VP of Big Data
SAS Institute

SOURCE: Frost & Sullivan: “Drowning in Big Data? Reducing Information Technology
Complexities and Costs for Healthcare Organizations”

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


THE DEFINITIONS HADOOP

Is it a project…

…or an ecosystem?

NOTE:
Hadoop is not
synonymous
with big data

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


THE DEFINITIONS DATA LAKE

―If you think of a datamart as a store of


bottled water – cleansed and packaged
―A data lake is a storage repository that
and structured for easy consumption – the
holds a vast amount of raw data in its
data lake is a large body of water in a
native format, including structured, semi-
more natural state. The contents of the
structured, and unstructured data. The
data lake stream in from a source to fill
data structure and requirements are not
the lake, and various users of the lake can
defined until the data is needed.‖
come to examine, dive in, or take
samples.‖
James Dixon
CTO, Founder & Chief Geek
Pentaho

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


4 TRENDS

C o p yrig ht © 2 0 1 2 , S A S I ns titute I nc . A ll rig hts re s e rve d .


The market is growing.

SOURCE: http://wikibon.org/wiki/v/Big_Data_Vendor_Revenue_and_Market_Forecast_2013-2017

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


The success rate is meh.

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


People issues trump technology issues.

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


Analytics keeps them coming back.

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


5 QUESTIONS

C o p yrig ht © 2 0 1 2 , S A S I ns titute I nc . A ll rig hts re s e rve d .


HERE’S THE 5 QUESTIONS:

1. What can Hadoop do that my data


warehouse can’t?
2. We’re not doing “big” data, so why do we
need Hadoop?
3. Is Hadoop enterprise-ready?
4. How is big data impacting Big Pharma
today?
5. What are the primary threats to big data
adoption?

C o p yrig ht © 2 0 1 2 , S A S I ns titute I nc . A ll rig hts re s e rve d .


QUESTION #1 WHAT CAN HADOOP DO THAT MY DATA WAREHOUSE CAN’T?

$ 1. Store data more cheaply.

2. Process data more quickly


(and cheaply).

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


QUESTION #2 WE’RE NOT DOING “BIG” DATA, SO WHY DO WE NEED HADOOP?

Stage structured data. Process structured data. Archive any data.

Process any data. Access any data. Access any data.


(via data warehouse) (via Hadoop)

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


QUESTION #3 IS HADOOP REALLY ENTERPRISE-READY?

For your organization: Maybe

For all organizations: No


Are we
there
yet?

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


QUESTION #4 HOW IS BIG DATA IMPACTING BIG PHARMA TODAY?

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


QUESTION #5 WHAT ARE THE PRIMARY THREATS TO BIG DATA ADOPTION?

IT
PRIVACY SECURITY

analytics

science business

SKILLS

Copyright © 2012, SAS I ns titute I nc . All rights res erved.


WRAP-UP

C o p yrig ht © 2 0 1 2 , S A S I ns titute I nc . A ll rig hts re s e rve d .


HERE ARE YOUR KEY TAKEAWAYS:

 It’s the big data technologies – not the


data itself – that’s new

 Understand the context when talking


about Hadoop

 If you’re doing big data without


analytics, you’re wasting your time

 Approach big data smartly and learn


from other…industries, mistakes, etc.

C o p yrig ht © 2 0 1 2 , S A S I ns titute I nc . A ll rig hts re s e rve d .


C o p yrig ht © 2 0 1 2 , S A S I ns titute I nc . A ll rig hts re s e rve d .
IT’S A BIG DATA WORLD OUT THERE.
NOW LET’S BE SAFE.

Tamara.Dull@sas.com
@tamaradull
C o p yrig ht © 2 0 1 2 , S A S I ns titute I nc . A ll rig hts re s e rve d .
sas.com

You might also like