KEMBAR78
Big data use cases | PPTX
Big Data Use Cases
Releasing the Power of Data to Drive Business Value
3/20/2013


                                1
•   Russ Lankenau
•   Solutions Architect at MapR (Chicago)
•   http://github.com/rlankenau
•   @RussLankenau
•   rlankenau@maprtech.com


                     2
Mobile
           Virtualization        B2B
 Social
                        Application Service Provider
 Media
                    Cloud
             Web 2.0


      Client/Server

Software-as-a-Service
                              Service Bureau

                        3
4
Business Value

      5
Business Value
      6
Big Data is not new!
 but the tools are.



         7
Ship the Function to the Data

Traditional Architecture        Distributed Computing
          function
                                function   function   function
          RDBMS                  data       data       data


                                function   function   function
  data      data     data
                                 data       data       data
  data     data      data

  data     data      data       function   function   function

  data     data      data        data       data       data


  data     data      data
                                function   function   function

SAN/NAS                          data       data       data




                            8
Big Data Ecosystem




                     9
Use Case
 Company
 Data Source(s)
 Technique(s)
 Business Value
           10
Proactive Monitoring
        11
Data Sources

 Server Telemetry
 Monitoring Logs
 Network Flow


               12
Techniques

 Pattern Recognition
 Proactive Monitoring
 Early Alert Delivery


               13
Business Value




14
Telecommunications Giant




       ETL Offload
           15
Telecommunications
                              Data Sources

 Customer Records
 Contract Data
 Purchase Orders
 Call Center
                         16
Telecommunications
                          Techniques
             ETL            Analytics




                     17
Telecommunications
                                Techniques


                          +


 ETL (Hadoop)                 Analytics (Teradata)
                     18
Telecommunications
                          Business Value




                     19
Waste & Recycling Leader




      Idle Alerts
           20
Data Sources

   Truck Geolocation Data
– 20,000 trucks
– 5 sec interval
   Landfill Geographic Boundaries
                   21
Techniques
                        Realtime Stream Computation    Immediate
                                   (Storm)               Alerts


  Truck                     Batch Computation         Tax Reduction
              Hadoop
Geolocation                    (MapReduce)
              Storage                                   Reporting
   Data


                               Shortest Path             Route
                              Graph Algorithm         Optimization


                                 22
Business Value




23
Healthcare Analytics
     Data Lake
         24
Data Sources

 Claims
 Payments
 Patient Records


               25
Techniques
                                    Standard
                                   Analyst Tools
 Billing                    Hive

           Data Lake
Claims     (Hadoop)

                            Pig
Payment                              Ad-Hoc
Records                              Analysis



                       26
Business Value




27
Machine Learning
Search Relevance
 DNA Matching
      28
Data Sources

 Birth, Death, Census, Military, I
  mmigration records
 Search Behavior Activity
 DNA SNP (snips)
                 29
Techniques
 Record Linking
 Search Relevance
 Clickstream Behavior
 Security Forensics
 DNA Matching
               30
Business Value




31
Similar Characteristics
 Lots of Data
 Structured, Semi-Structured, Unstructured
 Varied Systems Interoperating
  – Hadoop, Storm, Solr, MPP, Visualizations


 Increase Revenue
 Decrease Costs

                      32
Thank You




            33

Big data use cases