KEMBAR78
Bin3 Open Source BI, overhyped or undervalued? | PDF
Open Source Business Intelligence



                               or
                        yp ed
                     rh          d?
                 Ove     va lu e
                   nd er
                  U



                                      1
Jos van Dongen
> 20 yrs BI
Principal Consultant
Author/Speaker/Analyst
Proud member of #BBBT




Web:        www.tholis.com
Email:      jos<at>tholis.com
Phone:      +31-(0)6-51169606
Skype:      tholis.jos
LinkedIn:   jvdongen
Twitter:    josvandongen
IRC:        _grumpy
Java Server Pages
                                   JFreeReports




OpenFlashCharts                           Dashboard Framework



             JPivot

                                                    Mondrian
         MySQL
Key message(s)




                 6
The Industry Radar Screens




     Forrester Wave for BI, Q4 2010


                                      Gartner BI Magic Quadrant




                                                                  7
The value of the Radar Screens




                   Source: Doug Laney, May 13, 2008
                                                      8
Open Source Everywhere
      “By 2012, 80 percent of all
 commercial software will include
    elements of open-source
          technology                            ”
Many open-source technologies are mature, stable and well supported.
They provide significant opportunities for vendors and users to lower
their total cost of ownership and increase returns on investment.

Ignoring this will put companies at a serious competitive disadvantage.
Embedded open source strategies will become the minimal level of
investment that most large software vendors will find necessary to
maintain competitive advantages during the next five years.
Gartner Group, 2008


                                                                          9
Time to wake up!




                   10
Open Source Disrupts the Market
                                      Overkill
                                                                 st   omers
                                                High demanding cu




              Dis
                               What Customers Want
                  rup
                        tio
                           n
                                                                 stomers
                                                Low demanding cu




                               Perceived as a Toy



                                                                 Time
           Source: The Innovators Dilemma, Clayton Christensen
What is Open Source?
Informal:
If the software has an Open Source Licence, it's Open Source

Formal:
1. Free Redistribution
2. Source Code
3. Derived Works
4. Integrity of The Author's Source Code
5. No Discrimination Against Persons or Groups
6. No Discrimination Against Fields of Endeavor
7. Distribution of License
8. License Must Not Be Specific to a Product
9. License Must Not Restrict Other Software
10. License Must Be Technology-Neutral

                                                               12
Levels of Freedom

                                      Academic
                                      Licenses


                         Reciprocal
                         Licenses


            'Freeware'
            licences


Closed
Commerial
license




                                                 13
Licensing issues?
        B:BSD
                     AB:GPL




                    BC:Closed
        A:GPL



                     AC:GPL

       C:Closed




                                14
Why Open Source?




            Source:
            Open Source Adoption
            in the BI Market
            3rd Nature, 2009
                               15
...And Why Not?
                            Problem ReportedbyRespondents
                                   s

                Missing or incomplete features                                           72%

                           Scalabilityproblems                  34%

Required more internal expertise than expected                  32%

 Difficultyintegrating into current environment             29%

           Difficulty finding available solutions           28%

                           Reliability problems           25%

                   Lack of available consulting       21%
                                                                      Source:
                     Interoperability problems       19%              Open Source Adoption
                                                                      in the BI Market
                 Higher costs than anticipated      18%               3rd Nature, 2009

              Lack of vendor service or support     16%


T he biggest r eason is maturity of the softwar e.

                                                                                               16
The Open Core Dilemma
Open Source Cost Savings




   Source: Lowering the Cost of BI With Open Source, 3 rd Nature, 2010

                                                                         18
Open Source Cheap? It depends...
Based on 2010 public list prices




                                   19
Open Source BI Stack Maturity
     Portals                    GIS                    Search               Office
                        Information Delivery & Presentation
                         Information Delivery & Presentation


   Data Mining      Statistics                  Visualization            Text Mining
                                  (Advanced) Analytics
                                  (Advanced) Analytics


    Reports          OLAP                 Ad-hoc            Dashboards      CPM
                                  Reporting & Analysis
                                  Reporting & Analysis


     DBMS           Profiling           Data Quality         Modeling       MDM
                                      Data Management
                                      Data Management


      ETL              EAI                                      EII
                                 Information Integration
                                  Information Integration

     Operating Systems, Application
NOIV Jaarcongres 2010 Tholis Consulting      Servers, Programming languages
                                                                          20



                                                                                       20
#BigData, the new frontier




Yes, these (and more) ar e all Open Sour ce!


                                               21
BI Suites Overview




                     22
The 'BIG 4'
⇨ Palo, Jaspersoft & Pentaho: Community &
  Professional/Enterprise Editions
⇨ SpagoBI only real FOSS platform

    ⇨   Is licensed under LGPL (Yes, that's LGPL)
    ⇨   Has integrated DTAP migration tools
    ⇨   Can integrate multiple engines:




                                                    23
⇨   Project by Italian system integrator & software engineer
    ⇨   Engineering Ingegneria Informatica
⇨   Company founded 1980, listed on Milan Stock exchange
⇨   Invested €65 Mln from 2003-2006 in 40 R&D projects, including Spago BI
    (started 2004)
⇨   Highlights:
    ⇨   Only FOSS BI platform!
    ⇨   > 100 production sites
⇨
    Strengths:
    ⇨   Multi engine platform
    ⇨   Only OS BI tool to address KPI, Metadata management & Geo Info
⇨
    Weaknesses:
    ⇨   Support only from EII: Italian based

                                                                             24
SpagoBI Conceptual Schema




                            25
SpagoBI KPI Module




                     26
⇨
    2004: Company founded by BI industry veterans
⇨   Adopted existing projects (JFreeReport, Kettle, Mondrian, JPivot, Weka)
⇨   BI Platform is own development. 1st release: 2005
⇨
    Total funding to date: $33Mln

⇨   Highlights:
    ⇨   Plug-in architecture (extensibility)
    ⇨   Community Edition contains full BI Suite
⇨   Strengths:
    ⇨   Employment of component Chief Architects
    ⇨   Very strong & active community
⇨
    Weaknesses:
    ⇨   Diversity of components; aimed at developers
                                                                              27
End to End BI
Reporting
  Operational, Production
  Embedded
  Web-based Ad-hoc
Analysis
  Interactive slice, dice, and drill
  Web-based or Excel
Dashboards
  KPIs
  Mash-ups
Data Integration / ETL
Data Mining
BI Platform
  Scheduling & bursting
  Notification
  Content sharing
  Security integration


                                                   28
Highlights




             29
⇨   2001: JasperReports project started by 1 developer; Panscopic founded
⇨   2004: Panscopic adopts JasperReports, changes name to JasperSoft & goes OS
⇨   Adopted existing projects (iReport, Mondrian, JPivot, Talend)
⇨   BI platform is own development, total funding to date $45.5Mln

⇨   Highlights:
     ⇨   JasperBabylon (multi-language model)
     ⇨   Ingres Icebreaker appliance
⇨   Strenghts:
     ⇨   Complex, operational reporting
     ⇨   Scalability to >1.000 users
⇨   Weaknesses:
     ⇨   No control over main platform components (Analysis, ETL)
     ⇨   More and more a proprietary vendor with a free edition
                                                                                 30
Portfolio




            31
Jaspersoft Dashboard




                       32
Component based platforms




                            33
Palo BI Suite
⇨   2002: Jedox Founded by Kristian Raue
⇨   Main promise: “Excel without Hell”
⇨   BI platform is own development

⇨   Highlights:
     ⇨   Palo in memory OLAP database
     ⇨   Easy to use web-based ETL & modeling
⇨   Strenghts:
     ⇨   Budgetting, planning & analysis including write back
     ⇨   Excel & OpenOffice Calc as front-end
     ⇨   SAP/R3 connectors (Palo as BW alternative)
⇨   Weaknesses:
     ⇨   BI is more than cubes
     ⇨   Worksheet Server 3 still very buggy (as of Feb. 2010)
                                                                 34
PALO Overview




                35
Layered Architecture




                       36
Palo OS Ecosystem
                         PalOOCa
                          plugin




Palo ETL Server
                           Jpalo
                            web
                  PALO
                           client




                          Jpalo client




                                         37
Is Open Source BI for you?
          OR             ?
                             CSF
                             No 1!
    ⇨ Business Case
    ⇨ BI Maturity


    ⇨ Internal Skills


    ⇨ Culture


    ⇨ Infrastructure


    ⇨ Applications


    ⇨ Vendors


    ⇨ Support Partners


                                     38
Open Source BI To Do
⇨ Performance Management
⇨ Vertical/horizontal applications


⇨ Sharepoint & MS/Open Office integration


⇨ Collaboration (users & developers)


⇨ Interactive Visual Analysis


⇨ Drag & drop free form reporting


⇨ Meta data integration, lineage, impact


⇨ Books & documentation


⇨ Consulting, training, support



                                            39
Recommendations
1.Don't focus solely on cost savings.
  People did not mention as up-front
  reasons many of the benefits they
  discovered later.
2.Plan to augment, not replace,
  existing software with open source.
  Rather than trying to saving money by
  replacing software, look at gaps in the
  BI portfolio or data warehouse stack
  and use open source to supplement
  your systems.

  Source: Mark Madsen, Third Nature




                                            40
Recommendations
3.Consider developing open source
  policies. Most organizations are
  adopting open source in an ad-hoc
  fashion, project by project.
4.Evaluate open source like any
  other software. It doesn't matter if
  the software is free if it takes longer
  to build, manage and deploy
  solutions to end users, if it is
  unstable, or if it is missing a key
  feature
5.Make open source the default
  option. When there are no internal
  tools, open source should be the
  first alternative.
  Source: Mark Madsen, Third Nature



                                            41
Bin3 Open Source BI, overhyped or undervalued?

Bin3 Open Source BI, overhyped or undervalued?

  • 1.
    Open Source BusinessIntelligence or yp ed rh d? Ove va lu e nd er U 1
  • 2.
    Jos van Dongen >20 yrs BI Principal Consultant Author/Speaker/Analyst Proud member of #BBBT Web: www.tholis.com Email: jos<at>tholis.com Phone: +31-(0)6-51169606 Skype: tholis.jos LinkedIn: jvdongen Twitter: josvandongen IRC: _grumpy
  • 4.
    Java Server Pages JFreeReports OpenFlashCharts Dashboard Framework JPivot Mondrian MySQL
  • 6.
  • 7.
    The Industry RadarScreens Forrester Wave for BI, Q4 2010 Gartner BI Magic Quadrant 7
  • 8.
    The value ofthe Radar Screens Source: Doug Laney, May 13, 2008 8
  • 9.
    Open Source Everywhere “By 2012, 80 percent of all commercial software will include elements of open-source technology ” Many open-source technologies are mature, stable and well supported. They provide significant opportunities for vendors and users to lower their total cost of ownership and increase returns on investment. Ignoring this will put companies at a serious competitive disadvantage. Embedded open source strategies will become the minimal level of investment that most large software vendors will find necessary to maintain competitive advantages during the next five years. Gartner Group, 2008 9
  • 10.
  • 11.
    Open Source Disruptsthe Market Overkill st omers High demanding cu Dis What Customers Want rup tio n stomers Low demanding cu Perceived as a Toy Time Source: The Innovators Dilemma, Clayton Christensen
  • 12.
    What is OpenSource? Informal: If the software has an Open Source Licence, it's Open Source Formal: 1. Free Redistribution 2. Source Code 3. Derived Works 4. Integrity of The Author's Source Code 5. No Discrimination Against Persons or Groups 6. No Discrimination Against Fields of Endeavor 7. Distribution of License 8. License Must Not Be Specific to a Product 9. License Must Not Restrict Other Software 10. License Must Be Technology-Neutral 12
  • 13.
    Levels of Freedom Academic Licenses Reciprocal Licenses 'Freeware' licences Closed Commerial license 13
  • 14.
    Licensing issues? B:BSD AB:GPL BC:Closed A:GPL AC:GPL C:Closed 14
  • 15.
    Why Open Source? Source: Open Source Adoption in the BI Market 3rd Nature, 2009 15
  • 16.
    ...And Why Not? Problem ReportedbyRespondents s Missing or incomplete features 72% Scalabilityproblems 34% Required more internal expertise than expected 32% Difficultyintegrating into current environment 29% Difficulty finding available solutions 28% Reliability problems 25% Lack of available consulting 21% Source: Interoperability problems 19% Open Source Adoption in the BI Market Higher costs than anticipated 18% 3rd Nature, 2009 Lack of vendor service or support 16% T he biggest r eason is maturity of the softwar e. 16
  • 17.
  • 18.
    Open Source CostSavings Source: Lowering the Cost of BI With Open Source, 3 rd Nature, 2010 18
  • 19.
    Open Source Cheap?It depends... Based on 2010 public list prices 19
  • 20.
    Open Source BIStack Maturity Portals GIS Search Office Information Delivery & Presentation Information Delivery & Presentation Data Mining Statistics Visualization Text Mining (Advanced) Analytics (Advanced) Analytics Reports OLAP Ad-hoc Dashboards CPM Reporting & Analysis Reporting & Analysis DBMS Profiling Data Quality Modeling MDM Data Management Data Management ETL EAI EII Information Integration Information Integration Operating Systems, Application NOIV Jaarcongres 2010 Tholis Consulting Servers, Programming languages 20 20
  • 21.
    #BigData, the newfrontier Yes, these (and more) ar e all Open Sour ce! 21
  • 22.
  • 23.
    The 'BIG 4' ⇨Palo, Jaspersoft & Pentaho: Community & Professional/Enterprise Editions ⇨ SpagoBI only real FOSS platform ⇨ Is licensed under LGPL (Yes, that's LGPL) ⇨ Has integrated DTAP migration tools ⇨ Can integrate multiple engines: 23
  • 24.
    Project by Italian system integrator & software engineer ⇨ Engineering Ingegneria Informatica ⇨ Company founded 1980, listed on Milan Stock exchange ⇨ Invested €65 Mln from 2003-2006 in 40 R&D projects, including Spago BI (started 2004) ⇨ Highlights: ⇨ Only FOSS BI platform! ⇨ > 100 production sites ⇨ Strengths: ⇨ Multi engine platform ⇨ Only OS BI tool to address KPI, Metadata management & Geo Info ⇨ Weaknesses: ⇨ Support only from EII: Italian based 24
  • 25.
  • 26.
  • 27.
    2004: Company founded by BI industry veterans ⇨ Adopted existing projects (JFreeReport, Kettle, Mondrian, JPivot, Weka) ⇨ BI Platform is own development. 1st release: 2005 ⇨ Total funding to date: $33Mln ⇨ Highlights: ⇨ Plug-in architecture (extensibility) ⇨ Community Edition contains full BI Suite ⇨ Strengths: ⇨ Employment of component Chief Architects ⇨ Very strong & active community ⇨ Weaknesses: ⇨ Diversity of components; aimed at developers 27
  • 28.
    End to EndBI Reporting Operational, Production Embedded Web-based Ad-hoc Analysis Interactive slice, dice, and drill Web-based or Excel Dashboards KPIs Mash-ups Data Integration / ETL Data Mining BI Platform Scheduling & bursting Notification Content sharing Security integration 28
  • 29.
  • 30.
    2001: JasperReports project started by 1 developer; Panscopic founded ⇨ 2004: Panscopic adopts JasperReports, changes name to JasperSoft & goes OS ⇨ Adopted existing projects (iReport, Mondrian, JPivot, Talend) ⇨ BI platform is own development, total funding to date $45.5Mln ⇨ Highlights: ⇨ JasperBabylon (multi-language model) ⇨ Ingres Icebreaker appliance ⇨ Strenghts: ⇨ Complex, operational reporting ⇨ Scalability to >1.000 users ⇨ Weaknesses: ⇨ No control over main platform components (Analysis, ETL) ⇨ More and more a proprietary vendor with a free edition 30
  • 31.
  • 32.
  • 33.
  • 34.
    Palo BI Suite ⇨ 2002: Jedox Founded by Kristian Raue ⇨ Main promise: “Excel without Hell” ⇨ BI platform is own development ⇨ Highlights: ⇨ Palo in memory OLAP database ⇨ Easy to use web-based ETL & modeling ⇨ Strenghts: ⇨ Budgetting, planning & analysis including write back ⇨ Excel & OpenOffice Calc as front-end ⇨ SAP/R3 connectors (Palo as BW alternative) ⇨ Weaknesses: ⇨ BI is more than cubes ⇨ Worksheet Server 3 still very buggy (as of Feb. 2010) 34
  • 35.
  • 36.
  • 37.
    Palo OS Ecosystem PalOOCa plugin Palo ETL Server Jpalo web PALO client Jpalo client 37
  • 38.
    Is Open SourceBI for you? OR ? CSF No 1! ⇨ Business Case ⇨ BI Maturity ⇨ Internal Skills ⇨ Culture ⇨ Infrastructure ⇨ Applications ⇨ Vendors ⇨ Support Partners 38
  • 39.
    Open Source BITo Do ⇨ Performance Management ⇨ Vertical/horizontal applications ⇨ Sharepoint & MS/Open Office integration ⇨ Collaboration (users & developers) ⇨ Interactive Visual Analysis ⇨ Drag & drop free form reporting ⇨ Meta data integration, lineage, impact ⇨ Books & documentation ⇨ Consulting, training, support 39
  • 40.
    Recommendations 1.Don't focus solelyon cost savings. People did not mention as up-front reasons many of the benefits they discovered later. 2.Plan to augment, not replace, existing software with open source. Rather than trying to saving money by replacing software, look at gaps in the BI portfolio or data warehouse stack and use open source to supplement your systems. Source: Mark Madsen, Third Nature 40
  • 41.
    Recommendations 3.Consider developing opensource policies. Most organizations are adopting open source in an ad-hoc fashion, project by project. 4.Evaluate open source like any other software. It doesn't matter if the software is free if it takes longer to build, manage and deploy solutions to end users, if it is unstable, or if it is missing a key feature 5.Make open source the default option. When there are no internal tools, open source should be the first alternative. Source: Mark Madsen, Third Nature 41