Data & AI Modernization
IBM Cloud Pak for Data
The AI Ladder
A prescriptive approach to the journey to AI
INFUSE - Operationalize AI throughout the business
ANALYZE - Build and scale AI with trust and explainability
ORGANIZE - Create a business-ready analytics foundation
COLLECT - Make data simple and accessible
MODERNIZE - Make your data ready for an AI and hybrid multicloud world
One Platform, Any Cloud
Talent & Skills
IBM Uniquely Delivers the IA Foundation
System Admin Data Engineer Data Scientist Business Analyst
Unified: APIs Integrated User Experience
Extensible: Accelerators and Solutions
Modular: provision services and scale out when needed
The AI ladder
Collect & Connect Services
– Data virtualization & Connectors
– Provision SQL and NoSQL Databases & Warehouses
– Event Ingestion and Streaming Analytics
– Distributed compute with Apache Spark
Organize and Integrate Services
– Data Discovery and search
– Data transformation
– Data Curation
– Data cataloging and Classification
– Business glossary
– Policies and rules
– Data Profiling & Quality
Analyze and Infuse Services
– Data Science and Visualizations
– Dashboards and BI Reporting
– AutoAI, ML deployments and operations
– AI Trust and Transparency: Explainability and Bias detection
– AI services: NLU, Sentiment & Text analytics, Speech-to-text, Text-to-Speech, Chat interfaces
Foundational services [Bedrock]
– User Access Management
– Security Contexts & RBAC
– Service Provisioning
– Volume Management
– Manage: Monitor & Meter
– Scale
– Diagnostics
– Operator: Install, Patch & Upgrade
– Backup & Migrate
The IBM Data and AI Portfolio
Everything you need for enterprise AI, on any cloud
Pre-built Use Cases
Watson Applications
Prepare: Watson Knowledge Catalog
Build: Watson Studio
Run: Watson Machine Learning
Manage: Watson OpenScale
Hybrid Data Management: NPS & Db2 Family
Data Ops & Governance: InfoSphere Family
Business and technical services
Unified Hybrid Data and AI Platform
Cloud Pak for Data
Hyperconverged System
Deployment flexibility to run anywhere
A true hybrid multicloud strategy
Managed by Client
Managed by IBM/Vendor
Cloud Pak for Data v4.0 Packaging
Cloud Pak for Data Base Platform Services Cloud Pak for Data Cartridges
Db2 Warehouse Netezza Performance Server Db2 AESE Master Data Management
Data Virtualization Db2 Big SQL Informix Virtual Data Pipeline (Actifio)
IBM Streams Guardium Integration DataStage OpenPages
Watson Knowledge Catalog (including IGC) Data Management Console Information Server Open Data for Industries
Information Analyzer (included in WKC) Watson Machine Learning Accelerator Cognos Analytics Knowledge Accelerators
Watson Studio (includes Data Refinery) Data Privacy (Beta)
Planning Analytics Product Master NEW!
Watson Assistant
Watson Machine Learning (includes AutoAI) IBM Match360 with Watson
Watson Discovery
Watson OpenScale SPSS Modeler
Cognos Dashboards Embedded Decision Optimization
NEW! Watson Speech Services
Financial Crimes Insights
Analytics Engine for Apache Spark Hadoop Execution Engine
Financial Services Workbench
Collect - Make data simple and accessible
Cloud Pak for Data:
• Data Virtualization
• Db2 Warehouse
• Performance Server
• Streams
Netezza Performance Server
Highlights
• Simplicity: Minimal administration and tuning
• Scalable Hybrid Analytics: Petabyte scaling, independently scale compute and storage in cloud
• Seamless Data Integration: Built-in Data Virtualization to connect, manage and query data in place as one
• Resiliency for Business Continuity: Infrastructure resiliency, backup to object storage, replicated to multiple availability zones
Business Value
• Build once, Run anywhere: Flexible deployment options, no vendor lock-in, 100% compatibility, risk-free frictionless migration
• Faster Actionable insights: Blazing speeds, up to 3X performance and 2X concurrency improvement over legacy Netezza systems
• Data Science & ML at Scale: Operationalize AI with built-in Watson DS/ML and 200+ in-database algorithms
Use cases
On-premises, Cloud or Hybrid environments: Flexibility in choosing the deployment and consumption model, with license flexibility, that best suits the business needs
On Prem, IBM Cloud, AWS, Azure
Seamless on-ramp to Managed Cloud
Hybrid scenarios:
▪ Dev/test environments on Cloud: Easily spin up/down a cloud instance, seamlessly move data from on-premises to cloud instance with a single command
▪ Disaster recovery on Cloud: Back up to cloud object storage and restore to your Cloud data warehouse, switch over if disaster strikes
▪ Make the move to Cloud on your own terms: If Cloud is your strategic direction, start small, scale storage and compute independently, when you’re ready
Organize - Create a business-ready analytics foundation
Cloud Pak for Data:
• Watson Knowledge Catalog (including Information Analyzer, IGC)
• DataStage
Watson Knowledge Catalog
Highlights
• Enhanced usability and improved robustness and tuning for Data Quality Projects
• Define and manage additional custom attributes for custom and OOTB asset types
• Add user groups as collaborators in catalogs, categories, workflows and data protection rules
Business Value
• Speeds up metadata classification time for regulations by 90%.
• Infrastructure needs reduced 50% with end-to-end DataOps services on Cloud Pak for Data and up to 158% ROI
• Productivity increased by 95% when Watson Knowledge Catalog and other CP4D services are deployed.
Use Cases
• End-to-end data governance - Single integrated solution to serve customers’
data needs from data ingestion, governance, quality and consumption
• Self-service access to trusted data for analytics - Enable data consumers to
use a self-service, integrated experience to search through catalogs,
collaborate with other users and visualize, shape & analyze data
• Support regulatory compliance - Quickly discover and inventory assets into
the catalog, automatically classify and tag them with business terms to
detect sensitive data
Link: ibm.biz/wkc-sales-kit
DataStage and Information Server for Cloud Pak for Data
Highlights
• Dynamic configuration for DataStage and QualityStage jobs
• Modern Flow Designer design interface with improved performance
• Include unstructured data as part of data integration flow design
• Enhancement to Classic Designer (Windows Client) support
Value
• Up to 30% performance improvement when executing flows due to dynamic resource allocation
• Design environment performance improvements that boost user productivity
• Leverage existing job designs with benefits of containerized deployment
What’s new:
Improved performance in a modern interface on key aspects
• Slowly Changing Dimension (SCD) stage, a critical data movement task that tracks the history of dimension records or structured data: save operation performance improved by 52% in the new interface
• Leading and common targets / sources (Db2, Snowflake, Salesforce): save operation performance improved by 31% to 48%
• Azure Data Lake Store and Redshift connectors added to Flow Designer in Cloud Pak for Data
• SAP Packs supported via Classic Designer (Windows Client)
• Removal of the requirement for NodePort entries and addition of end-to-end encryption from Windows Client to CP4D cluster
More information here
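For readers unfamiliar with the Slowly Changing Dimension task named above, the following is a minimal sketch of the common "Type 2" variant, where a changed record is expired and a new version is appended so history is preserved. It is plain Python for illustration only and is not DataStage code; the `DimensionRow` schema, the `apply_scd2` helper and the sample customer data are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class DimensionRow:
    """One version of a dimension record (hypothetical schema)."""
    key: str                          # business key, e.g. a customer ID
    attributes: dict                  # tracked attributes
    valid_from: date
    valid_to: Optional[date] = None   # None marks the current version


def apply_scd2(history: List[DimensionRow], key: str,
               attributes: dict, as_of: date) -> List[DimensionRow]:
    """Apply one incoming record with Type 2 semantics: keep history, never overwrite."""
    current = next(
        (r for r in history if r.key == key and r.valid_to is None), None
    )
    if current is not None:
        if current.attributes == attributes:
            return history            # nothing changed, keep the current version
        current.valid_to = as_of      # expire the old version
    history.append(DimensionRow(key, dict(attributes), as_of))
    return history


history: List[DimensionRow] = []
apply_scd2(history, "C001", {"city": "Austin"}, date(2021, 1, 1))
apply_scd2(history, "C001", {"city": "Austin"}, date(2021, 3, 1))  # unchanged
apply_scd2(history, "C001", {"city": "Boston"}, date(2021, 6, 1))  # changed
for row in history:
    print(row.key, row.attributes, row.valid_from, row.valid_to)
```

The second call leaves the history untouched, while the third closes the Austin version and opens a Boston version, which is the history-tracking behavior the SCD stage is described as providing.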
Master Data Management
Highlights
• Utilize native REST APIs or IBM App Connect connectors to accelerate application integration
• Deploy one or more cache instances to service consumers
• Search across entities and traverse relationships for new insights or push data to your data warehouse
Business Value
• The majority of MDM workloads are read/inquiry type (typically 80-95%)
• Accelerated processing time: 3,000 TPS on the read side compared to 1,000 TPS (~200% increase)
• Easy to deploy, offers agility, and provides more scalability than on-premises software at an affordable cost
[Diagram: event flow with labels including Customer, Store, Lead, Contract, Commission, Cognitive Enrichment, Cart, Ticket and Invoice, marked New, Update or Updated]
Use cases/Capabilities
• Provide application developers with an instance of master data for faster time to market
• Support mobile and online applications requiring extremely low latency & high availability
• Utilize master data for downstream analytics
• Support applications needing more local access to global Master Data
• Set up data filters and enforce publishing policies to users and geographies as required
A modernized MDM isn’t just an investment in MDM’s premium capabilities but an investment in the Next Generation of MDM. This means operating on a platform with:
• New AI/ML driven entity match engine
• Auto configuration
• Data driven data model definition
• Completely new UX
• Tight integration with Watson Knowledge Catalog*
Link: https://ibm.seismic.com/Link/Content/DCaD39KjB-6USDZCh0Eg1oZw
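The throughput figures on the Master Data Management slide (3,000 TPS on the read side against 1,000 TPS) can be checked with a short calculation. The helper name `percent_increase` is introduced here for illustration; only the two TPS values come from the slide.

```python
def percent_increase(baseline: float, new: float) -> float:
    """Relative increase of `new` over `baseline`, in percent."""
    return (new - baseline) / baseline * 100.0


baseline_tps = 1000   # TPS figure quoted on the slide
read_tps = 3000       # read-side TPS figure quoted on the slide

increase = percent_increase(baseline_tps, read_tps)
speedup = read_tps / baseline_tps
print(f"{increase:.0f}% increase, {speedup:.0f}x throughput")
```

A 200% increase and a 3x throughput describe the same change, so the two phrasings should not be mixed up when quoting the number.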
Analyze - Build and scale AI with trust and explainability
Cloud Pak for Data:
• Watson Studio & Watson ML
• AutoAI
• Watson OpenScale
• Decision Optimization
Watson Studio and Watson Machine Learning
Custom Runtimes
Users can bring in libraries of their choice via custom images to analyze data, build models in notebooks or scripts and deploy in WML.
Business Value: provides the extensibility and flexibility required by data science teams to create AI solutions effectively.
AutoAI
Use the SDK to run AutoAI experiments programmatically, without the UI.
Tech Preview Feature: AutoAI supports multiple input datasets with configurable join relationships.
Business Value: New features make AutoAI suitable for automated and customized workflows through programming, saving users data preparation time when dealing with multiple datasets.
Watson Studio and Watson Machine Learning
• Use Python 3.7.9 with notebooks and scripts to build models and deploy them in production with Watson Machine Learning.
• Bitbucket Server and self-signed certificate support for Git integration
• Introducing Multi-Cloud Machine Learning to Cloud Pak for Data (Tech Preview)
• Business Value: Keep up to date with the latest innovations in open source for
your AI lifecycle. Use enhanced git integration for collaboration on your data
science projects. Train your machine learning models by utilizing the data
distributed across multiple parties or locations.
AutoAI
weeks
200%
53%
Provided by users: Raw Labeled Data Set
Automated by AutoAI:
• Data Prep: Finds best preprocessing imputation / encoding and scaling strategies
• Model selection: Finds top models
• Hyper Parameter Optimization: Optimize on selected models
• Feature Engineering: Finds best data transformation sequence
• Model Building: Optimize on models after Feature Engineering
Provided to users: Deployable pipeline, Python Notebook
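To make the staged search on the AutoAI slide more concrete, here is a toy sketch of the same idea: prepare the data, then try combinations of feature transformation and model family and keep the one that scores best on held-out data. It is not AutoAI's algorithm and does not use the AutoAI SDK; every function name, the candidate lists and the data are hypothetical, and hyperparameter optimization is left out to keep it short.

```python
from statistics import mean
from typing import List, Optional, Sequence, Tuple


def impute_mean(xs: Sequence[Optional[float]]) -> List[float]:
    """Data prep: replace missing values with the mean of the observed ones."""
    fill = mean(x for x in xs if x is not None)
    return [fill if x is None else x for x in xs]


def fit_constant(xs, ys):
    """Baseline model family: always predict the training mean."""
    c = mean(ys)
    return lambda x: c


def fit_linear(xs, ys):
    """Second model family: one-variable least squares."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx if sxx else 0.0
    intercept = my - slope * mx
    return lambda x: intercept + slope * x


def mse(model, xs, ys) -> float:
    """Mean squared error of a fitted model on the given points."""
    return mean((model(x) - y) ** 2 for x, y in zip(xs, ys))


def search(xs_raw, ys, n_train: int) -> Tuple[float, str, str]:
    """Try every (transformation, model family) pair; return the best holdout score."""
    xs = impute_mean(xs_raw)
    transforms = {"identity": lambda x: x, "square": lambda x: x * x}
    models = {"constant": fit_constant, "linear": fit_linear}
    best = None
    for t_name, transform in transforms.items():
        feats = [transform(x) for x in xs]
        for m_name, fit in models.items():
            model = fit(feats[:n_train], ys[:n_train])
            score = mse(model, feats[n_train:], ys[n_train:])
            if best is None or score < best[0]:
                best = (score, t_name, m_name)
    return best


# Toy data where y = x^2 and one input value is missing.
xs_raw = [1, 2, 3, None, 5, 6, 7]
ys = [1, 4, 9, 16, 25, 36, 49]
best_score, best_transform, best_model = search(xs_raw, ys, n_train=5)
print(best_transform, best_model, best_score)
```

On this data the search settles on the squared feature with the linear model, which is the kind of outcome the Feature Engineering and Model selection stages are meant to find automatically.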
Watson OpenScale
Highlights
• Role based user access
• Explainability enhancements including “what if” interactivity and improved understandability
• Monitor models for indirect bias
Business Value
• Share Watson OpenScale across multiple teams of business users and data scientists with appropriate content and function visible to each
• Enable end users of AI applications to better understand model decisions
• Detects potential fairness issues due to unseen correlations in input data
Use Cases
• Monitor production models to ensure accuracy, fairness and
explainability
• Ensure models continue to perform as expected over time by
detecting and evaluating impact of inputs drifting from data used to
build models
• Enable model validators and risk managers to run tests, compare
candidates, document results and determine when AI/ML models are
ready for production.
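As an illustration of what a fairness check on production model outcomes can look like, the sketch below computes a disparate impact ratio, a widely used fairness metric: the favorable-outcome rate of a monitored group divided by that of a reference group. This is a generic example, not Watson OpenScale's implementation or API. The group labels, the sample records and the 0.8 threshold (the common "four-fifths" convention) are assumptions made for the example.

```python
from typing import List, Sequence, Tuple


def favorable_rate(outcomes: Sequence[bool]) -> float:
    """Share of favorable outcomes in a group."""
    return sum(outcomes) / len(outcomes)


def disparate_impact(records: List[Tuple[str, bool]],
                     monitored: str, reference: str) -> float:
    """Favorable rate of the monitored group divided by that of the reference group."""
    mon = [fav for group, fav in records if group == monitored]
    ref = [fav for group, fav in records if group == reference]
    return favorable_rate(mon) / favorable_rate(ref)


# Hypothetical scored transactions: (group, model gave a favorable outcome)
records = (
    [("reference", True)] * 8 + [("reference", False)] * 2
    + [("monitored", True)] * 5 + [("monitored", False)] * 5
)

THRESHOLD = 0.8   # assumed alerting threshold (four-fifths convention)
ratio = disparate_impact(records, "monitored", "reference")
flagged = ratio < THRESHOLD
print(f"disparate impact = {ratio:.3f}, flagged = {flagged}")
```

A ratio well below 1.0 means the monitored group receives favorable outcomes less often than the reference group, which is the kind of signal a model validator or risk manager would review before approving a model for production.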
IBM Streams
Highlights
• Custom application resource templates
• Edge Analytics (beta) to analyze and act on data where it is created
• Auto-creation of Cloud Pak for Data service(s) from any point in a Streams application
Business Value
• Improved resource utilization by specifying vCPU and memory requirements for each application
• Millisecond latency can be achieved to act in the moment
• Immediate creation of OpenShift services for discovery and access to analytic applications over standard REST interfaces
Use Cases and Capabilities
• Intelligently collect data to analyze, filter and summarize real-time
data before landing it in persistent stores
• Agent Assist to convert speech to text and perform natural language
processing to provide recommendations to call center agents
• Real-time situational awareness infused with AI for hospital patients,
manufacturing devices and automobiles to improve operations
• Geospatial analytics to create alerts when people or things enter an
area of interest for marketing or safety concerns
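The geospatial use case above amounts to checking each incoming position against an area of interest and raising an alert on entry. The sketch below shows that logic in plain Python using the haversine great-circle distance. It is not written with the Streams toolkits, and the device names, coordinates, circular zone and the `entered_zone` helper are hypothetical.

```python
import math
from typing import Dict, Iterable, Iterator, Tuple

EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance between two points given in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def entered_zone(events: Iterable[Tuple[str, float, float]],
                 center: Tuple[float, float],
                 radius_km: float) -> Iterator[str]:
    """Yield a device ID each time it moves from outside the zone to inside.

    A device first seen inside the zone also counts as an entry.
    """
    inside: Dict[str, bool] = {}
    for device, lat, lon in events:
        now_inside = haversine_km(lat, lon, center[0], center[1]) <= radius_km
        if now_inside and not inside.get(device, False):
            yield device
        inside[device] = now_inside


# Hypothetical position stream: (device, latitude, longitude)
events = [
    ("truck-1", 40.100, -74.000),   # outside
    ("truck-1", 40.005, -74.000),   # enters the zone -> alert
    ("truck-1", 40.004, -74.000),   # still inside, no new alert
    ("truck-2", 40.000, -74.500),   # outside
    ("truck-1", 40.200, -74.000),   # leaves
    ("truck-1", 40.000, -74.001),   # enters again -> alert
]
alerts = list(entered_zone(events, center=(40.0, -74.0), radius_km=1.0))
print(alerts)
```

Tracking the previous inside/outside state per device is what turns a stream of positions into entry alerts rather than one alert per position report.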
Link: Streams Seismic Sales Kit
Infuse - Operationalize AI throughout the business
Cloud Pak for Data:
• Cognos Analytics
• Watson Apps
• Financial Crimes Insights
Cognos Analytics
Highlights
• Save time with automated data preparation, data discovery and dynamic visualizations and find deeper insights faster
• Execute complex queries faster and analyze data where it resides using data virtualization
• Create an analytics foundation by integrating business intelligence, predictive, prescriptive and planning analytics
Business Value
• Empowers users with AI-infused self-service capabilities
• Easily visualize data and share insights across your team to drive confident decisions
• Reduce the complexity of deploying and managing a BI environment to meet your users’ business needs
Use cases/Capabilities
• Get the self-service you expect, the data governance you require, and
the reporting you trust, with a secure business intelligence platform
• Deploy one AI-infused business intelligence platform for all analytics
use cases, from marketing campaign performance to human resources
analysis, customer sentiment analysis to sales pipeline analysis
• Ensure Managed Reporting Production workload SLAs are met with
confidence using modern container architecture
• Enable the most secure and compliant strategy with data governance for managed reporting and data exploration
Link: <to the detailed deep dive deck or recording>
Planning Analytics
Differentiators
• Adjust financial plans in real time across departments
• Protect your investment in Microsoft Excel while transcending limitations of spreadsheets
• Uncover deep insights through AI-infused planning, without the need for help from a data scientist
Business Value
• 63% time saved in completing annual budgeting cycles
• 80% faster planning system processing
• 20% time saved in completing forecasts
Plan for anything, be ready for everything
• Steer business performance by bridging operations and finance for
any department allowing you to adapt to changing business
conditions
• See impact before executing – explore what-if scenarios and assess
impact to determine the best course of action
• Make changes in real-time – pivot plans, budgets, and forecasts
quickly to meet changing demands and priorities
Watson pre-built Apps on Cloud Pak for Data
Watson Assistant
Watson Assistant Voice Interaction
Watson Discovery
Includes:
• Content Mining
• Content Intelligence
• Watson Knowledge Studio
Watson API Kit
Speech to Text, Text to Speech, NLU, WKS, Language Translator
Thank You!
IBM Cloud / April 2019 / © 2019 IBM Corporation