SAP Data Intelligence Strategy
SAP Data Intelligence Strategy
PUBLIC
PUBLIC
© 2022 SAP SE or an SAP affiliate company. All rights reserved. PUBLIC 1
Disclaimer
The information in this presentation is confidential and proprietary to SAP and may not be disclosed without the permission of SAP.
Except for your obligation to protect confidential information, this presentation is not subject to your license agreement or any other service
or subscription agreement with SAP. SAP has no obligation to pursue any course of business outlined in this presentation or any related
document, or to develop or release any functionality mentioned therein.
This presentation, or any related document and SAP's strategy and possible future developments, products and or platforms directions and
functionality are all subject to change and may be changed by SAP at any time for any reason without notice. The information in this
presentation is not a commitment, promise or legal obligation to deliver any material, code or functionality. This presentation is provided
without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a
particular purpose, or non-infringement. This presentation is for informational purposes and may not be incorporated into a contract. SAP
assumes no responsibility for errors or omissions in this presentation, except if such damages were caused by SAP’s intentional or gross
negligence.
All forward-looking statements are subject to various risks and uncertainties that could cause actual results to differ materially from
expectations. Readers are cautioned not to place undue reliance on these forward-looking statements, which speak only as of their dates,
and they should not be relied upon in making purchasing decisions.
01 - Data Integration
• Connectivity
• Replication & Federation
• Orchestration & Processing
07 - Analytics & Planning • Business Semantics Inheritance
• Business Intelligence
• Enterprise Planning
• Augmented Analytics DevOps & Security &
Lifecycle Authorization
Management
02 - Data Storage
• Data Tiering
Data Strategy • Data Persistency
• Multi-model data platform
Data
Democratization
06 - Data Sharing
Business Machine
•
Data Marketplace Learning
Content
• Role-based Data Spaces Access
03 - Data Cataloging
• Data Discovery & Exploration
SAP One Domain Model &
Master Data Management • Data Profiling
• Data Lineage
05 - Data Modelling
• Semantics Modelling
• Data Layer Modelling
• Data Spaces 04 - Data Preparation
• Data Quality
• Data Transformation
14
(Integration Suite,
Master Data
Business Intelligence Augmented Analytics Enterprise Planning
Governance, …)
data & analytics
Shared Services
Orchestration
Embedded
Data Catalog
engines
catalogs
SAP HANA Cloud
Non-SAP
SAP Applications Unstructured data Streaming data
Applications
© 2022 SAP SE or an SAP affiliate company. All rights reserved. PUBLIC 7
© 2022 SAP SE or an SAP affiliate company. All rights reserved. PUBLIC 8
SAP Data Intelligence
Overview
BW Standard
SAP BW/4 HANA SAP BW Hadoop / HDFS
Integration Connectors
(open & native
Data Integration protocols)
REST APIs
Data Ingestion / Data Enrichment / Data Workflows
SAP C/4HANA
Cloud Data Databases
Integration
… Public Clouds
Metadata explorer
SAP HANA SAP HANA SAP HANA SAP HANA SAP HANA
Modeler
Initialize HANA Table Operator that Initializes one or more tables on an SAP HANA database.
Read HANA Table Operator to read data from a table in SAP HANA.
Run HANA SQL Operator that executes user-provided SQL statements on an SAP HANA database.
Write to HANA Table Operator that executes user-provided SQL statements on an SAP HANA database.
Change Data Capture (CDC) is a Delta Capture Technique that uses triggers for Insert, Update, and
Table Replicator
Delete to track the change history for a specific table.
ABAP CDS Operator that supports initial load and delta replication of CDS views.
view reader
SLT connector Operator to communicate with SAP Landscape Transformation Replication Server;
operator leverage existing replication scenarios to bring data into SAP DI pipelines
Operator that can read from operational data provisioning out of SAP Business
ODP reader Warehouse or SAP BW/4HANA into SAP Data Intelligence
Custom ABAP Operator to implement your ABAP custom code that will be executed as part of a
operator pipeline in the connected ABAP system – for example, to call a function module
Integration requires a certain system level, planned for SAP S/4HANA 1909, SAP S/4HANA Cloud 1908, and SAP NetWeaver 7.00 with DMIS 2011/2018 Q4/2019 version, or higher.
Certain functionality can be made available for SAP Data Intelligence for certain release levels.
Read File Operator Operator that reads the content of files from various storage services.
Structured File Consumer Operator reads from any supported cloud storage.
The operator produces structured output, and you need to connect it to other operators from
Structured Data Operators category.
Operator receives data from any structured data operators and produces a file (CSV, ORC,
Structured File Producer
or PARQUET) in the specified storage.
Modeler
Explore your data assets, create and validate your models, and
orchestrate any mix of data processing engines, seamlessly within the
data pipelines that feed them
© 2022 SAP SE or an SAP affiliate company. All rights reserved. PUBLIC 22
SAP Data Intelligence – Data Processing
Search & Connection Data Data Model Integration into Automation &
Model Model Model
Browse /Storage Preparation Processing Validation Application Maintenance
Training Deployment
Data Management Creation
Connected Sources
Data discovery,
browsing and Business
searching Glossary
Data quality
Data profiling rules
Data quality
dashboards User ratings
© 2022 SAP SE or an SAP affiliate company. All rights reserved. PUBLIC *planned 32
Rolling out a catalog solution
Scaling the project scope along different axis
Rolling out to more USERS or LINE OF BUSINESS
or REGIONS or DEPARTMENTS
3rd Party
SAP HANA
Safeguard past investments
by reusing existing assets
SAP LT
SAP Replication
Integration Server
Cloud elasticity & agility
Suite
Included for free within DWC, with simplified resource Separate service, integrated ootb with DWC, with fine-
control grained resource control and autoscaling
Simplified no-code UI for citizen users, powered by the same Full-blown low-code UI to leverage the whole
technology as Data Intelligence Cloud functionality of the underlying pipeline engine
• Autoscaling
• Customers automatically scale their deployment based on resource
requirements and user defined resource limitations
• Billing and metering is recording system usage based on scale up /
down on an hourly basis
© 2022 SAP SE or an SAP affiliate company. All rights reserved. PUBLIC * Check the current limitations 38
SAP Data Intelligence
Estimating the required usage
Sizing Calculator: Link to online calculator
• Determine the required amount of Capacity Units based on the
chosen Data Intelligence services and usage volumes
• How to use?
• Data management & orchestration services:
• Users: Add the relevant amount of concurrent total and
Jupyter users
• Pipelines & Jobs: Add the relevant amount of active,
concurrent pipelines or jobs
Example: Execution of different pipelines and jobs to manage and orchestrate data
▪ 100 small pipelines (each 0.5 GB), 30 medium pipelines (each 1 GB) and 20 large pipelines (each 1.5 GB) -> Consumption of 100*0.5 GB
+ 30*1 GB + 20*1.5 GB = 110 GB memory
▪ 2 jobs extracting metadata into Metadata Explorer (each 1 GB), 2 jobs browsing and data profiling of connected systems (each 1 GB), 2
jobs executing data preparation processes of connected systems (each 1.5 GB) -> Consumption of 7 GB memory
▪ 3 concurrent users -> 8.7 GB of memory
▪ Memory consumption and nodes required
– Pipelines, data jobs and users: 125.7 GB memory
– Workload by basic system components (mandatory components to run the platform): 64 GB memory
– Total consumption of 189.7 GB memory -> Need for 6 Data Intelligence nodes (each 32 GB)
▪ Node hours per month: 6 * 720 hours (24 hours x 30 days) = 4,320 hours
▪ Capacity Units per month: 4,320h * 1.36 (Value of Data Intelligence Node) = 5,876 Capacity Units (rounded up)
SAP HANA Smart Data Integration SAP HANA Agile Data Preparation
On-prem On-prem
Synergy with HANA Get (even) Open Data Catalog and Unified & Hybrid
Cloud, DWC and SAC Smarter end-to-end Governance Technologies
Tighter integration of the BTP UD&A Apply ML and intelligence where possible. Extend metadata exchange and improve Fine tune consolidation and alignment
portfolio Augmented Data Integration & Active Metadata integration with 3rd party catalogs. Make data with on-prem EIM portfolio, enabling
& Data Governance, Automated Data quality rules smarter, and add enforcement for hybrid data fabric scenarios
Classification, Quality and Monetization data privacy rules. Improve collaboration
*This is the current state of planning and can be changed by SAP at any time
© 2022 SAP SE or an SAP affiliate company. All rights reserved. PUBLIC 43
SAP Data Intelligence
Q&A