Data Strategy and
Architecture
The modern data estate leverages the best of on-premises and cloud

[Diagram: Hybrid Ecosystem]
Your Data: Org Data, CRM, Graph, Image, Social, IoT
On-premises | Private cloud | Cloud
Management, Security, and Insights anywhere
Operational databases | Data Warehouses | Data Lakes
Key Benefits: Reason over data, anywhere | Flexibility of choice | Security and Performance
challenges
Continuum to unlock digital innovation…

[Diagram: Data Estate AND Application Dev tracks, running from Foundational to Innovation along an axis of strategic value & capabilities]
Modernization | Digital Transformation
Data Estate: Data migration, Data modernization, Advanced analytics, Data intelligence (AI/ML)
Application Dev: App migration, App modernization, Smart apps (infused with pre-built AI), Intelligent apps (cloud native)
Convergence
[Diagram: Data | Business Use Cases | Operating Model | Executive Strategy | Technical Capabilities]
Develop an executive strategy based on the 3 guiding
principles that will enable a modern Data Estate
Modern Data Estate

Executive Strategy
• Preparation (“Reduce Cost”): Reduce costs through preparation
• Agility: Increase your agility through trusted insights
• Resilience: Be more resilient to sudden change

People and Processes

Governance (“manage the system”)
• Charter
• Tenets
• Standards
• Data Quality
• Security
• Privacy
• Ethics

Architecture (“manage the container”)
• Data lake provisioning
• Master Data Management
• Metadata Management
• Common Data Model
• Data Access Management

Data Lifecycle (“manage the content”)
• Ingestion for compute,
• Handshaking
• Control file
• Discovery (Data Catalog)
• Lineage
• Data Contract
• Linkage (Merge Service)
• Classification
• Retention
Data strategy

Value unlock: Business Applications and Reporting
Technology enablers: Data Integration | Data Processing & ML | Data Access | Data Lakehouse
Foundations: Data Management and Governance
Reference data architecture

[Diagram layers]
Systems of engagement
Data access
Data storage, processing and analytics (Data lakehouse)
Data integration
Data management
Systems of record

Current requirements | Future requirements
“north star”
Description: Data architecture is an important organisational asset that can be lifted and shifted to enable several use cases.
Example: A reusable data pipeline that transforms data from Azure Data Lake Storage (ADLS) can be used for other use cases requiring similar patterns.

Description: Data that is ingested, stored and curated in the data lake and database can be utilised by use cases other than the one for which it was ingested.
Example: Leverage the rostering and planning data for both reporting and roster optimisation.
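To make the reuse idea concrete, here is a minimal sketch in Python of one curated dataset and one set of pipeline steps serving two use cases. Everything in it is an assumption for illustration: the helper names (build_pipeline, drop_incomplete, total_by), the column names and the sample rostering records are hypothetical and do not come from the actual solution.

```python
from collections import defaultdict

def build_pipeline(*steps):
    """Compose reusable steps into one callable pipeline."""
    def run(records):
        out = list(records)
        for step in steps:
            out = step(out)
        return out
    return run

def drop_incomplete(required):
    """Reusable step: keep only records with every required column populated."""
    def step(records):
        return [r for r in records if all(r.get(k) is not None for k in required)]
    return step

def total_by(key, value):
    """Reusable step: sum a numeric column per distinct value of a key column."""
    def step(records):
        totals = defaultdict(float)
        for r in records:
            totals[r[key]] += r[value]
        return [{key: k, value: v} for k, v in sorted(totals.items())]
    return step

# Hypothetical curated rostering data (illustrative values only).
roster = [
    {"team": "A", "shift": "day", "hours": 8.0},
    {"team": "A", "shift": "night", "hours": 4.0},
    {"team": "B", "shift": "day", "hours": None},
    {"team": "B", "shift": "night", "hours": 6.0},
]

clean = drop_incomplete(["team", "shift", "hours"])

# The same cleaning and aggregation steps serve two different use cases.
reporting = build_pipeline(clean, total_by("team", "hours"))
optimisation = build_pipeline(clean, total_by("shift", "hours"))

hours_per_team = reporting(roster)
hours_per_shift = optimisation(roster)
```

The design choice illustrated is that steps are written once against column names passed as parameters, so a new use case only needs a new composition, not new transformation code.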
We use the reference data architecture and requirements to
map the components in the solution architecture
Raw data PII columns will be hashed, and unnecessary columns will be removed prior to load in the data lake

Key Requirements
1.   Raw data, including Personally Identifiable Information (PII), will stay in the source systems.
2.   Azure Data Factory removes PII and performs pre-aggregation, if necessary, to de-identify the information in batch, e.g., daily or monthly.
3.   Streaming services remove PII and perform pre-aggregation, if necessary, to de-identify the information in real time.
4.   The de-identified data is stored in the data lake and data warehouse in the cloud.
5.   Databricks, Azure ML and Cognitive Services use de-identified data to perform big data analysis and machine learning.
6.   Azure Purview is used to catalog and govern data available on-premises and in the cloud.
7.   Data can be accessed via API or other data connectors.
8.   Power BI and/or Analysis Services is used to visualise the data in reports and dashboards.
9.   Business applications use the data access layer to source insights and data.
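The hashing and column-removal step described above could look roughly like the following Python sketch. It is not the Azure Data Factory implementation: the function names, column names, sample record and key are all hypothetical, and the use of a keyed hash (HMAC-SHA256) rather than a plain hash is an assumption made here so that hashed values cannot be reproduced without the key.

```python
import hashlib
import hmac

def hash_pii(value: str, key: bytes) -> str:
    """Return a keyed SHA-256 digest (hex) of a PII value."""
    return hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()

def deidentify(record: dict, pii_columns: set, drop_columns: set, key: bytes) -> dict:
    """Hash PII columns and remove unnecessary columns from one record."""
    out = {}
    for column, value in record.items():
        if column in drop_columns:
            continue  # unnecessary column: never loaded into the data lake
        if column in pii_columns and value is not None:
            out[column] = hash_pii(str(value), key)
        else:
            out[column] = value
    return out

# Hypothetical source record and configuration (illustrative only).
SECRET_KEY = b"replace-with-a-managed-secret"
raw = {
    "member_id": "12345",
    "email": "a@example.com",
    "free_text_notes": "call back Tuesday",
    "balance": 1000.0,
}

clean = deidentify(
    raw,
    pii_columns={"member_id", "email"},
    drop_columns={"free_text_notes"},
    key=SECRET_KEY,
)
```

Because the hash is deterministic for a given key, the same member_id always maps to the same token, so de-identified records can still be joined across datasets.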
ALM use case (phase 1) involves portfolio data for retail, EBP
and non-EBP extracted in batch
Thank you.