LECTURE - 11
DATA RESOURCE
               MANAGEMENT
James A. O'Brien, and George Marakas
  Management Information Systems
                      Chapter 5 Data Resource ManagementChapter 5   2
Data Resource Management
Data Resource Management (or Data Administration) is
an organizational function working in the areas of
information systems and computer science that plans,
organizes, describes and controls data resources.
•Data resource management is a managerial activity
  • Uses data management, data warehousing,
    and other IS technologies
  • Manages data resources to meet the information
    needs of business stakeholders
                      Chapter 5 Data Resource ManagementChapter 5   3
Data Resource Management
• Data Stewards: A data steward is a person responsible
for the management and fitness of data elements - both
the content and metadata.
• A data steward is a job role that involves planning,
implementing and managing the sourcing, use and
maintenance of data assets in an organization. Data
stewards enable an organization to take control and
govern all the types and forms of data and their
associated libraries or repositories.
                                 Chapter 5 Data Resource ManagementChapter 5   4
Distributed Databases
• Distributed databases are copies or parts of databases stored on servers at
    multiple locations
     • Improves database performance at worksites
•   Advantages
     • Protection of valuable data
     • Data can be distributed into smaller databases
     • Each location has control of its local data
     • All locations can access any data, any where
•   Disadvantages
     • Maintaining data accuracy
•   Replication
     • Look at each distributed database and find changes
     • Apply changes to each distributed database
     • Very complex
•   Duplication
     • One database is master
     • Duplicate the master after hours, in all locations
     • Easier to accomplish
                      Chapter 5 Data Resource ManagementChapter 5   5
Operational Databases (ODB)
• An operational database is a database that is used to
  manage and store data in real time. ODB containing
  up-to-date, modifiable data, in contrast to a decision
  support database.
• Also called subject area databases (SADB), transaction
  databases, and production databases
   • Database examples: customer, human resource,
     inventory, SCM etc.
                    Chapter 5 Data Resource ManagementChapter 5   6
External Databases
• Databases available for a fee from commercial
online services, or free from the Web
 • Example: hypermedia databases, statistical
   databases, bibliographic and full text databases
 • Search engines like Google or Yahoo are
   external databases
                                    Chapter 5 Data Resource ManagementChapter 5   7
  Data Warehouses
• Organizations are required to keep all relevant data for several years for analyzing
 and comparing the current year data with that of the past years which also makes
 it easier to determine key trends taking place. All this data from previous years are
 stored in a database warehouse.
• Since the data stored has gone through all kinds of screening, editing and
 integration, it does not need any further editing or alteration.
• Central source of data that has been cleaned, transformed, and cataloged.
• Data is used for data mining, analytical processing, analysis, research, decision
 support
• Data warehouses may be divided into data marts
  • Subsets of data that focus on specific aspects of a company (department or
    business process)
                        Chapter 5 Data Resource ManagementChapter 5   8
Data Marts
• A data mart is a simple form of a data warehouse that is
 focused on a single subject (or functional area), such as
 Sales or Finance or Marketing.
              Data Warehouse Vs. Data Mart
          Chapter 5 Data Resource ManagementChapter 5   9
Applications and Data Marts
                          Chapter 5 Data Resource ManagementChapter 5   10
Data Mining
• Data mining is the process of finding anomalies, patterns
 and correlations within large data sets to predict
 outcomes. Using a broad range of techniques, we can use
 this information to increase revenues, cut costs, improve
 customer relationships, reduce risks and more.
• What Data Mining can do?
• Market segmentation - Identify the common characteristics of
  customers who buy the same products from the company.
• Customer churn - Predict which customers are likely to leave
  the company and go to a competitor.
• Fraud detection - Identify which transactions are most likely to
  be fraudulent.
                             Chapter 5 Data Resource ManagementChapter 5   11
What Data Mining can do?
• Direct marketing - Identify which prospects should be included in a
 mailing list to obtain the highest response rate.
• Interactive marketing - Predict what each individual accessing a
 Web site is most likely interested in seeing.
• Market basket analysis - Understand what products or services are
 commonly purchased together; e.g., beer and diapers.
• Trend analysis - Reveal the difference between a typical customer
 this month and last.
                  Chapter 5 Data Resource ManagementChapter 5   12
Traditional File Processing
• Data are organized, stored, and processed in
  independent files
   • Each business application designed to use
     specialized data files containing specific
     types of data records
• Problems
   • Data redundancy
   • Lack of data integration
   • Data dependence (files, storage devices,
     software)
   • Lack of data integrity or standardization
                             Chapter 5 Data Resource ManagementChapter 5   13
Database Management Approach
• In order to remove all limitations of the File Based Approach, a new
  approach was required that must be more effective known as
  Database approach.
• The foundation of modern methods of managing organizational data
   • Consolidates data records formerly in separate files into databases
   • Data can be accessed by many different application programs
   • A database management system (DBMS) is the software interface
     between users and databases
   A database management system (DBMS) is a software that: Acts
     as an interface between application programs and the data files.
     Separates the logical view of the database (how the data is
     perceived by end users) and the physical view (how the data is
     actually organized on storage media).
                     Chapter 5 Data Resource ManagementChapter 5   14
Database Management System
• Database Development
 • Defining and organizing the content,
     relationships, and structure of the data needed
     to build a database
• Database Application Development
   • Using DBMS to create prototypes of queries,
     forms, reports, Web pages
• Database Maintenance
   • Using transaction processing systems and other
     tools to add, delete, update, and correct data
       Chapter 5 Data Resource ManagementChapter 5   15
DBMS Major Functions
                             Chapter 5 Data Resource ManagementChapter 5   16
Database Interrogation
• Database Interrogation: Capability of a DBMS to report information from
  the database in response to End-Users’ request.
• Query Language: Allows easy, immediate access to ad hoc requests.
• SQL Queries
• Query form is SELECT…FROM…WHERE…
                            Chapter 5 Data Resource ManagementChapter 5   17
Questions
• Define Data Resource Management with its activities.3
• What is Data stewards and what capabilities a Data Stewards should
  have? 3
• Define Distributed Databases with advantages and disadvantages.4
• Explain Data warehouse, Data mining and Mata Marts with example.
  3
• How do you differentiate between Data Warehouse and Data Mart? 3
• Make a list of what Data Mining can do? 4
• Graphically represent the relation between Data Marts and their
  applications. 3