KEMBAR78
A Comprehensive Approach To Data Warehouse Testing | PDF | Conceptual Model | Data Warehouse
0% found this document useful (0 votes)
542 views8 pages

A Comprehensive Approach To Data Warehouse Testing

Testing is an essential part of the design life-cycle of any software product. This paper introduces a number of data warehousespecific testing activities. Data validation is one of the main goals of data warehouse testing.

Uploaded by

Dharmeshwar M
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
542 views8 pages

A Comprehensive Approach To Data Warehouse Testing

Testing is an essential part of the design life-cycle of any software product. This paper introduces a number of data warehousespecific testing activities. Data validation is one of the main goals of data warehouse testing.

Uploaded by

Dharmeshwar M
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

A Comprehensive Approach to Data Warehouse Testing

Matteo Golfarelli Stefano Rizzi


DEIS - University of Bologna DEIS - University of Bologna
Via Sacchi, 3 VIale Risorgimento, 2
Cesena, Italy Bologna, Italy
matteo.golfarelli@unibo.it stefano.rizzi@unibo.it

ABSTRACT warehouse testing is to know the data and what the


Testing is an essential part of the design life-cycle of any answers to user queries are supposed to be.
software product. Nevertheless, while most phases of data • Differently from generic software systems, data ware-
warehouse design have received considerable attention in the house testing involves a huge data volume, which sig-
literature, not much has been said about data warehouse nificantly impacts performance and productivity.
testing. In this paper we introduce a number of data mart-
specific testing activities, we classify them in terms of what • Data warehouse testing has a broader scope than soft-
is tested and how it is tested, and we discuss how they can ware testing because it focuses on the correctness and
be framed within a reference design methodology. usefulness of the information delivered to users. In
fact, data validation is one of the main goals of data
Categories and Subject Descriptors warehouse testing.
H.4.2 [Information Systems Applications]: Types of • Though a generic software system may have a large
Systems—Decision support; D.2.5 [Software Engineering]: number of different use scenarios, the valid combina-
Testing and Debugging tions of those scenarios are limited. On the other hand,
data warehouse systems are aimed at supporting any
General Terms views of data, so the possible combinations are virtu-
ally unlimited and cannot be fully tested.
Verification, Design
• While most testing activities are carried out before de-
Keywords ployment in generic software systems, data warehouse
testing activities still go on after system release.
data warehouse, testing
• Typical software development projects are self-contained.
1. INTRODUCTION Data warehousing projects never really come to an end;
it is very difficult to anticipate future requirements for
Testing is an essential part of the design life-cycle of any
the decision-making process, so only a few require-
software product. Needless to say, testing is especially crit-
ments can be stated from the beginning. Besides, it
ical to success in data warehousing projects because users
is almost impossible to predict all the possible types
need to trust in the quality of the information they access.
of errors that will be encountered in real operational
Nevertheless, while most phases of data warehouse design
data. For this reason, regression testing is inherently
have received considerable attention in the literature, not
involved.
much has been said about data warehouse testing.
As agreed by most authors, the difference between test- Like for most generic software systems, different types of
ing data warehouse systems and generic software systems or tests can be devised for data warehouse systems. For in-
even transactional systems depends on several aspects [21, stance, it is very useful to distinguish between unit test,
23, 13]: a white-box test performed on each individual component
considered in isolation from the others, and integration test,
• Software testing is predominantly focused on program
a black-box test where the system is tested in its entirety.
code, while data warehouse testing is directed at data
Also regression test, that checks that the system still func-
and information. As a matter of fact, the key to data
tions correctly after a change has occurred, is considered to
be very important for data warehouse systems because of
their ever-evolving nature. However, the peculiar character-
Permission to make digital or hard copies of all or part of this work for istics of data warehouse testing and the complexity of data
personal or classroom use is granted without fee provided that copies are warehouse projects ask for a deep revision and contextualiza-
not made or distributed for profit or commercial advantage and that copies tion of these test types, aimed in particular at emphasizing
bear this notice and the full citation on the first page. To copy otherwise, to the relationships between testing activities on the one side,
republish, to post on servers or to redistribute to lists, requires prior specific design phases and project documentation on the other.
permission and/or a fee.
DOLAP’09, November 6, 2009, Hong Kong, China. From the methodological point of view we mention that,
Copyright 2009 ACM 978-1-60558-801-8/09/11 ...$10.00. while testing issues are often considered only during the very

17
last phases of data warehouse projects, all authors agree that 2. RELATED WORKS
advancing an accurate test planning to the early projects The literature on software engineering is huge, and it in-
phases is one of the keys to success. The main reason for cludes a detailed discussion of different approaches to the
this is that, as software engineers know very well, the ear- testing of software systems (e.g., see [16, 19]). However,
lier an error is detected in the software design cycle, the only a few works discuss the issues raised by testing in the
cheapest correcting that error is. Besides, planning early data warehousing context.
testing activities to be carried out during design and before [13] summarizes the main challenges of data warehouse
implementation gives project managers an effective way to and ETL testing, and discusses their phases and goals dis-
regularly measure and document the project progress state. tinguishing between retrospective and prospective testing. It
Since the correctness of a system can only be measured also proposes to base part of the testing activities (those
with reference to a set of requirements, a successful test- related to incremental load) on mock data.
ing begins with the gathering and documentation of end- In [20] the authors propose a basic set of attributes for a
user requirements [8]. Since most end-users requirements data warehouse test scenario based on the IEEE 829 stan-
are about data analysis and data quality, it is inevitable dard. Among these attributes, the test purpose, its perfor-
that data warehouse testing primarily focuses on the ETL mance requirements, its acceptance criteria, and the activi-
process on the one hand (this is sometimes called back-end ties for its completion.
testing [23]), on reporting and OLAP on the other (front-end [2] proposes a process for data warehouse testing centered
testing [23]). While back-end testing aims at ensuring that on a unit test phase, an integration test phase, and a user
data loaded into the data warehouse are consistent with the acceptance test phase.
source data, front-end testing aims at verifying that data are [21] reports eight classical mistakes in data warehouse
correctly navigated and aggregated in the available reports. testing; among these: not closely involving end users, testing
From the organizational point of view, several roles are in- reports rather than data, skipping the comparison between
volved with testing [8]. Analysts draw conceptual schemata, data warehouse data and data source data, and using only
that represent the users requirements to be used as a refer- mock data. Besides, unit testing, system testing, acceptance
ence for testing. Designers are responsible for logical schemata testing, and performance testing are proposed as the main
of data repositories and for data staging flows, that should be testing steps.
tested for efficiency and robustness. Testers develop and ex- [23] explains the differences between testing OLTP and
ecute test plans and scripts. Developers perform white box OLAP systems and proposes a detailed list of testing cate-
unit tests. Database administrators test for performance gories. It also enumerates possible test scenarios and differ-
and stress, and set up test environments. Finally, end-users ent types of data to be used for testing.
perform functional tests on reporting and OLAP front-ends. [8] discusses the main aspects in data warehouse testing.
In this paper we propose a comprehensive approach to In particular, it distinguishes the different roles required in
testing data warehouse systems. More precisely, consider- the testing team, and the different types of testing each role
ing that data warehouse systems are commonly built in a should carry out.
bottom-up fashion, by iteratively designing and implement- [4] presents some lessons learnt on data warehouse testing,
ing one data mart at a time, we will focus on the test of a emphasizing the role played by constraint testing, source-to-
single data mart. The main features of our approach can be target testing, and validation of error processing procedures.
summarized as follows: All papers mentioned above provide useful hints and list
some key testing activities. On the other hand, our paper
• A consistent portion of the testing effort is advanced is the first attempt to define a comprehensive framework for
to the design phase to reduce the impact of error cor- data mart testing.
rection.
3. METHODOLOGICAL FRAMEWORK
• A number of data mart-specific testing activities are
identified and classified in terms of what is tested and To discuss how testing relates to the different phases of
how it is tested. data mart design, we adopt as a methodological framework
the one described in [7]. As sketched in Figure 1, this frame-
work includes eight phases:
• A tight relationship is established between testing ac-
tivities and design phases within the framework of a • Requirement analysis: requirements are elicited from
reference methodological approach to design. users and represented either informally by means of
proper glossaries or formally (e.g., by means of goal-
• When possible, testing activities are related to quality oriented diagrams as in [5]);
metrics to allow their quantitative assessment.
• Analysis and reconciliation: data sources are inspected,
The remainder of the paper is organized as follows. Af- normalized, and integrated to obtain a reconciled schema;
ter briefly reviewing the related literature in Section 2, in • Conceptual design: a conceptual schema for the data
Section 3 we propose the reference methodological frame- mart –e.g., in the form of a set of fact schemata [6]– is
work. Then, Section 4 classifies and describes the data mart- designed considering both user requirements and data
specific testing activities we devised, while Section 5 briefly available in the reconciled schema;
discusses some issues related to test coverage. Section 6 pro-
poses a timeline for testing in the framework of the reference • Workload refinement: the preliminary workload ex-
design methodology, and Section 7 summarizes the lessons pressed by users is refined and user profiles are singled
we learnt. out, using for instance UML use case diagrams;

18
in the light of the complexity of data warehouse projects and
of the close relationship between good design and good per-
formance, we argue that testing the design quality is almost
equally important. Testing design quality mainly implies
verifying that user requirements are well expressed by the
conceptual schema of the data mart and that the conceptual
and logical schemata are well-built. Overall, the items to be
tested can then be summarized as follows:
• Conceptual schema: it describes the data mart from an
implementation-independent point of view, specifying
the facts to be monitored, their measures, the hierar-
chies to be used for aggregation. Our methodological
framework adopts the Dimensional Fact Model to this
end [6].
• Logical schema: it describes the structure of the data
repository at the core of the data mart. If the im-
plementation target is a ROLAP platform, the logical
schema is actually a relational schema (typically, a star
schema or one of its variants).
• ETL procedures: the complex procedures that are in
charge of feeding the data repository starting from
data sources.
• Database: the repository storing data.
Figure 1: Reference design methodology • Front-end: the applications accessed by end-users to
analyze data; typically, these are either static reporting
tools or more flexible OLAP tools.
• Logical design: a logical schema for the data mart (e.g.,
in the form of a set of star schemata) is obtained by As concerns the second coordinate, how, the eight types
properly translating the conceptual schema; of test that, in our experience, best fit the characteristics of
data warehouse systems are summarized below:
• Data staging design: ETL procedures are designed con-
sidering the source schemata, the reconciled schema, • Functional test: it verifies that the item is compliant
and the data mart logical schema; and with its specified business requirements.

• Physical design: this includes index selection, schema • Usability test: it evaluates the item by letting users
fragmentation, and all other issues related to physical interact with it, in order to verify that the item is easy
allocation. to use and comprehensible.

• Implementation: this includes implementation of ETL • Performance test: it checks that the item performance
procedures and creation of front-end reports. is satisfactory under typical workload conditions.

Note that this methodological framework is general enough • Stress test: it shows how well the item performs with
to host both supply-driven, demand-driven, and mixed ap- peak loads of data and very heavy workloads.
proaches to design. While in a supply-driven approach con-
• Recovery test: it checks how well an item is able to re-
ceptual design is mainly based on an analysis of the available
cover from crashes, hardware failures and other similar
source schemata [6], in a demand-driven approach user re-
problems.
quirements are the driving force of conceptual design [3]. Fi-
nally, in a mixed approach requirement analysis and source • Security test: it checks that the item protects data and
schema analysis are carried out in parallel, and requirements maintains functionality as intended.
are used to actively reduce the complexity of source schema
analysis [1]. • Regression test: It checks that the item still functions
correctly after a change has occurred.
4. TESTING ACTIVITIES Remarkably, these types of test are tightly related to six of
In order to better frame the different testing activities, we the software quality factors described in [12]: correctness,
identify two distinct, though not independent, classification usability, efficiency, reliability, integrity, flexibility.
coordinates: what is tested and how it is tested. The relationship between what and how is summarized in
As concerns the first coordinate, what, we already men- Table 1, where each check mark points out that a given type
tioned that testing data quality is undoubtedly at the core of of test should be applied to a given item. Starting from this
data warehouse testing. Testing data quality mainly entails table, in the following subsections we discuss the main test-
an accurate check on the correctness of the data loaded by ing activities and how they are related to the methodological
ETL procedures and accessed by front-end tools. However, framework outlined in Section 3.

19
4.1 Testing the Conceptual Schema
Table 1: What vs. how in testing
Software engineers know very well that the earlier an er-

Conceptual schema
ror is detected in the software design cycle, the cheapest

ETL procedures
correcting that error is. One of the advantages of adopt-

Logical schema
ing a data warehouse methodology that entails a conceptual
design phase is that the conceptual schema produced can

Front-end
Database
be thoroughly tested for user requirements to be effectively
supported.
We propose two main types of test on the data mart con-
Functional X X X X ceptual schema in the scope of functional testing. The first,
Usability X X X that we call fact test, verifies that the workload preliminar-
Performance X X X X ily expressed by users during requirement analysis is actu-
Stress X X X ally supported by the conceptual schema. This can be easily
Recovery X X achieved by checking, for each workload query, that the re-
Security X X X quired measures have been included in the fact schema and
Regression X X X X X that the required aggregation level can be expressed as a
Analysis & design Implementation valid grouping set on the fact schema. We call the second
type of test a conformity test, because it is aimed at as-
sessing how well conformed hierarchies have been designed.
This test can be carried out by measuring the sparseness
of the bus matrix [7], which associates each fact with its
We preliminarily remark that a requirement for an effec-
dimensions, thus pointing out the existence of conformed
tive test is the early definition, for each testing activity, of
hierarchies. Intuitively, if the bus matrix is very sparse,
the necessary conditions for passing the test. These condi-
the designer probably failed to recognize the semantic and
tions should be verifiable and quantifiable. This means that
structural similarities between apparently different hierar-
proper metrics should be introduced, together with their ac-
chies. Conversely, if the bus matrix is very dense, the de-
ceptance thresholds, so as to get rid of subjectivity and am-
signer probably failed to recognize the semantic and struc-
biguity issues. Using quantifiable metrics is also necessary
tural similarities between apparently different facts.
for automating testing activities.
Other types of test that can be executed on the conceptual
While for some types of test, such as performance tests,
schema are related to its understandability, thus falling in
the metrics devised for generic software systems (such as
the scope of usability testing. An example of a quantitative
the maximum query response time) can be reused and ac-
approach to this test is the one described in [18], where a set
ceptance thresholds can be set intuitively, for other types of
of metrics for measuring the quality of a conceptual schema
test data warehouse-specific metrics are needed. Unfortu-
from the point of view of its understandability are proposed
nately, very few data warehouse-specific metrics have been
and validated. Example of these metrics are the average
defined in the literature. Besides, the criteria for setting
number of levels per hierarchy and the number of measures
their acceptance thresholds often depend on the specific fea-
per fact.
tures of the project being considered. So, in most cases, ad
hoc metrics will have to be defined together their thresholds.
An effective approach to this activity hinges on the following
4.2 Testing the Logical Schema
main phases [18]: Testing the logical schema before it is implemented and
before ETL design can dramatically reduce the impact of
1. Identify the goal of the metrics by specifying the un- errors due to bad logical design. An effective approach
derlying hypotheses and the quality criteria to be mea- to functional testing consists in verifying that a sample of
sured. queries in the preliminary workload can correctly be for-
mulated in SQL on the logical schema. We call this the
2. Formally define the metrics. star test. In putting the sample together, priority should
be given to the queries involving irregular portions of hier-
archies (e.g., those including many-to-many associations or
3. Theoretically validate the metrics using either axiomatic
cross-dimensional attributes), those based on complex ag-
approaches or measurement theory-based approaches,
gregation schemes (e.g., queries that require measures ag-
to get a first assessment of the metrics correctness and
gregated through different operators along the different hi-
applicability.
erarchies), and those leaning on non-standard temporal sce-
narios (such as yesterday-for-today).
4. Empirically validate the metrics by applying it to data In the scope of usability testing, in [17] some simple met-
of previous projects, in order to get a practical proof of rics based on the number of fact tables and dimension ta-
the metrics capability of measuring their goal quality bles in a logical schema are proposed. These metrics can be
criteria. This phase is also aimed at understanding the adopted to effectively capture schema understandability.
metrics implications and fixing the acceptance thresh- Finally, a performance test can be carried out on the log-
olds. ical schema by checking to what extent it is compliant with
the multidimensional normal forms, that ensure summariz-
5. Apply and accredit the metrics. The metrics defini- ability and support an efficient database design [11, 10].
tions and thresholds may evolve in time to adapt to Besides the above-mentioned metrics, focused on usabil-
new projects and application domains. ity and performance, the literature proposes other metrics

20
for database schemata, and relates them to abstract quality tests simulate an extraordinary workload due to a signifi-
factors. For instance, in the scope of maintainability, [15] cantly larger amount of data.
introduces a set of metrics aimed at evaluating the quality The recovery test of ETL checks for robustness by simu-
of a data mart logical schema with respect to its ability to lating faults in one or more components and evaluating the
sustain changes during an evolution process. system response. For example, you can cut off the power
supply while an ETL process is in progress or you can set
4.3 Testing the ETL Procedures a database offline while an OLAP session is in progress to
check for restore policies’ effectiveness.
ETL testing is probably the most complex and critical
As concerns ETL security, there is a need for verifying
testing phase, because it directly affects the quality of data.
that the database used to temporarily store the data being
Since ETL is heavily code-based, most standard techniques
processed (the so-called data staging area) cannot be vio-
for generic software system testing can be reused here.
lated, and that the network infrastructure hosting the data
A functional test of ETL is aimed at checking that ETL
flows that connect the data sources and the data mart is
procedures correctly extract, clean, transform, and load data
secure.
into the data mart. The best approach here is to set up unit
tests and integration tests. Unit tests are white-box test
that each developer carries out on the units (s)he devel-
4.4 Testing the Database
oped. They allow for breaking down the testing complexity, We assume that the logical schema quality has already
and they also enable more detailed reports on the project been verified during the logical schema tests, and that all is-
progress to be produced. Units for ETL testing can be ei- sues related to data quality are in charge of ETL tests. Then,
ther vertical (one test unit for each conformed dimension, database testing is mainly aimed at checking the database
plus one test unit for each group of correlated facts) or hor- performances using either standard (performance test) or
izontal (separate tests for static extraction, incremental ex- heavy (stress test) workloads. Like for ETL, the size of the
traction, cleaning, transformation, static loading, incremen- tested databases and their data distribution must be dis-
tal loading, view update); the most effective choice mainly cussed with the designers and the database administrator.
depends on the number of facts in the data marts, on how Performance tests can be carried out either on a database
complex cleaning and transformation are, and on how the including real data or on a mock database, but the database
implementation plan was allotted to the developers. In par- size should be compatible with the average expected data
ticular, crucial aspects to be considered during the loading volume. On the other hand, stress tests are typically car-
test are related to both dimension tables (correctness of roll- ried out on mock databases whose size is significantly larger
up functions, effective management of dynamic hierarchies than what expected. Standard database metrics –such as
and irregular hierarchies), fact tables (effective management maximum query response time– can be used to quantify the
of late updates), and materialized views (use of correct ag- test results. To advance these testing activities as much as
gregation functions). possible and to make their results independent of front-end
After unit tests have been completed, an integration test applications, we suggest to use SQL to code the workload.
allows the correctness of data flows in ETL procedures to be Recovery tests enable testers to verify the DBMS behav-
checked. Different quality dimensions, such as data coher- ior after critical errors such as power leaks during update,
ence (the respect of integrity constraints), completeness (the network fault, and hard disk failures.
percentage of data found), and freshness (the age of data) Finally, security tests mainly concern the possible adop-
should be considered. Some metrics for quantifying these tion of some cryptography technique to protect data and the
quality dimensions have been proposed in [22]. correct definition of user profiles and database access grants.
During requirement analysis the designer, together with
users and database administrators, should have singled out 4.5 Testing the Front-End
and ranked by their gravity the most common causes of Functional testing of the analysis front-ends must neces-
faulty data, aimed at planning a proper strategy for dealing sarily involve a very large number of end-users, who gener-
with ETL errors. Common strategies for dealing with errors ally are so familiar with application domains that they can
of a given kind are “automatically clean faulty data”, “reject detect even the slightest abnormality in data. Nevertheless,
faulty data”, “hand faulty data to data mart administrator”, wrong results in OLAP analyses may be difficult to recog-
etc. So, not surprisingly, a distinctive feature of ETL func- nize. They can be caused not only by faulty ETL proce-
tional testing is that it should be carried out with at least dures, but even by incorrect data aggregations or selections
three different databases, including respectively (i) correct in front-end tools. Some errors are not due to the data mart;
and complete data, (ii) data simulating the presence of faulty instead, they result from the overly poor data quality of the
data of different kinds, and (iii) real data. In particular, tests source database. In order to allow this situation to be rec-
using dirty simulated data are sometimes called forced-error ognized, a common approach to front-end functional testing
tests: they are designed to force ETL procedures into error in real projects consists in comparing the results of OLAP
conditions aimed at verifying that the system can deal with analyses with those obtained by directly querying the source
faulty data as planned during requirement analysis. databases. Of course, though this approach can be effective
Performance and stress tests are complementary in as- on a sample basis, it cannot be extensively adopted due to
sessing the efficiency of ETL procedures. Performance tests the huge number of possible aggregations that characterize
evaluate the behavior of ETL with reference to a routine multidimensional queries.
workload, i.e., when a domain-typical amount of data has A significant sample of queries to be tested can be selected
to be extracted and loaded; in particular, they check that in mainly two ways. In the “black-box way”, the workload
the processing time be compatible with the time frames ex- specification obtained in output by the workload refinement
pected for data-staging processes. On the other hand, stress phase (typically, a use case diagram where actors stand for

21
Table 2: Coverage criteria for some testing activities; the expected coverage is expressed with reference to
the coverage criterion
Testing activity Coverage criterion Measurement Expected coverage
each information need expressed percentage of queries in the prelim- partial, depending on the ex-
fact test by users during requirement anal- inary workload that are supported tent of the preliminary work-
ysis must be tested by the conceptual schema load
all data mart dimensions must be
conformity test bus matrix sparseness total
tested
usability test of the all facts, dimensions, and measures
conceptual metrics total
conceptual schema must be tested
ETL unit test all decision points must be tested correct loading of the test data sets total
all error types specified by users correct loading of the faulty data
ETL forced-error test total
must be tested sets
at least one group-by set for each
correct analysis result of a real
front-end unit test attribute in the multidimensional total
data set
lattice of each fact must be tested

user profiles and use cases represent the most frequent anal- such as data marts, concerns checking for new components
ysis queries) is used to determine the test cases, much like and new add-on features to be compatible with the operation
use case diagrams are profitably employed for testing-in-the- of the whole system. In this case, the term regression test is
large in generic software systems. In the “white-box way”, used to define the testing activities carried out to make sure
instead, the subset of data aggregations to be tested can be that any change applied to the system does not jeopardize
determined by applying proper coverage criteria to the mul- the quality of preexisting, already tested features and does
tidimensional lattice1 of each fact, much like decision, state- not corrupt the system performances.
ment, and path coverage criteria are applied to the control Testing the whole system many times from scratch has
graph of a generic software procedure during testing-in-the- huge costs. Three main directions can be followed to reduce
small. these costs:
Also in front-end testing it may be useful to distinguish
between unit and integration tests. While unit tests should • An expensive part of each test is the validation of the
be aimed at checking the correctness of the reports involving test results. In regression testing, it is often possible
single facts, integration tests entail analysis sessions that ei- to skip this phase by just checking that the test results
ther correlate multiple facts (the so-called drill-across queries) are consistent with those obtained at the previous it-
or take advantage of different application components. For eration.
example, this is the case when dashboard data are “drilled
through” an OLAP application. • Test automation allows the efficiency of testing ac-
An integral part of front-end tests are usability tests, that tivities to be increased, so that reproducing previous
check for OLAP reports to be suitably represented and com- tests becomes less expensive. Test automation will be
mented to avoid any misunderstanding about the real mean- briefly discussed in Section 6.
ing of data.
A performance test submits a group of concurrent queries • Impact analysis can be used to significantly restrict
to the front-end and checks for the time needed to process the scope of testing. In general, impact analysis is
those queries. Performance tests imply a preliminary spec- aimed at determining what other application objects
ification of the standard workload in terms of number of are affected by a change in a single application ob-
concurrent users, types of queries, and data volume. On the ject [9]. Remarkably, some ETL tool vendors already
other hand, stress tests entails applying progressively heav- provide some impact analysis functionalities. An ap-
ier workloads than the standard loads to evaluate the system proach to impact analysis for changes in the source
stability. Note that performance and stress tests must be data schemata is proposed in [14].
focused on the front-end, that includes the client interface
but also the reporting and OLAP server-side engines. This
means that the access times due to the DBMS should be 5. TEST COVERAGE
subtracted from the overall response times measured. Testing can reduce the probability of a system fault but
Finally, in the scope of security test, it is particularly im- cannot set it to zero, so measuring the coverage of tests is
portant to check for user profiles to be properly set up. You necessary to assess the overall system reliability. Measuring
should also check for single-sign-on policies to be set up prop- test coverage requires first of all the definition of a suitable
erly after switching between different analysis applications. coverage criterion. Different coverage criteria, such as state-
ment coverage, decision coverage, and path coverage, were
4.6 Regression Tests devised in the scope of code testing. The choice of one or
A very relevant problem for frequently updated systems, another criterion deeply affects the test length and cost, as
1
The multidimensional lattice of a fact is the lattice whose well as the achievable coverage. So, coverage criteria are
nodes and arcs correspond, respectively, to the group-by sets chosen by trading off test effectiveness and efficiency. Ex-
supported by that fact and to the roll-up relationships that amples of coverage criteria that we propose for some of the
relate those group-by sets. testing activities described above are reported in Table 2.

22
Design and testing process

Analyst / designer Tester Developer DBA End -user

Test planning
Requirement Analysis and
analysis reconciliation

Conc. schema:
Conceptual
conformity &
design
usability test

Workload Conc. schema:


refinement fact test

Logical schema:
Logical design functional, perf.
& usability test

Front -end
Staging design Physical design
implementation

ETL
implementation

ETL: integration
ETL: unit test
test

Front -end:
Front -end:
functional &
security test
usability test

ETL: forced -error,


perf., stress, recov.
& secur. test
Front -end:
performance & Database: all
stress test tests

Figure 2: UML activity diagram for design and testing

6. A TIMELINE FOR TESTING outlined in Section 3. This diagram can be used in a project
From a methodological point of view, the three main phases as a starting point for preparing the test plan.
of testing are [13]: It is worth mentioning here that test automation plays
a basic role in reducing the costs of testing activities (es-
• Create a test plan. The test plan describes the tests pecially regression tests) on the one hand, on increasing
that must be performed and their expected coverage test coverage on the other [4]. Remarkably, commercial
of the system requirements. tools (such as QACenter by Computerware) can be used
for implementation-related testing activities to simulate spe-
• Prepare test cases. Test cases enable the implementa- cific workloads and analysis sessions, or to measure a process
tion of the test plan by detailing the testing steps to- outcome. As to design-related testing activities, the metrics
gether with their expect results. The reference databases proposed in the previous sections can be measured by writ-
for testing should be prepared during this phase, and ing ad hoc procedures that access the meta-data repository
a wide, comprehensive set of representative workloads and the DBMS catalog.
should be defined.

• Execute tests. A test execution log tracks each test 7. CONCLUSIONS AND LESSONS LEARNT
along and its results. In this paper we proposed a comprehensive approach which
adapts and extends the testing methodologies proposed for
Figure 2 shows a UML activity diagram that frames the general-purpose software to the peculiarities of data ware-
different testing activities within the design methodology house projects. Our proposal builds on a set of tips and sug-

23
gestions coming from our direct experience on real projects, [3] R. Bruckner, B. List, and J. Schiefer. Developing
as well as from some interviews we made to data warehouse requirements for data warehouse systems with use
practitioners. As a result, a set of relevant testing activi- cases. In Proc. Americas Conf. on Information
ties were identified, classified, and framed within a reference Systems, pages 329–335, 2001.
design methodology. [4] R. Cooper and S. Arbuckle. How to thoroughly test a
In order to experiment our approach on a case study, we data warehouse. In Proc. STAREAST, Orlando, 2002.
are currently supporting a professional design team engaged [5] P. Giorgini, S. Rizzi, and M. Garzetti. GRAnD: A
in a large data warehouse project, which will help us better goal-oriented approach to requirement analysis in data
focus on relevant issues such as test coverage and test doc- warehouses. Decision Support Systems, 5(1):4–21,
umentation. In particular, to better validate our approach 2008.
and understand its impact, we will apply it to one out of [6] M. Golfarelli, D. Maio, and S. Rizzi. The dimensional
two data marts developed in parallel, so as to assess the fact model: A conceptual model for data warehouses.
extra-effort due to comprehensive testing on the one hand, International Journal of Cooperative Information
the saving in post-deployment error correction activities and Systems, 7(2-3):215–247, 1998.
the gain in terms of better data and design quality on the [7] M. Golfarelli and S. Rizzi. Data warehouse design:
other. Modern principles and methodologies. McGraw-Hill,
To close the paper, we would like to summarize the main 2009.
lessons we learnt so far:
[8] D. Haertzen. Testing the data warehouse.
• The chance to perform an effective test depends on the http://www.infogoal.com, 2009.
documentation completeness and accuracy in terms [9] R. Kimball and J. Caserta. The Data Warehouse ETL
of collected requirements and project description. In Toolkit. John Wiley & Sons, 2004.
other words, if you did not specify what you want from [10] J. Lechtenbörger and G. Vossen. Multidimensional
your system at the beginning, you cannot expect to get normal forms for data warehouse design. Information
it right later. Systems, 28(5):415–434, 2003.
[11] W. Lehner, J. Albrecht, and H. Wedekind. Normal
• The test phase is part of the data warehouse life-cycle, forms for multidimensional databases. In Proc.
and it acts in synergy with design. For this reason, the SSDBM, pages 63–72, Capri, Italy, 1998.
test phase should be planned and arranged at the be- [12] J. McCall, P. Richards, and G. Walters. Factors in
ginning of the project, when you can specify the goals software quality. Technical Report AD-A049-014, 015,
of testing, which types of tests must be performed, 055, NTIS, 1977.
which data sets need to be tested, and which quality
[13] A. Mookerjea and P. Malisetty. Best practices in data
level is expected.
warehouse testing. In Proc. Test, New Delhi, 2008.
• Testing is not a one-man activity. The testing team [14] G. Papastefanatos, P. Vassiliadis, A. Simitsis, and
should include testers, developers, designers, database Y. Vassiliou. What-if analysis for data warehouse
administrators, and end-users, and it should be set up evolution. In Proc. DaWaK, pages 23–33, Regensburg,
during the project planning phase. Germany, 2007.
[15] G. Papastefanatos, P. Vassiliadis, A. Simitsis, and
• Testing of data warehouse systems is largely based on Y. Vassiliou. Design metrics for data warehouse
data. A successful testing must rely on real data, but evolution. In Proc. ER, pages 440–454, 2008.
it also must include mock data to reproduce the most [16] R. Pressman. Software Engineering: A practitioner’s
common error situations that can be encountered in approach. The McGraw-Hill Companies, 2005.
ETL. Accurately preparing the right data sets is one [17] M. Serrano, C. Calero, and M. Piattini. Experimental
of the most critical activities to be carried out during validation of multidimensional data models metrics. In
test planning. Proc. HICSS, page 327, 2003.
• No matter how deeply the system has been tested: it is [18] M. Serrano, J. Trujillo, C. Calero, and M. Piattini.
almost sure that, sooner or later, an unexpected data Metrics for data warehouse conceptual models
fault, that cannot be properly handled by ETL, will oc- understandability. Information & Software Technology,
cur. So keep in mind that, while testing must come to 49(8):851–870, 2007.
an end someday, data quality certification is an ever [19] I. Sommerville. Software Engineering. Pearson
lasting process. The borderline between testing and Education, 2004.
certification clearly depends on how precisely require- [20] P. Tanuška, W. Verschelde, and M. Kopček. The
ment were stated and on the contract that regulates proposal of data warehouse test scenario. In Proc.
the project. ECUMICT, Gent, Belgium, 2008.
[21] A. van Bergenhenegouwen. Data warehouse testing.
8. REFERENCES http://www.ti.kviv.be, 2008.
[22] P. Vassiliadis, M. Bouzeghoub, and C. Quix. Towards
[1] A. Bonifati, F. Cattaneo, S. Ceri, A. Fuggetta, and
quality-oriented data warehouse usage and evolution.
S. Paraboschi. Designing data marts for data
In Proc. CAiSE, Heidelberg, Germany, 1999.
warehouses. ACM Transactions on Software
[23] Vv. Aa. Data warehouse testing and implementation.
Engineering Methodologies, 10(4):452–483, 2001.
In Intelligent Enterprise Encyclopedia. BiPM
[2] K. Brahmkshatriya. Data warehouse testing.
Institute, 2009.
http://www.stickyminds.com, 2007.

24

You might also like