Data Pipeline Testing
Learning Objectives
u Data Pipeline tests categories
u Standard tests types
Data Pipeline Testing
u Data quality tests: test the quality of the data
u Apply check constraints to Delta tables
u Standard tests: test the code logic
u Unit testing
u Integration testing
u End-to-end testing
Unit testing
u Approach to testing individual units of code, such as functions.
u If you make any changes to them in the future, you can determine
whether they still work as you expect them to.
u This helps you find problems with your code faster and earlier in the
development life cycle.
Unit Test Assertions
u An assertion is a statement that enables you to test the assumptions
you have made in your code
u assert func() == expected_value
Integration Testing
u Approach to testing the interaction between subsystems of an
application
u Software modules are integrated logically and tested as a group.
End-to-End Testing
u Approach to ensure that your application can run properly under
real-world scenarios
u Simulate a user experience from start to finish