Data Quality Skills Tutorial: Learn how to profile and validate data (for free) using this DataCleaner tutorial
Data Quality Skills Tutorial: Learn how to profile and validate data (for free) using this DataCleaner tutorial
Yesterday, we published an interview with Kasper Sørensen to discuss his free, open source DataCleaner product and his experiences of going down the open source route (click here for article).
To help our readers understand some of the core data quality capabilities provided by DataCleaner and also give you some practical experience of a real-life data quality scenario we have put together a detailed tutorial, sample data and dictionary files to help you learn some of the essentials of:
- Data profiling
- Data matching
- Data validation
- Dictionary management
- Pattern analysis
The tutorial workbook walks you through each exercise required to validate a simulated job feed of data from an upstream supplier. It also shows you how to enact the data quality rules that will help you identify and eliminate defective data at source.
DataCleaner is a great tool to use for all abilities as it is simple to use, easy to install and requires no investment.
We hope you find the tutorial useful and do let us know if you want to see more tutorials like this by adding your comments below.
To download the data quality tutorial please follow these instructions:
- Are you a registered member of Data Quality Pro? Registration is required to access this tutorial but membership is absolutely free and only takes about 20 seconds (just click here)
- Navigate to the Download Centre (you will need to be registered and logged in to enter the download centre)
- Navigate to the "Tutorial Materials" folder
- Download the DataCleaner Tutorial file
- Unzip the .zip file and follow the instructions in the DataCleaner Tutorial pdf file to install the software and work through the tutorial
Useful Resources
- Interview with Kasper Sorensen, creator of DataCleaner
- http://datacleaner.eobjects.org/ - The DataCleaner website
- Data Profiling for Beginners - download a complete tutorial including free software to start your own data profiling initiative
- Free Data Profiling Tutorial: Discovering Dependency Rules
- Data Profiling Tutorial: Data Profiling for Beginners
- Data Quality Assessment Tutorial: Pattern Analysis in Excel
- Free DQ Pattern Analyser for Microsoft Access (part 1 in the series)
- Need to trap data defects in Oracle? Download this free data quality pattern analyser
See all: DQ Techniques,
Methodology,
Personal Development,
Technology,
Tutorial


DQ Techniques
Reader Comments