Job of the day: Direct Line Group (Bromley UK) need a Data Quality Coordinator with data governance expertise

Print to Page   |   Contact Us   |   Report Abuse   |   Sign In   |   Become a member
How to Create a Data Quality Firewall and Data Quality SLA
Share |

Data Quality Essentials: How to Create a Data Quality Firewall and Data Quality SLA



Data Quality Firewall & Data Quality SLAHow do you prevent poor levels of supplier data quality impacting your organisation?

The answer is through the creation of a robust Data Quality Firewall and ongoing Data Quality SLA process. This post creates a blueprint for creating your own data quality firewall and ensuring your data quality SLA provides certified, high quality data throughout your organisation.

The Data Quality Essentials series identifies widely accepted data quality best-practices that your organisation should be adopting in an effort to improve and maintain data quality.




Author: Dylan Jones
Published: 18th November 2011

Editorial Categories: [TB.3] Data Quality Rules and Requirements, [TB.5] Data Quality Improvement and Control




How to Create a Data Quality Firewall and Data Quality SLA


First, a short story.

Back in 2005 I was consulting at a small utilities company running a data quality assessment project prior to a data migration. I got everyone in a room and started a simple information chain workshop task, sticking a load of Post-It’s on the whiteboard mapping the people, process and data chains that mattered to this team.

We talked about the issues in the data. People were generally happy but one person seemed a little flat so I asked what they did. It was their role, every month, to take a vital piece of data from a major supplier of data (also their primary partner) into the business.

Each month they laboriously sat at a computer fixing defects in the data by hand. The same type of defects were fixed every single month.

Now, it’s obviously reassuring they had some kind of firewall but there were obvious flaws:

  1. The company refused to push back on the supplier and demand a data quality Service Level Agreement (SLA), (sometimes this is tough to do but in this case they really hadn’t exhausted the options for push-back)
  2. This person had huge amounts of undocumented knowledge in his head and was demoralised, a high employee churn risk and obviously a major challenge to the company if he quit
  3. Doing stuff over and over again in the name of data quality is just bad practice
  4. The lead times for clean up meant major hassles downstream

So what were they doing right? Well you could say that…

  1. Someone was assigned to this "firewall”, okay it was a screwed up process but at least there was ownership
  2. They were diligent, nothing bad really got through according to downstream users
  3. They were trapping the data at source, nothing sneaked past this guy until it was ready

The fact is that many organisations get the data supplier relationship wrong by:

  1. Applying blanket trust and assuming the data will be sound
  2. Applying manual hacks and tweaks downstream
  3. Applying a dedicated person or team (costly) at the point of entry

It’s incredible how many companies I meet that accept data from outside the corporate boundary and assume it is correct. 

Don't make this assumption, take a proactive stance using these steps as an outline guide.


Simple Roadmap for a Data Quality Firewall and Data Quality SLA

  • Map all the information sources flowing into your organisation (we’re talking a Post-It session here, not a £20,000 data modelling tool, and bring donuts, that helps)
  • Where information comes from an external source identify whether the following exists:
    • Formal data specifications outlining things like field formats, frequency of delivery, expected values, permitted ranges
    • Escalation procedures or standard operating procedures for when defects are found
  • No documentation or process exists? Then create one listing the elements above but also add:
    • Simple information chain diagram explaining where this information comes in and feeds to
    • Names and contact details for everyone involved with this data, both on the supplier and consumer side
  • Present this to the relevant stakeholder explaining your desire to improve this inbound information source with a view to:
    • Preventing embarrassment to the stakeholder when bad data impacts other business units (I would probably open with this)
    • Reducing the cost savings of building a process that eliminates endless amounts of scrap and rework activity (costs based on past incidents is ideal here)
    • Increasing overall perception of the stakeholders team as a highly professional, innovative resource (they will like this)
    • Reducing the lead times and improving various other metrics the stakeholder will be held accountable for
  • Get them to sign off that they are ultimately responsible for this process and you will provide them regular reports on how the process is working and the value that it (and they) are bringing to the business
  • Document all known issues with this inbound information source, surveys and casual conversations with data workers, DBA’s, app designers and anyone else who touches this data downstream are the way forward here
  • Profile the data using one of the many free tools now available
  • Rigorously document the data quality rules you feel the data should be managed against e.g.
    • Timeliness – the data must arrive between 9am and 10am every weekday morning
    • Completeness – the customer name and address fields must be populated, there must be a valid order number etc.
    • Formatting – the order number must be in the format of NN-LLLL-NN, with no exceptions
    • Overloading – each record must only have one entry in the part code, there must not be multiple part codes in the same field
    • etc….
  • Convert these data quality rules into a live monitoring process, e.g.
    • Use one of the many free data quality and data integration tools available
    • Use standard scripts in SQL or Unix, whatever your data processing platform uses
  • Implement the process and start to track the issues discovered
  • Run this process for at least a month, discovering issues and fixing manually
  • Create a comprehensive file of all the issues found and the impacts they’re having in your organisation
  • Approach the data supplier outlining your findings, the innovations you have made and the issues you are picking up
  • If no SLA or formal definitions and agreements around data quality exist, work with your stakeholder and the supplier to get something in place
  • Agree to share your technology and approach with the supplier so that they can improve their data quality (chances are it’s being supplied to other companies too)
  • Work together to iteratively create the most robust data quality firewall and SLA process possible

Okay, so this may deviate depending on your particular situation but I’ve used variations on this in the past and it works. In several cases the data supplier had no idea their data was defective because no-one had raised it. In my story above, no-one had pushed back on the supplier so the manual fire-fighting had continued for many months.

If you have implemented something similar then I would love to hear about it and perhaps feature your story on the site.



Useful Resources Related to this Feature:

[TB.3] Data Quality Rules and Requirements

Item Name Posted By Date Posted
DQ and Business Rules Explained with Ronald G.Ross Link  more ] Administration 04/11/2011
Data Quality Rules Process for Data Migration Link  more ] Administration 07/10/2011
Data Quality-Centric Data Migration, John Morris-2 Link  more ] Administration 07/10/2011
Data Quality-Centric Data Migration, John Morris-1 Link  more ] Administration 07/10/2011
Data Quality Rules: General Attribute Dependencies Link  more ] Administration 07/10/2011
Data Quality Rules: Rules for Historical Data Link  more ] Administration 07/10/2011
Data Quality Rules: Attribute Domain Constraints Link  more ] Administration 07/10/2011
Data Quality Rules: Integrity Constraints Link  more ] Administration 07/10/2011
Data Quality Rules: State-Dependent Objects Link  more ] Administration 07/10/2011

[TB.5] Data Quality Improvement and Control

Item Name Posted By Date Posted
Identifying Duplicate Customers by Jim Harris-1 Link  more ] Administration 18/11/2011
Identifying Duplicate Customers by Jim Harris-2 Link  more ] Administration 18/11/2011
Identifying Duplicate Customers by Jim Harris-3 Link  more ] Administration 18/11/2011
Data Matching Better Practice Guidelines, Wayne Co Link  more ] Administration 18/11/2011
The Freakish World of Root Cause Analysis Link  more ] Administration 18/11/2011
Data Profiling Tutorial - 1 (with Free Software) Link  more ] Administration 18/11/2011
7 Simple Ways to Improve Data Entry Data Quality Link  more ] Administration 18/11/2011
How to Create a Data Quality Firewall and SLA Link  more ] Administration 18/11/2011
Search Data Quality Pro
Please sign in here >
Data Quality Journal
Event Calendar

17/05/2012
[SEMINAR]: Solvency II Briefing (Manchester) with DataFlux

25/06/2012 » 28/06/2012
The Data Governance and Information Quality Conference (DGIQ2012), June 25-28, 2012, San Diego

Online Surveys
Popular Demos