Skip to content
All work
Data Quality· Enterprise sales organisation· 2023· ~10 weeks

CRM Data Quality Framework

'The reports are wrong' was always a data problem, never a reporting one. Built the framework to prove it and fix it.

  • Salesforce
  • Power BI
  • Power Query
  • Python (pandas)
  • Duplicate accounts
    −60%
    on the core book of business
  • Key-field completeness
    +28pp
    on industry, segment, country
  • DQI
    built + adopted
    reviewed weekly by the team

Business problem

Every reporting issue the commercial team raised ended up in the same place: duplicates, empty fields, inconsistent picklists. Leadership was losing trust in the numbers but kept aiming at the dashboards.

Data available

  • Salesforce Account, Contact, Opportunity full snapshot
  • Third-party enrichment sample for validation
  • Field usage statistics per team and region

Solution implemented

  1. 1Profiled the CRM with reproducible SQL + pandas, quantified duplicates, completeness and conformity per field, per team.
  2. 2Defined a Data Quality Index (DQI) weighted by the impact each field has on revenue workflows, not by how 'full' it looks.
  3. 3Built dedup and standardisation rules (industry, country, phone) partly automated in Salesforce, partly in a clean-up playbook for sales ops.
  4. 4Published the DQI in Power BI and made it the opening slide of the weekly ops meeting with owners per failing dimension.

Impact

Duplicate accounts
−60%
on the core book of business
Key-field completeness
+28pp
on industry, segment, country
DQI
built + adopted
reviewed weekly by the team

What I learned

The report is never the problem. It is always the field upstream and you only fix it once there is a scoreboard.