Data Cleaning

Data cleaning is the process of transforming data into regular, structured, usable, trustworthy formats.


Setting expectations

Data is never clean, even after you clean it! Do not expect perfection. Instead, we manage data cleanliness. 99% clean is close enough. Fixing another 99% of that last 1% is amazing. Chasing exact perfection would be a full-time job.


Target a table

[Image: schema]


Make a query

[Image: query]


Download a CSV

[Image: csv]


Inspect and filter your data

[Image: filter]
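For a scripted version of this inspection step, here is a minimal sketch using Python's standard `csv` module. The sample data and the column names (`catalog_number`, `country`, `collector`) are hypothetical stand-ins for whatever your query exported.

```python
import csv
import io
from collections import Counter

# Hypothetical export; these column names and values are illustrative only.
SAMPLE_CSV = """catalog_number,country,collector
1001,USA,Smith
1002,U.S.A.,Smith
1003,,Jones
1004,usa,
"""


def load_rows(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


def blank_counts(rows):
    """Count empty or whitespace-only values per column."""
    counts = Counter()
    for row in rows:
        for column, value in row.items():
            if not (value or "").strip():
                counts[column] += 1
    return dict(counts)


def value_counts(rows, column):
    """Tally distinct values in one column to expose inconsistent spellings."""
    return Counter((row[column] or "").strip() for row in rows)


rows = load_rows(SAMPLE_CSV)
print(blank_counts(rows))
print(value_counts(rows, "country"))
```

Tallying distinct values tends to surface the inconsistent spellings (`USA`, `U.S.A.`, `usa`) that filtering by eye can miss.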


Edit and inspect

[Image: edit]


Create a sample

[Image: sample]
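If you would rather draw the sample with a script, one possible approach is a seeded random sample, so the same rows come back on every run. This is only a sketch: the function name `sample_rows`, the field names, and the sample size are assumptions, not part of any tool shown here.

```python
import csv
import io
import random


def sample_rows(rows, k, seed=0):
    """Return up to k rows chosen at random; the fixed seed makes it repeatable."""
    rng = random.Random(seed)
    return rng.sample(rows, min(k, len(rows)))


def to_csv_text(rows, fieldnames):
    """Serialize rows back to CSV text, ready to save as a small test file."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical cleaned rows; in practice these come from your edited CSV.
cleaned = [{"catalog_number": str(n), "country": "USA"} for n in range(1001, 1021)]
sample = sample_rows(cleaned, k=5)
print(to_csv_text(sample, ["catalog_number", "country"]))
```

A small, repeatable sample keeps the first upload easy to check and easy to undo.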


Import data to Workbench

[Image: sample]


Map edits and save upload plan

[Image: map]


Validate and upload

[Image: validate]


Verify through original query

[Image: verify]
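The verification can also be scripted by re-running the original query, exporting it again, and comparing the result with the rows you uploaded. The sketch below assumes both sets are already loaded as lists of dicts and share a unique key column. The helper `find_mismatches` and the column names are hypothetical.

```python
def find_mismatches(expected, actual, key):
    """Compare expected rows against actual rows, matched on a unique key column.

    Returns (key_value, column, expected_value, actual_value) tuples, with
    column set to "missing" when a row is absent from the actual data.
    """
    actual_by_key = {row[key]: row for row in actual}
    mismatches = []
    for row in expected:
        found = actual_by_key.get(row[key])
        if found is None:
            mismatches.append((row[key], "missing", None, None))
            continue
        for column, value in row.items():
            if found.get(column) != value:
                mismatches.append((row[key], column, value, found.get(column)))
    return mismatches


# Hypothetical data: what was uploaded versus what the query now returns.
uploaded = [
    {"catalog_number": "1001", "country": "USA"},
    {"catalog_number": "1002", "country": "USA"},
]
requeried = [
    {"catalog_number": "1001", "country": "USA"},
    {"catalog_number": "1002", "country": "U.S.A."},
]
print(find_mismatches(uploaded, requeried, key="catalog_number"))
```

An empty list suggests the upload landed as intended. Anything else points at the specific record and column to look at.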


Roll back (if you want) and repeat for the full set

[Image: rollback]