An overview of the data profiling process (also known as data assessment, data discovery, or data quality analysis) as part of ETL and data integration.
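As a rough illustration of what a basic profiling pass can look like, the sketch below counts rows, missing values, and distinct values per column using only the Python standard library. The function name `profile_rows`, the choice of statistics, and the sample data are assumptions made for this example; they are not taken from any particular ETL or data quality tool.

```python
import csv
import io
from collections import Counter


def profile_rows(rows):
    """Return simple per-column statistics for an iterable of dict rows."""
    stats = {}
    for row in rows:
        for column, value in row.items():
            col = stats.setdefault(
                column, {"count": 0, "missing": 0, "values": Counter()}
            )
            col["count"] += 1
            # Treat None and blank strings as missing values.
            if value is None or value.strip() == "":
                col["missing"] += 1
            else:
                col["values"][value.strip()] += 1

    # Summarize each column: distinct count and most frequent value.
    summary = {}
    for column, col in stats.items():
        most_common = col["values"].most_common(1)
        summary[column] = {
            "count": col["count"],
            "missing": col["missing"],
            "distinct": len(col["values"]),
            "top": most_common[0][0] if most_common else None,
        }
    return summary


# Hypothetical sample data, standing in for a real extract.
SAMPLE_CSV = """id,city,age
1,Paris,34
2,,29
3,paris,
4,Paris,41
"""

profile = profile_rows(csv.DictReader(io.StringIO(SAMPLE_CSV)))
for column, column_stats in profile.items():
    print(column, column_stats)
```

In this sample the `city` column reports two distinct values because "Paris" and "paris" differ only by case, which is the kind of inconsistency a profiling pass is meant to surface before any cleansing step.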

For tabular data sets of fewer than roughly 1M rows, OpenRefine (formerly Google Refine) provides a powerful set of ... What are the best open source data cleansing tools or software available? Which tools do you use for data cleansing, or what is the process to do ...

OpenRefine (formerly Google Refine): a free, open source power tool for working with messy data, including cleaning it.

Data quality tool vendors that combine data quality software, data integration tools, and MDM capabilities rank highest in the new Gartner Magic Quadrant for Data Quality.

