Who uses DataCleaner?
On this page we've collected a few cases from where DataCleaner is being used in practice.
DataCleaner & Pentaho Solutions (book)
Having used DataCleaner for data profiling on a couple of occasions during data warehouse projects, it was one of the obvious choices we made when we started laying out the concepts and content for the Pentaho Solutions book. Jos was the first to discover DataCleaner when he was researching available open source data quality tools for an article in the Dutch Database Magazine and immediately favored it over other available solutions. Since then he's been using it at client engagements as well. Roland also immediately liked the product, its ease of use and powerful regular expression validation options. We hope that using it as our data profiling tool of choice in 'Pentaho Solutions' will trigger more people to try and use DataCleaner for their profiling and data quality tasks!

DataCleaner & FAP Europe
Our industry relies on client specific data sets that present themselves in multiple formats. These formats often vary from year to year, with and without warning. DataCleaner allows us to accurately predict the impact of these changes and manage new data sets accordingly.
Without DataCleaner, our only way of determining data validity would be after a transformation script is written and data is uploaded. The data may be erroneous and therefore redundant; a costly way to validate the data.
DataCleaner allows us to analyse the data and pass a validation mark against it before embarking on an invalid transformation process.
The collaborative participation aspect of DataCleaner brings together a vast array of ideas and offers us new ways of using the software.
As part of the DataCleaner development, we can work on improvements at our own pace. We can
work on an idea and test it locally before submitting it as an improvement or fix.
DataCleaner & ITMATTER Inc.
ITMATTER Inc. uses DataCleaner as a part of our software development life cycle for data-driven enterprise applications. We rely on the data profiler and the data validator components in the Research & Development and Quality Assurance iteration phases of our agile development methodology.
Overall, DataCleaner fits into our philosophy of leveraging open source software wherever possible. The DataCleaner development team is highly motivated and responsive to their end-user and developer community taking a collaborative approach to ongoing support and new feature development.

DataCleaner & Lund&Bendsen
At Lund&Bendsen we apply DataCleaner for our customers when integrating applications with existing datastores. Often you need an in-depth understanding of the datastore to know how to integrate succesfully and we find that the DataCleaner project is a very useful project to work with in this regard - both from a user perspective and for enterprise Java development.
DataCleaner provides us with a firm basis for building data-centric applications and a framework for ensuring quality within enterprise-level databases.
DataCleaner & Ben Bor
As an Information Architect I often implement solutions that integrate data from several sources, internal and external. This could be a data warehouse, a reporting system or any other large information integration programme. Without exception, the biggest problem in each of these programmes is Data Quality. Without it, the business can not rely on the information the system is supplying. I always insist on ensuring Data Quality in any project I am involved in. Open-source data profiling is a huge help, in that I can install and run a profiler in a day (compared to the weeks it would take to evaluate, agree, purchase and install a ‘commercial’ product). This gives me the ability to profile everything before it hits the integrated data store (or data warehouse).
I have evaluated several open-source data profiling solutions and find DataCleaner to be the best.
It provides me with an easy way to run most of the profiling tasks that I need. The project team is
very responsive: they have implemented several of my suggestions in a very short time, thus making
the product even more suitable to my needs. I currently use DataCleaner for each and every project
I am involved in.
