Topic: Output formats from command line DataCleaner?
I was just browsing the roadmap and realized that the next version of DataCleaner will include a command-line interface. I think this is great news, but I'm curious - what kind of output will it produce? Will it resemble the output in the GUI, just as "raw data", or will it be a formatted report, or something else?
Personally, I would prefer the raw data so that I could make my own reporting system on top of that. Will this be possible? And what will the data look like (schema/table layout)?
Hi beno,
We will add an option for choosing between different output formats. To begin with we are just going to make an XML output with some fitting format. We haven't implemented this yet though, so if you (or anybody else) have anything to say on this topic, please speak your mind.
Off the top of my head, I imagine an XML format similar to this (example):
<profilingResult profile="Standard Measures">
<measure value="0" column="MY_COLUMN" row="min value" />
...
</profilingResult>
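For anyone who wants to build their own reporting on top of such a file, here is a minimal sketch of how the proposed layout could be read from Python. It assumes the element and attribute names from the example above (`profilingResult`, `measure`, `value`, `column`, `row`); since the format is not implemented yet, these may well change. The second measure in the sample and the `read_measures` helper are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample following the proposed layout.
# The "max value" measure is invented for illustration only.
SAMPLE = """
<profilingResult profile="Standard Measures">
  <measure value="0" column="MY_COLUMN" row="min value" />
  <measure value="42" column="MY_COLUMN" row="max value" />
</profilingResult>
"""

def read_measures(xml_text):
    """Return the profile name and a list of (column, row, value) tuples."""
    root = ET.fromstring(xml_text.strip())
    measures = [
        (m.get("column"), m.get("row"), m.get("value"))
        for m in root.iter("measure")
    ]
    return root.get("profile"), measures

profile, measures = read_measures(SAMPLE)
for column, row, value in measures:
    print(f"{profile} | {column} | {row} = {value}")
```

Note that all values come back as strings, so any type conversion would have to happen in the reporting layer.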
I think maybe we should go for one of the OASIS standards for table models?
I think the easiest way to output data is just to write the raw data to a flat file. I know we will lose type safety if we use CSV, but on the other hand the profiling operates on known data, so the column definitions are already known.
So I think an XML option would be a nice feature, but I predict that I would more often use a simple CSV output.
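To make the flat-file idea concrete, here is a rough sketch of what one-row-per-measure CSV output could look like, written with Python's standard `csv` module. The header names (`profile`, `column`, `measure`, `value`), the sample rows, and the `to_csv` helper are all assumptions for illustration, not anything DataCleaner actually produces.

```python
import csv
import io

# Hypothetical profiling measures; names and values are illustrative only.
ROWS = [
    {"profile": "Standard Measures", "column": "MY_COLUMN",
     "measure": "min value", "value": "0"},
    {"profile": "Standard Measures", "column": "MY_COLUMN",
     "measure": "max value", "value": "42"},
]

def to_csv(rows):
    """Serialize the measures as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=["profile", "column", "measure", "value"],
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()

csv_text = to_csv(ROWS)
print(csv_text)
```

As mentioned, every value ends up as plain text in the file, so a consumer has to know the column types up front.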
With the new release candidate 2 of DC 1.5, you can check it out for yourselves.


