Hello Dominick D.,
Cleaning and scrubbing data is very subjective, so there is no system
stored procedure (sp_) that will do it for you.
I use the following methodology:
Extract | Cleanse | Conform | Present
Cleansing could be address validation or getting rid of duplicates.
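As a rough illustration of the duplicate-removal part of cleansing, here is a minimal Python sketch. It is not what I run in production and not anything built into DTS; the field names, the sample rows and the `normalise` and `drop_duplicates` helpers are all made up for the example, and it assumes two rows are duplicates when their key fields match after trimming, collapsing whitespace and ignoring case.

```python
import re

def normalise(value):
    """Trim, collapse runs of whitespace and upper-case a text field."""
    return re.sub(r"\s+", " ", value.strip()).upper()

def drop_duplicates(rows, key_fields):
    """Keep the first row seen for each normalised key.

    Returns (kept, rejected) so the rejected rows can be reviewed
    instead of being silently thrown away.
    """
    seen = set()
    kept, rejected = [], []
    for row in rows:
        key = tuple(normalise(row[field]) for field in key_fields)
        if key in seen:
            rejected.append(row)
        else:
            seen.add(key)
            kept.append(row)
    return kept, rejected

# Hypothetical sample data: the second row repeats the first with
# different casing and spacing.
rows = [
    {"name": "Ann Smith", "postcode": "AB1 2CD"},
    {"name": "ann  smith ", "postcode": "ab1 2cd"},
    {"name": "Bob Jones", "postcode": "ZZ9 9ZZ"},
]
kept, rejected = drop_duplicates(rows, ["name", "postcode"])
```

What counts as "the same row" is exactly the subjective part, which is why the key fields and the normalisation rule are passed in or kept separate rather than hard-wired.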
Conforming could be making sure that NI numbers are in the correct format
for the transfer to the presentation layer, or making sure that every
employee has a manager.
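The two conforming checks mentioned here could be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the NI pattern is deliberately simplified (two letters, six digits, a suffix letter A to D) and ignores the official restrictions on prefixes, and the `id` and `manager_id` field names and both helper functions are hypothetical.

```python
import re

# Simplified NI number shape: two letters, six digits, suffix A-D.
# Assumption: the real rules also exclude certain prefixes; not checked here.
NI_PATTERN = re.compile(r"[A-Z]{2}\d{6}[A-D]")

def conform_ni(raw):
    """Return the NI number upper-cased with whitespace removed,
    or None if it does not fit the simplified pattern."""
    candidate = re.sub(r"\s+", "", raw).upper()
    return candidate if NI_PATTERN.fullmatch(candidate) else None

def employees_without_manager(employees):
    """Return the ids of employees whose manager_id is missing
    or does not point at a known employee."""
    known_ids = {emp["id"] for emp in employees}
    return [emp["id"] for emp in employees
            if emp["manager_id"] not in known_ids]

# Hypothetical sample data: employee 1 is their own manager (top of the tree),
# employee 3 has no manager and employee 4 points at a non-existent one.
employees = [
    {"id": 1, "manager_id": 1},
    {"id": 2, "manager_id": 1},
    {"id": 3, "manager_id": None},
    {"id": 4, "manager_id": 99},
]
orphans = employees_without_manager(employees)
clean_ni = conform_ni("ab 12 34 56 c")
bad_ni = conform_ni("12345678")
```

Rows that fail either check would normally be routed to an exceptions table for someone to look at, rather than loaded or dropped.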
I do this through stored procs, T-SQL, VBScript and external exes, all
coded by me. Sure, some of these checks apply to every ETL project you
will work on, but there is still no built-in way to do the validation
out of the box.
In SSIS I can go further in the ETL process with things like the Fuzzy
Lookup and Fuzzy Grouping transformations.
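To give a feel for what a fuzzy lookup does, here is a small Python sketch that matches a dirty value against a reference list. It uses the standard library's `difflib`, which is only a stand-in and not the algorithm SSIS uses; the `fuzzy_lookup` helper, the 0.8 cutoff and the city list are assumptions made up for the example.

```python
import difflib

def fuzzy_lookup(value, reference, cutoff=0.8):
    """Return the closest reference value whose similarity ratio is
    at or above the cutoff, or None if nothing is close enough."""
    # Compare in lower case, but hand back the original spelling.
    by_lower = {ref.lower(): ref for ref in reference}
    matches = difflib.get_close_matches(
        value.strip().lower(), list(by_lower), n=1, cutoff=cutoff
    )
    return by_lower[matches[0]] if matches else None

# Hypothetical reference table of clean values.
reference_cities = ["London", "Manchester", "Birmingham"]

matched = fuzzy_lookup("Manchestr", reference_cities)
unmatched = fuzzy_lookup("Paris", reference_cities)
```

Here the misspelt "Manchestr" resolves to "Manchester", while "Paris" comes back as None because nothing in the reference list is similar enough. Choosing the cutoff is another judgment call: too low and you get false matches, too high and near-misses fall through.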
HTH
Allan Mitchell
www.SQLDTS.com
www.SQLIS.com
www.Konesans.com
Quote:
Hello, Gurus:
I am new to the world of DTS; I am learning different methods of data
import/export, using various tools. I do know and feel that data
should be cleansed and scrubbed before initiating data transfer, using
the various methods. Now, are there system stored procedures that one
can use to validate external data to be imported? Can you give me an
example of an sp_ that is used to clean/scrub external data? Thanks
fellas.