Data Cleansing and Scrubbing

microsoft.public.sqlserver.dts


Dominick D.
 
Data Cleansing and Scrubbing - 05-26-2006, 11:39 AM

Hello, Gurus:

I am new to the world of DTS and am learning different methods of data
import/export using various tools. I believe that data should be cleansed
and scrubbed before initiating a data transfer, whichever method is used.
Are there system stored procedures that one can use to validate external
data before it is imported? Can you give me an example of an sp_ that is
used to clean/scrub external data? Thanks, fellas.

Allan Mitchell
 
Re: Data Cleansing and Scrubbing - 05-26-2006, 12:30 PM

Hello Dominick D.,

Cleansing and scrubbing data is very subjective, so there is no system
stored procedure that will do it for you.

I use the following methodology:

Extract | Cleanse | Conform | Present


Cleansing could be address validation or getting rid of duplicates.
Conforming could be making sure that NI numbers are in the correct format
for the transfer to the presentation layer, or making sure that every
employee has a manager.
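
To make the cleanse and conform steps a bit more concrete, here is a minimal sketch in Python. It is not code from this thread; the row layout, the function names, and the sample data are all hypothetical. It also assumes that "NI number" means a UK National Insurance number and checks only a simplified shape (two letters, six digits, one letter A-D), not the full official rules.

```python
import re

# Hypothetical staged rows, standing in for data already extracted from a source.
rows = [
    {"emp_id": 1, "name": "Ann", "ni_number": "ab 12 34 56 c", "manager_id": 2},
    {"emp_id": 2, "name": "Bob", "ni_number": "CD654321B", "manager_id": 1},
    {"emp_id": 2, "name": "Bob", "ni_number": "CD654321B", "manager_id": 1},  # exact duplicate
    {"emp_id": 3, "name": "Cy", "ni_number": "12345", "manager_id": 1},
    {"emp_id": 4, "name": "Di", "ni_number": "EF111111A", "manager_id": 9},
]

# Assumption: simplified shape check only, not the complete NI number rules.
NI_PATTERN = re.compile(r"^[A-Z]{2}[0-9]{6}[A-D]$")


def cleanse(rows):
    """Cleanse step: drop exact duplicate rows, keeping the first occurrence."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row[column] for column in sorted(row))
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def conform(rows):
    """Conform step: normalise NI numbers, then reject rows that break a rule."""
    known_ids = {row["emp_id"] for row in rows}
    good, rejects = [], []
    for row in rows:
        ni = row["ni_number"].replace(" ", "").upper()
        fixed = dict(row, ni_number=ni)
        if not NI_PATTERN.match(ni):
            rejects.append((fixed, "bad NI number format"))
        elif row["manager_id"] not in known_ids:
            rejects.append((fixed, "no valid manager"))
        else:
            good.append(fixed)
    return good, rejects


good, rejects = conform(cleanse(rows))
```

Here `good` would go on to the presentation layer and `rejects` would go to an error table or file for review. In a DTS package the same rules would more likely live in stored procedures run against a staging table.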

I do this through stored procs, T-SQL, VBScript, and external exes, all
coded by me. Sure, some of these checks apply to every ETL project you
will work on, but there is still nothing provided out of the box that
will validate the data for you.

In SSIS I can go further in the ETL process through things like the Fuzzy
Lookup and Fuzzy Grouping transformations.
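
For a rough idea of what a fuzzy lookup does, here is a small Python sketch. It is only an illustration and is not the algorithm SSIS uses: it swaps in `difflib.get_close_matches` from the Python standard library, and the reference list, the `fuzzy_lookup` name, and the 0.8 cutoff are assumptions made for the example.

```python
import difflib

# Hypothetical reference (lookup) values that incoming data should match.
reference_cities = ["London", "Manchester", "Birmingham", "Edinburgh"]


def fuzzy_lookup(value, reference, cutoff=0.8):
    """Return the closest reference value, or None if nothing is similar enough."""
    # Compare case-insensitively, but return the reference spelling.
    lowered = {name.lower(): name for name in reference}
    matches = difflib.get_close_matches(
        value.strip().lower(), list(lowered), n=1, cutoff=cutoff
    )
    return lowered[matches[0]] if matches else None


print(fuzzy_lookup(" Manchestr ", reference_cities))  # Manchester
print(fuzzy_lookup("Paris", reference_cities))        # None
```

Values that come back as `None` are the ones to route to manual review rather than silently load.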


HTH


Allan Mitchell
www.SQLDTS.com
www.SQLIS.com
www.Konesans.com
