dbTalk Databases Forums  

Data Extraction

comp.databases.olap comp.databases.olap


Discuss Data Extraction in the comp.databases.olap forum.



Reply
 
Thread Tools Display Modes
  #1  
Old   
shaab mohagir
 
Posts: n/a

Default Data Extraction - 08-18-2003 , 08:44 AM






Hello,

The sys admin of the AS/400 (our online system) suggested to prepare
file(s) that contains all the data required by the data warehouse, the
file(s) is then downloaded nightly instead of downloading from
individual tables like the customer, product tables etc..

I personally feel reluctant to go with the prepared file(s) approach,
I think it is redundant plus it adds one extra step that needs to
happen before the downloiad and also makes me so dependent on people
whom aren't in the same office. I also didn't see this approch in any
of the text books about data warehousing which makes me more reluctant
to go with it.

Your comments and suggestions on this matter are very important.

Another question:

Are there best practices for the Data Extracton process, is it simply
initial download of the required tables/fields to the staging are from
the source system, then the rest is nighltly downloads to trace
changes
and do the transformation and loading to the actual DW ?

Thanks,

Reply With Quote
  #2  
Old   
Joerg Narr
 
Posts: n/a

Default Re: Data Extraction - 08-19-2003 , 04:34 AM






"shaab mohagir" <shaab.mohagir (AT) bonbon (DOT) net> schrieb im Newsbeitrag
news:e1b3da1b.0308180544.af4ae22 (AT) posting (DOT) google.com...
Quote:
Hello,

The sys admin of the AS/400 (our online system) suggested to prepare
file(s) that contains all the data required by the data warehouse, the
file(s) is then downloaded nightly instead of downloading from
individual tables like the customer, product tables etc..

I personally feel reluctant to go with the prepared file(s) approach,
I think it is redundant plus it adds one extra step that needs to
happen before the downloiad and also makes me so dependent on people
whom aren't in the same office. I also didn't see this approch in any
of the text books about data warehousing which makes me more reluctant
to go with it.
Hello Shaab,

in practice this method is quite often used to extract data from a source
system. In fact a lot of SAP R/3-Sites use this method to extract data using
ABAP-Code. The decision whether this method is reasonable or not can be made
from the following questions:

1. How many data sources (systems and tables) have to be used?
The more systems/tables you connect the higher the indication towards an ETL
tool.

2. Which transformations need to be applied to the data?
If you have to create a lot of transformations you might be better off with
an ETL tool since these tools provide sophisticated and comfortable tools to
manage this task.

3. Are there sophisticated processes or process chains to be created to
transform and load the data?
Again: If there are a lot of data sources to integrate in a predefined
manner and you want to have full control over the process flow sophisticated
ETL tools can help you manage this.

4. Do you want to have a consistent meta data management?
This is again a question of the overall complexity of your ETL process. If
you need functionality like data lineage (data flow) or impact analysis you
can use an ETL tool. They also have standardized interfaces to data
modelling tools like ERWin or Rational and can exchange meta data.

5. Data Quality management
Some ETL tools provide functionalities to at least check address data.

5. Price
ETL tools usually have very high license cost. You need to check and
calculate for a certain time period if the efforts to maintain a data
warehouse without an ETL tool and the (maybe lower) quality reachable by
this method are higher than the license fees you have to pay.


Quote:
Your comments and suggestions on this matter are very important.

Another question:

Are there best practices for the Data Extracton process, is it simply
initial download of the required tables/fields to the staging are from
the source system, then the rest is nighltly downloads to trace
changes
and do the transformation and loading to the actual DW ?

Thanks,
This highly depends on the scenario. The ETL process for a multinational,
decentral, real-time data warehouse is different from the process for a data
warehouse of a small to medium sized data warehouse. I saw warehouses where
the employees of the company had excellent knowledge of the data structures
in the source systems and the were easily able to maintain the process
without an ETL tool.

Kind regards,

Joerg




Reply With Quote
  #3  
Old   
shaab mohagir
 
Posts: n/a

Default Re: Data Extraction - 08-19-2003 , 09:10 AM



Hello Joerg Narr,

Quote:
in practice this method is quite often used to extract data from a
source
system. In fact a lot of SAP R/3-Sites use this method to extract
data using
ABAP-Code. The decision whether this method is reasonable or not can
be made
from the following questions:
I wasn't asking about whether I want to use ETL tools or not. Why do
you think the decision can be made from these questions,

My original quesion was about prepared files on the online system, and
the prons and cons of this appoach, the system admin brought this
idea. Iinstead of me ODBC into the online system and download data
directly (using whatever tool), he wanted to prepare the data for me,
so instead of going and pulling down the shipment master table for
example, he would prepare some table for me that I can use for my DW.

Thanks


Reply With Quote
  #4  
Old   
Joerg Narr
 
Posts: n/a

Default Re: Data Extraction - 08-19-2003 , 10:48 AM



Hello Shaab,

I misunderstood. Maybe this is the reason why nobody else answered. It is
not a technical issue.

The DBA just doesn't want to give you access to the data base. He wants to
be in control. In this case a best practice is to resolve the problems since
you stated all the pros for a direct access and he stated his argument
(security issues). Of course a "direct" access is a better practice to fill
a DWH - if you don't have security issues.

Kind regards,

Joerg



Reply With Quote
Reply




Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off



Powered by vBulletin Version 3.5.3
Copyright ©2000 - 2012, Jelsoft Enterprises Ltd.