dbTalk Databases Forums  

Datawarehouse->(ETL)->Datamart; Where should ETL be physicallylocated?

comp.databases.oracle.misc comp.databases.oracle.misc


Discuss Datawarehouse->(ETL)->Datamart; Where should ETL be physicallylocated? in the comp.databases.oracle.misc forum.



Reply
 
Thread Tools Display Modes
  #1  
Old   
guser78
 
Posts: n/a

Default Datawarehouse->(ETL)->Datamart; Where should ETL be physicallylocated? - 03-18-2010 , 04:29 AM






In the 'Dependent Data mart' case, there is an ETL layer to extract
the data from the central Data Warehouse to adapt and Load onto the
'Data mart'.

Assume that the 'Data warehouse' and 'Data mart' run on separate
physical servers. In such a case, where does the ETL layer (which is
used to load onto Datamart from Datawarehouse) fit better? Should it
be part of 'Data warehouse' box, or the 'Datamart' box? Why?

Reply With Quote
  #2  
Old   
Carlos
 
Posts: n/a

Default Re: Datawarehouse->(ETL)->Datamart; Where should ETL be physicallylocated? - 03-18-2010 , 09:03 AM






On Mar 18, 11:29*am, guser78 <qazmlp1... (AT) rediffmail (DOT) com> wrote:
Quote:
In the 'Dependent Data mart' case, there is an ETL layer to extract
the data from the central Data Warehouse to adapt and Load onto the
'Data mart'.

Assume that the 'Data warehouse' and 'Data mart' run on separate
physical servers. In such a case, where does the ETL layer (which is
used to load onto Datamart from Datawarehouse) fit better? Should it
be part of 'Data warehouse' box, or the 'Datamart' box? Why?
"Should it be part of 'Data warehouse' box, or the 'Datamart' box?"

Neither of them. It should be on a separate ETL server with the ETL
tools and processes installed on it (and a lot of free HD space too).

HTH.

Cheers.

Carlos.

Reply With Quote
  #3  
Old   
Nithyanandham M
 
Posts: n/a

Default Re: Datawarehouse->(ETL)->Datamart; Where should ETL be physicallylocated? - 03-19-2010 , 03:27 AM



On Mar 18, 8:03*pm, Carlos <miotromailcar... (AT) netscape (DOT) net> wrote:
Quote:
On Mar 18, 11:29*am, guser78 <qazmlp1... (AT) rediffmail (DOT) com> wrote:

In the 'Dependent Data mart' case, there is an ETL layer to extract
the data from the central Data Warehouse to adapt and Load onto the
'Data mart'.

Assume that the 'Data warehouse' and 'Data mart' run on separate
physical servers. In such a case, where does the ETL layer (which is
used to load onto Datamart from Datawarehouse) fit better? Should it
be part of 'Data warehouse' box, or the 'Datamart' box? Why?

"Should it be part of 'Data warehouse' box, or the 'Datamart' box?"

Neither of them. It should be on a separate ETL server with the ETL
tools and processes installed on it (and a lot of free HD space too).

I agree with that. But, what do you choose when you have the option of
having only 2 servers (1->DWH, 1->Datamart, ETL->where?)?

Reply With Quote
  #4  
Old   
guser78
 
Posts: n/a

Default Re: Datawarehouse->(ETL)->Datamart; Where should ETL be physicallylocated? - 03-19-2010 , 03:32 AM



On Mar 18, 8:03*pm, Carlos <miotromailcar... (AT) netscape (DOT) net> wrote:
Quote:
On Mar 18, 11:29*am, guser78 <qazmlp1... (AT) rediffmail (DOT) com> wrote:

In the 'Dependent Data mart' case, there is an ETL layer to extract
the data from the central Data Warehouse to adapt and Load onto the
'Data mart'.

Assume that the 'Data warehouse' and 'Data mart' run on separate
physical servers. In such a case, where does the ETL layer (which is
used to load onto Datamart from Datawarehouse) fit better? Should it
be part of 'Data warehouse' box, or the 'Datamart' box? Why?

"Should it be part of 'Data warehouse' box, or the 'Datamart' box?"

Neither of them. It should be on a separate ETL server with the ETL
tools and processes installed on it (and a lot of free HD space too).
I agree with that. But, what do you do when you have the option of
having only 2 physical servers (1->DWH, 1->Datamart, ETL:where?)?

Reply With Quote
  #5  
Old   
Carlos
 
Posts: n/a

Default Re: Datawarehouse->(ETL)->Datamart; Where should ETL be physicallylocated? - 03-19-2010 , 11:51 AM



On 19 mar, 10:32, guser78 <qazmlp1... (AT) rediffmail (DOT) com> wrote:
Quote:
On Mar 18, 8:03*pm, Carlos <miotromailcar... (AT) netscape (DOT) net> wrote:

On Mar 18, 11:29*am, guser78 <qazmlp1... (AT) rediffmail (DOT) com> wrote:

In the 'Dependent Data mart' case, there is an ETL layer to extract
the data from the central Data Warehouse to adapt and Load onto the
'Data mart'.

Assume that the 'Data warehouse' and 'Data mart' run on separate
physical servers. In such a case, where does the ETL layer (which is
used to load onto Datamart from Datawarehouse) fit better? Should it
be part of 'Data warehouse' box, or the 'Datamart' box? Why?

"Should it be part of 'Data warehouse' box, or the 'Datamart' box?"

Neither of them. It should be on a separate ETL server with the ETL
tools and processes installed on it (and a lot of free HD space too).

I agree with that. But, what do you do when you have the option of
having only 2 physical servers (1->DWH, 1->Datamart, ETL:where?)?
"I agree with that. But, what do you do when you have the option of"
having only 2 physical servers (1->DWH, 1->Datamart, ETL:where?)?"

At a first glance, on the DataMart machine, but it depends. I would
resume it as "...where the harm is smaller" If you put the ETL on the
Datamart machine and the performance decreases dramatically, and you
put the ETL on the DW machine and it can cope with the overhead (DW
performance, general performance, filesystems, etc) then put it there.

But I would avoid to put it on the DW machine (been there, done
that...)

HTH.

Cheers.

Carlos.

Reply With Quote
  #6  
Old   
Geoff Muldoon
 
Posts: n/a

Default Re: Datawarehouse->(ETL)->Datamart; Where should ETL be physically located? - 03-21-2010 , 04:20 PM



@e7g2000yqf.googlegroups.com>, guser78 says...

Quote:
Assume that the 'Data warehouse' and 'Data mart' run on separate
physical servers. In such a case, where does the ETL layer (which is
used to load onto Datamart from Datawarehouse) fit better? Should it
be part of 'Data warehouse' box, or the 'Datamart' box? Why?

"Should it be part of 'Data warehouse' box, or the 'Datamart' box?"

Neither of them. It should be on a separate ETL server with the ETL
tools and processes installed on it (and a lot of free HD space too).

I agree with that. But, what do you do when you have the option of
having only 2 physical servers (1->DWH, 1->Datamart, ETL:where?)?
Given that restriction, I'd usually go "on the target rather than the
source", because some of my ETL processes are designed so that the
transform and load components should only occur if a full extract is
successfully completed, and it's easier to deal with network outages in
these instances if ETL is on the target.

GM

Reply With Quote
Reply




Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off



Powered by vBulletin Version 3.5.3
Copyright ©2000 - 2012, Jelsoft Enterprises Ltd.