Am Thu, 18 Jan 2007 17:49:24 -0800 schrieb Gene Wirchenko:
Quote:
"Robert" <callingrw (AT) yahoo (DOT) com> wrote:
I'm looking for information. Where do people discuss ETL issues? I see
^^^
What is it? |
Extraction, Transformation, Load. In data warehouses, you collect
(*extract*) data from different sources (flat files, production
databases, spreadsheets) with independent, hence different data
representations (so the sex of a person can be m/f in one system , 0/1
in another; system X gives you litres, system X gallons; system A has
percent full, system B has percent free), which have to be
*transformed* to a consolidated form so they can be *loaded* into the
data warehouse.
Many use Perl as a language to do those transformations (as it is
particularly good at finding patterns and transforming strings); there
are also a lot of tools around that let you set the ETL rules using a
GUI.
Joachim