![]() | |
#1
| |||
| |||
|
#2
| ||||
| ||||
|
|
Hi! Today we had a hot discussion about the performance of LOAD vs. IMPORT. I'd like to hear what you have to say: - Should LOAD be faster that IMPORT in every circumstance? - Is IMPORT more suited for small files and LOAD for large files? - Is LOAD more suited for table that have many indexes and IMPORT for tables that have few or no indexes? Best regards, * * * * Kovi |
|
- Should LOAD be faster that IMPORT in every circumstance? I tried a small table with a lot of indexes. A load operation takes |
|
- Is IMPORT more suited for small files and LOAD for large files? I think you need to consider at least the points above before taking |
|
- Is LOAD more suited for table that have many indexes and IMPORT for tables According to my little test on a small table with lots of indexes (15 |
#3
| |||
| |||
|
#4
| |||
| |||
|
|
More indexes will likely hurt import more than load, because the data isn't sorted, and the index inserts are a single row at a time. *Load will be faster with larger files regardless of the number of indexes |
#5
| |||
| |||
|
|
On May 20, 8:29 am, Serge Rielau<srie... (AT) ca (DOT) ibm.com> wrote: More indexes will likely hurt import more than load, because the data isn't sorted, and the index inserts are a single row at a time. Load will be faster with larger files regardless of the number of indexes I've found that indexes are my enemy with the load utility - even though I can load the data in parallel and only do incremental index builds, the index build phase isn't parallel. Rebuilding indexes isn't a great strategy for real-time data warehousing, and I don't yet have enough funding to go to DPF. My current server has a decent number of rather slow CPUs. So, in this scenario I can split a file into 8 individual pieces then import each simultaneously and get better overall performance. Unfortunately. Think I might be missing anything? Just a hunch. You are on DB2 9.1 or Db2 8? I.e. prior to the |
#6
| |||
| |||
|
|
Kovi, Feedback from backstage: Load is faster than import in most of cases. Better to describe cases where import can be faster: 1) Correct about small files. Load has a startup and shutdown cost. 2) Parallel import can catch up and beat load (ie, running lots of parallel import command with the data split already) specifically when XML is in the table. More indexes will likely hurt import more than load, because the data isn't sorted, and the index inserts are a single row at a time. Load will be faster with larger files regardless of the number of indexes |
|
Gregor Kovac | Gregor.Kovac (AT) mikropis (DOT) si | | http://kovica.blogspot.com | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In A World Without Fences Who Needs Gates? | Experience Linux. | -~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~- |
#7
| |||
| |||
|
|
Hi! Thanks for the feedback. I am/was under the impression that IMPORT does a exclusive table lock. How can I do parallel import then? Best regards, * * * * Kovi Serge Rielau wrote: Kovi, Feedback from backstage: Load is faster than import in most of cases. * Better to describe cases where import can be faster: 1) Correct about *small files. *Load has a startup and shutdown cost. 2) Parallel import can catch up and beat load (ie, running lots of parallel import command with the data split already) specifically when XML is in the table. More indexes will likely hurt import more than load, because the data isn't sorted, and the index inserts are a single row at a time. *Load will be faster with larger files regardless of the number of indexes -- -~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~- | Gregor Kovac | * *Gregor.Ko... (AT) mikropis (DOT) si * * *| | * * * * * * *| * *http://kovica.blogspot.com*| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | *In A World Without Fences Who Needs Gates? * | | * * * * * * *Experience Linux. * * * * * * * *| -~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~-~- |
![]() |
| Thread Tools | |
| Display Modes | |
| |