![]() | |
#1
| |||
| |||
|
#2
| |||
| |||
|
|
Is there a limit (within reason) to the maximum modulo of a file for a D3/NT system? I need to resize a file and from my calculations, it should be 300,001 frames in size at 4K per frame. I seem to remember that older Pick systems had a limit to the modulo (with a vague recollection that perhaps 300K was around that limit). No mention of this in Books Online. This file will be in the FSI. thanks |
#3
| |||
| |||
|
|
The only real size limitiation is the 2GB limit for any single item in Windows, at least up to NT. Since a D3 "file" is a single "item" as far as NT is concerned, that's the limit. 300K * 4K is only 1.2GB so you should be safe. Mark Brown "Brian Bond" <brian (AT) XYZXYZXYZXYZbonanzapress (DOT) com> wrote in message news:11crcd216o8an36 (AT) corp (DOT) supernews.com... Is there a limit (within reason) to the maximum modulo of a file for a D3/NT system? I need to resize a file and from my calculations, it should be 300,001 frames in size at 4K per frame. I seem to remember that older Pick systems had a limit to the modulo (with a vague recollection that perhaps 300K was around that limit). No mention of this in Books Online. This file will be in the FSI. thanks |
#4
| |||
| |||
|
|
Mark, are you referring to the memory limit, or maybe FAT16 and not NTFS? According to this page <http://tinyurl.com/3827d> NTFS has a file size limit in the terabyte range. Brian, disk/modulo shouldn't be an issue at all. Just make sure you have enough memory and paging space to accommodate this file in addition to all other system requirements. If all you're concerned about is the modulo, just make the frame size 8k, 16k, or larger and divide the mod by 2 or 4 as appropriate. That's a luxury of FSI not available in a VME, then again we used to have Separation too... It would be very interesting to see some benchmarks for different kinds of data in files that are more "deep" like this rather than "wide". You might have problems indexing files that are this large. Test, Test, and Test before running files like this in a production mode. BTW, - which release of D3NT are you running? - why don't you just e-mail RD Support to get the answer to this? T "Mark Brown" <mbrown (AT) drexelmgt (DOT) com> wrote: The only real size limitiation is the 2GB limit for any single item in Windows, at least up to NT. Since a D3 "file" is a single "item" as far as NT is concerned, that's the limit. 300K * 4K is only 1.2GB so you should be safe. Mark Brown "Brian Bond" <brian (AT) XYZXYZXYZXYZbonanzapress (DOT) com> wrote in message news:11crcd216o8an36 (AT) corp (DOT) supernews.com... Is there a limit (within reason) to the maximum modulo of a file for a D3/NT system? I need to resize a file and from my calculations, it should be 300,001 frames in size at 4K per frame. I seem to remember that older Pick systems had a limit to the modulo (with a vague recollection that perhaps 300K was around that limit). No mention of this in Books Online. This file will be in the FSI. thanks |
#5
| |||
| |||
|
|
All, Thanks for the information. I didn't email RD because I'd just sent them a couple of other questions and didn't feel like being a nuisance <g>. We are running 7.4.4. I am really not concerned about the modulo size per se, I just needed to ensure D3 wouldn't blow up if it was too big. Agreed that I could certainly halve it by doubling the frame size, and while I would expect a performance hit, I doubt it would be enough to matter. Basically, I guess I have several modulo/frame size combinations to work with, each with its own particular combination of speed vs. storage efficiency. And I bet all of them are well within the performance parameters I need (defined as "it works"). I was using the 4K frame simply because it is the default. The file in question has no indexes, just a lot of records that we don't want to purge. And I do not want the file into overflow, as it is now. "Frame size" seems to have replaced "separation" as a file tuning parameter, presumably providing a bit more efficiency due to the lack of links for the frames that formerly comprised a primary group. I did some benchmarks several years back on a UV system (perhaps not completely relevant here, since the internal file structures are different, but since they do both have primary and overflow sections I am guessing there is some pertinence), comparing separations and performance when the file went into overflow. If I can track them down, I'll post them, but in a nutshell, IF you end up with overflow, larger seperations yielded faster select times at the cost of more wasted space; conversely, smaller seperations yielded better space efficiency but were significantly slower for selects when the file is in overflow. There were some other interesting observations as well, but I forget what they were, so perhaps they weren't THAT interesting.... - BB "Tony Gravagno" <g6q3x9lu53001 (AT) sneakemail (DOT) com.invalid> wrote in message news:37hsc1dbcv412f6eadvut14ts8qou2ulna (AT) 4ax (DOT) com... Mark, are you referring to the memory limit, or maybe FAT16 and not NTFS? According to this page <http://tinyurl.com/3827d> NTFS has a file size limit in the terabyte range. Brian, disk/modulo shouldn't be an issue at all. Just make sure you have enough memory and paging space to accommodate this file in addition to all other system requirements. If all you're concerned about is the modulo, just make the frame size 8k, 16k, or larger and divide the mod by 2 or 4 as appropriate. That's a luxury of FSI not available in a VME, then again we used to have Separation too... It would be very interesting to see some benchmarks for different kinds of data in files that are more "deep" like this rather than "wide". You might have problems indexing files that are this large. Test, Test, and Test before running files like this in a production mode. BTW, - which release of D3NT are you running? - why don't you just e-mail RD Support to get the answer to this? T "Mark Brown" <mbrown (AT) drexelmgt (DOT) com> wrote: The only real size limitiation is the 2GB limit for any single item in Windows, at least up to NT. Since a D3 "file" is a single "item" as far as NT is concerned, that's the limit. 300K * 4K is only 1.2GB so you should be safe. Mark Brown "Brian Bond" <brian (AT) XYZXYZXYZXYZbonanzapress (DOT) com> wrote in message news:11crcd216o8an36 (AT) corp (DOT) supernews.com... Is there a limit (within reason) to the maximum modulo of a file for a D3/NT system? I need to resize a file and from my calculations, it should be 300,001 frames in size at 4K per frame. I seem to remember that older Pick systems had a limit to the modulo (with a vague recollection that perhaps 300K was around that limit). No mention of this in Books Online. This file will be in the FSI. thanks |
#6
| |||
| |||
|
|
Creating a file with separation simply acknowledges that there will be overflow, so get ready by pre-assigning the first X layers of overflow frames. When the base frame overflows, the system won't have to go to the pool to carve another frame. It's great for dynamic files (end of period work files) because it saves a lot of time fiddling with the overflow table. Even with a separation of 101, select performance shouldn't be an issue if the data is all in base frames because the search stops at the last segment mark in the group. Mark Brown |
#7
| |||
| |||
|
|
Creating a file with separation simply acknowledges that there will be overflow, so get ready by pre-assigning the first X layers of overflow frames. When the base frame overflows, the system won't have to go to the pool to carve another frame. It's great for dynamic files (end of period work files) because it saves a lot of time fiddling with the overflow table. Even with a separation of 101, select performance shouldn't be an issue if the data is all in base frames because the search stops at the last segment mark in the group. |
|
Mark Brown "Brian Bond" <brian (AT) XYZXYZXYZXYZbonanzapress (DOT) com> wrote in message news:11ctgrm5ctqii6e (AT) corp (DOT) supernews.com... All, Thanks for the information. I didn't email RD because I'd just sent them a couple of other questions and didn't feel like being a nuisance <g>. We are running 7.4.4. I am really not concerned about the modulo size per se, I just needed to ensure D3 wouldn't blow up if it was too big. Agreed that I could certainly halve it by doubling the frame size, and while I would expect a performance hit, I doubt it would be enough to matter. Basically, I guess I have several modulo/frame size combinations to work with, each with its own particular combination of speed vs. storage efficiency. And I bet all of them are well within the performance parameters I need (defined as "it works"). I was using the 4K frame simply because it is the default. The file in question has no indexes, just a lot of records that we don't want to purge. And I do not want the file into overflow, as it is now. "Frame size" seems to have replaced "separation" as a file tuning parameter, presumably providing a bit more efficiency due to the lack of links for the frames that formerly comprised a primary group. I did some benchmarks several years back on a UV system (perhaps not completely relevant here, since the internal file structures are different, but since they do both have primary and overflow sections I am guessing there is some pertinence), comparing separations and performance when the file went into overflow. If I can track them down, I'll post them, but in a nutshell, IF you end up with overflow, larger seperations yielded faster select times at the cost of more wasted space; conversely, smaller seperations yielded better space efficiency but were significantly slower for selects when the file is in overflow. There were some other interesting observations as well, but I forget what they were, so perhaps they weren't THAT interesting.... - BB "Tony Gravagno" <g6q3x9lu53001 (AT) sneakemail (DOT) com.invalid> wrote in message news:37hsc1dbcv412f6eadvut14ts8qou2ulna (AT) 4ax (DOT) com... Mark, are you referring to the memory limit, or maybe FAT16 and not NTFS? According to this page <http://tinyurl.com/3827d> NTFS has a file size limit in the terabyte range. Brian, disk/modulo shouldn't be an issue at all. Just make sure you have enough memory and paging space to accommodate this file in addition to all other system requirements. If all you're concerned about is the modulo, just make the frame size 8k, 16k, or larger and divide the mod by 2 or 4 as appropriate. That's a luxury of FSI not available in a VME, then again we used to have Separation too... It would be very interesting to see some benchmarks for different kinds of data in files that are more "deep" like this rather than "wide". You might have problems indexing files that are this large. Test, Test, and Test before running files like this in a production mode. BTW, - which release of D3NT are you running? - why don't you just e-mail RD Support to get the answer to this? T "Mark Brown" <mbrown (AT) drexelmgt (DOT) com> wrote: The only real size limitiation is the 2GB limit for any single item in Windows, at least up to NT. Since a D3 "file" is a single "item" as far as NT is concerned, that's the limit. 300K * 4K is only 1.2GB so you should be safe. Mark Brown "Brian Bond" <brian (AT) XYZXYZXYZXYZbonanzapress (DOT) com> wrote in message news:11crcd216o8an36 (AT) corp (DOT) supernews.com... Is there a limit (within reason) to the maximum modulo of a file for a D3/NT system? I need to resize a file and from my calculations, it should be 300,001 frames in size at 4K per frame. I seem to remember that older Pick systems had a limit to the modulo (with a vague recollection that perhaps 300K was around that limit). No mention of this in Books Online. This file will be in the FSI. thanks |
#8
| |||
| |||
|
|
That's a luxury of FSI not available in a VME, then again we used to have Separation too... It would be very interesting to see some benchmarks for different kinds of data in files that are more "deep" like this rather than "wide". |
#9
| |||
| |||
|
#10
| |||
| |||
|
|
I found the results of a performance test I conducted serveral years ago. The tests were performed on a UV/NT system, the details of what kind of processor, UV version, etc are lost to time. But I believe that these numbers are still good for showing relative performance and the adverse effects of overflow. I would also expect that they are reasonably relevent for non-UV systems as well. Records used an incrementing "zero-filled" sequential record key (this provides the most even hashing). Records were equally sized; the intent was to squeeze as many records as possible into a group without creating overflow (unless indicated). I do not recall if the records were loading sequentially (which would have created the most efficient physical layout of overflow groups), or if I had scrambled the loading sequence (which would have caused much less physical efficiency in the overflow). But in all liklihood, records were created sequentially, so overflow effects on a production system would be even worse that shown below. "Load time" is the number of seconds it took to create all the records. The worst aspect of this data is that I am not sure if the benchmark times are HH:MM or MM:SS! So make of it what you will. Regardless, the numbers do show just how bad or a preformance hit overflow causes, albeit less on adding records than with selects. I think the select and sselect tests were based on the key, not any attributes, but I could be wrong about this. One test I didn't run and should have, is the performance of partially primary groups. Anyway, I hope this data is of some use. Tests A, B, and C tested various separations of a file without going into overflow. Tests D and E tested overflow of one frame and two frames respectively, and would best be compared to test A. (note: set to fixed width font for best display) test modulo sep record physical data load records sselect select count size size time per group time time A: 20011 4 1700935 40984576 40982528 1345 85 2:35 0:24 B: 40499 2 1700935 41472000 40822440 650 41 2:37 0:23 C: 80996 1 1700935 41480704 41470408 393 21 2:30 N/A D: 10005 4 1700935 41156608 40982520 1162 170 4:25 2:11 E: 4988 8 1700935 61296640 40822440 2320 341 5:36 2:51 N/A = not available (didn't run) The system was rebooted for each test, after the records were loaded into the file and before benchmarking. |
![]() |
| Thread Tools | |
| Display Modes | |
| |