![]() | |
![]() |
| | Thread Tools | Display Modes |
#1
| |||
| |||
|
#2
| |||
| |||
|
|
Basically, unless you're a high power database person who knows the ins and outs of each vendor's products, how to optimize performance (that's $150K/year easy), and can design and administer such systems, I think you're going to have do the research *own* your own. |

#3
| |||
| |||
|
|
For example, I remember the horrors and upsets of REPAIR TABLE statements that took almost a day. That was with a MyISAM table around 40 gigs (I do not remember the size exactly). |
|
This means that I cannot use MyISAM, period, end of story, for something 100 times the size of the above. |
#4
| |||
| |||
|
|
For example, I remember the horrors and upsets of REPAIR TABLE statements that took almost a day. That was with a MyISAM table around 40 gigs (I do not remember the size exactly). Yeah, they do. I'm most of the way through repairing a 57G table which takes about a day and a half. |
|
This means that I cannot use MyISAM, period, end of story, for something 100 times the size of the above. Hmmn. This suggests that you might want to reread the manual. A few moments spent looking at the MySQL manual reveals partitioned tables and MERGE tables, both of which are reasonable ways to divide one logical table into several physical ones. If your data just grows, but the oldest stuff never changes, merged tables may be appropriate, since you can make the older tables packed and read-only. Either way, the individual tables can be much smaller and more tractable. |
|
If your data naturally falls into slices (by numeric value in 5.1, number or string in 5.5), partitioned tables may work, particularly if you can construct your queries to take advantage of partition pruning in your queries. |
#5
| |||
| |||
|
|
So... Can Postgres support a 5-10 TB database with the use pattern stated above? |
#6
| |||
| |||
|
|
Ignoramus5263 <ignoramus5263 (AT) NOSPAM (DOT) 5263.invalid> wrote: So... Can Postgres support a 5-10 TB database with the use pattern stated above? If nobody here gives you a concrete answer, I suggest asking on the pgsql-performance mailing list; it's the sort of question they're used to answering. http://archives.postgresql.org/pgsql-performance/ -M- |
#7
| |||
| |||
|
|
On 2011-09-10, John Levine <johnl (AT) iecc (DOT) com> wrote: For example, I remember the horrors and upsets of REPAIR TABLE statements that took almost a day. That was with a MyISAM table around 40 gigs (I do not remember the size exactly). Yeah, they do. I'm most of the way through repairing a 57G table which takes about a day and a half. Then you know! This means that I cannot use MyISAM, period, end of story, for something 100 times the size of the above. Hmmn. This suggests that you might want to reread the manual. A few moments spent looking at the MySQL manual reveals partitioned tables and MERGE tables, both of which are reasonable ways to divide one logical table into several physical ones. If your data just grows, but the oldest stuff never changes, merged tables may be appropriate, since you can make the older tables packed and read-only. Either way, the individual tables can be much smaller and more tractable. At first, I really liked your idea of a MERGE. It does fit my dataset very well. I had NO idea that it was possible. I thought that I can, essentially, split my data by month. Then I realized that I perform my updates for the more recent items by means of INSERT... ON DUPLICATE KEY UPDATE, and it would not work with MERGE tables. |
#8
| |||
| |||
|
|
A few moments spent looking at the MySQL manual reveals partitioned tables and MERGE tables At first, I really liked your idea of a MERGE. It does fit my dataset very well. |
#9
| |||
| |||
|
|
Ignoramus13208 <ignoramus13208 (AT) NOSPAM (DOT) 13208.invalid> wrote: A few moments spent looking at the MySQL manual reveals partitioned tables and MERGE tables At first, I really liked your idea of a MERGE. It does fit my dataset very well. I doubt that. MERGE is the much older and quite crappy sister of partitioned tables. And since you claimed partitions would not fit, MERGE won't too. Also the MERGE engine is notoriously unstable. I expect to see support for MERGE tables being dropped soon. |
#10
| |||
| |||
|
|
The largest table will consume the most space and will take, perhaps, 200,000,000 rows. |
![]() |
| Thread Tools | |
| Display Modes | |
| |