![]() | |
![]() |
| | Thread Tools | Search this Thread | Display Modes |
#1
| |||
| |||
|
|
-----Original Message----- From: owner-informix-list (AT) iiug (DOT) org [mailto wner-informix-list (AT) iiug (DOT) org] On Behalf Of TBPSent: 19 May 2004 10:42 To: informix-list (AT) iiug (DOT) org Subject: Re: Extremly long checkpoints: How to find the reason and solve the Francisco Roldan wrote: Are you Replicating to other server ? Some time ago I got very long checkpoints in an Informix Server with High Availability Data Replication (HDR) on the primary server. I found out that the reason was an extremely complex query executed in the secondary server (Stand By Server) for generating a report (DSS Reports in OLTP System , not a good idea !! ). Chekpoints for HDR Systems are always Synchronous . It doesn't matter if you configure the system to be Asynchronous (I don't remember the name of the parameter in the Onconfig File), the only thing that really gets Asynchronous are the transactions (No 2-Fase Commit Protocol), the onconfig's parameter should be named TwoFaseCommit instead of the name that I don't remember. Primary Server Always wait an acknowledge message of the other servers for finishing its own checkpoint. If you are not replicating ignore this message, I just wanted to express my frustrating experience with HDR. Enterprise Replication (ER) would solve the problem. Regards snip ... DRINTERVAL -1 (Synchronous) or DRINTERVAL > 0 Asynchronous There appears to be some activity which the checkpoint is dependent on (not the checkpoint itself) which is synchronous; you can see this when the checkpoint completes on the primary but hasn't started / completed on the secondary when DRINTERVAL > 0. Something to do with flushing the physical log buffer on the secondary, and threads in critical section. What was the "extremely complex query executed in the secondary server (Stand By Server) for generating a report (DSS Reports in OLTP System , not a good idea !! )." (I thought that was a nice way to split out DSS from the OLTP primary by putting DSS on the secondary). Did you log a Tech Support case?? Were there a lot of writes involved on the secondary to temp tables? |
#2
| |||
| |||
|
|
Francisco Roldan wrote: Are you Replicating to other server ? Some time ago I got very long checkpoints in an Informix Server with High Availability Data Replication (HDR) on the primary server. I found out that the reason was an extremely complex query executed in the secondary server (Stand By Server) for generating a report (DSS Reports in OLTP System , not a good idea !! ). Chekpoints for HDR Systems are always Synchronous . It doesn't matter if you configure the system to be Asynchronous (I don't remember the name of the parameter in the Onconfig File), the only thing that really gets Asynchronous are the transactions (No 2-Fase Commit Protocol), the onconfig's parameter should be named TwoFaseCommit instead of the name that I don't remember. Primary Server Always wait an acknowledge message of the other servers for finishing its own checkpoint. If you are not replicating ignore this message, I just wanted to express my frustrating experience with HDR. Enterprise Replication (ER) would solve the problem. Regards snip ... DRINTERVAL -1 (Synchronous) or DRINTERVAL > 0 Asynchronous There appears to be some activity which the checkpoint is dependent on (not the checkpoint itself) which is synchronous; you can see this when the checkpoint completes on the primary but hasn't started / completed on the secondary when DRINTERVAL > 0. Something to do with flushing the physical log buffer on the secondary, and threads in critical section. What was the "extremely complex query executed in the secondary server (Stand By Server) for generating a report (DSS Reports in OLTP System , not a good idea !! )." (I thought that was a nice way to split out DSS from the OLTP primary by putting DSS on the secondary). Did you log a Tech Support case?? Were there a lot of writes involved on the secondary to temp tables? |
#3
| |||
| |||
|
|
HDR Systems are not optimized for having the secondary server for DSS . Sounds logic because HDR is oriented for High Availability Systems, => OLTP Systems. In that time the company wanted the primary server for the Production System, and the Secondary Server for up to date Reports . I learned this by the hard way .... I hope this message helps people who are evaluating HDR . |

#4
| |||
| |||
|
|
Francisco Roldan wrote: HDR Systems are not optimized for having the secondary server for DSS . Sounds logic because HDR is oriented for High Availability Systems, => OLTP Systems. In that time the company wanted the primary server for the Production System, and the Secondary Server for up to date Reports . I learned this by the hard way .... I hope this message helps people who are evaluating HDR . I've been working in an environment where customer uses secondary mainly for data extraction (DSS queries and jobs and extraction for DW). There were and still are problems regarding HDR which can cause the situation you mentioned. Some are related to heavy load on the secondary and one is (I think) caused by a (as of recently) known bug, related to critical sections and checkpoint holding on secondary. This situations MUST be considered as BUGS and not as "design orientation". We currently have an open case, under investigation by IBM technical support. I'd also like to point one fact: Around 9.30.UC1 there was a version which forced the same ONCONFIG parameters on primary and secondary for things like BUFFERS, CPUVPs etc. This would inhibit a configuration where the resources were different between primary and secondary. This was eventually considered a bug and corrected. This clearly states that you can use the secondary for different purposes then the primary. The normal and desirable behaviour, in case secondary as too much work is to stop replication and eventually recover. DRPINGTIMEOUT has also this objective. Of course that, if you intend your HDR environment to be mainly a stand-by database, you won't want this to happen, but in that case you shouldn't load the secondary too much... ![]() Regards, Fernando Nunes |
#5
| |||
| |||
|
|
I completely agree with you, this has to be considered as a bug. Checkpoints in a HDR system must be asynchronous/independents between servers. |
![]() |
| Thread Tools | Search this Thread |
| Display Modes | |
| |