![]() | |
![]() |
| | Thread Tools | Display Modes |
#1
| |||
| |||
|
#2
| |||
| |||
|
#3
| |||
| |||
|
|
Hi Mike, I got into some similar things, on AIX though. Here is what it would be worthwhile trying. In ckptmpl.def, I changed this way BSTD: /bin/echo beginning checkpoint to tape %C of %N locations BSDD: /bin/echo beginning checkpoint to disk %C of %N locations ESTD: /bin/echo ending checkpoint to tape %C of %N locations ESDD: /bin/echo ending checkpoint to disk %C of %N locations PSTD: /bin/echo mount tape %M and press return; read foo; WSTD: cd %D; iitar cbf 20 %C . PSDD: /bin/echo executing checkpoint to disk WSDD: cd %D; iitar cf %A . And in $II_SYSTEM/ingres/utility it is iitar which overcomes a tar stupid message that says file has changed size. #!/usr/bin/ksh # # Script to set the exit code on tar # when a file size change event occurs # # Set locale to C so error will be found # LC_ALL=C LANG=C # # Set source directories for commands # TARSOURCE=/usr/bin/tar GREPSOURCE=/usr/bin/grep # # Create file for errors to be written too. # touch /tmp/errorlog # # Start tar command with errors redirected to log file # $TARSOURCE "$@" 2> /tmp/errorlog if [ -f /tmp/errorlog ] ;then $GREPSOURCE "file changed size" /tmp/errorlog echo $? > /tmp/status if [ -f /tmp/status -a /tmp/status = 0 ] ;then exit 0 fi fi exit Maybe it works for you as well. Cheers, Armand -----Original Message----- From: info-ingres-admin (AT) cariboulake (DOT) com [mailto:info-ingres-admin (AT) cariboulake (DOT) com] On Behalf Of Michael Leo Sent: Tuesday, March 15, 2005 5:40 PM To: info-ingres (AT) cariboulake (DOT) com Subject: [Info-ingres] Problem with online checkpoints for r3 and Linux Gang, Working with one of our clients, we have discovered that Ingres r3 online checkpoints on Linux (RH Enterprise 3.0, supported) can occasionally fail if a table's file is extended while it is being "tarred" up. We are using the default checkpoint template, so we aren't doing anything "funny" in cktmpl.def. Good news here is that you really have to work to make this happen. You have to insert a ton of records into a fairly large table while the online checkpoint is backing up that table. Bad news is the checkpoint doesn't appear to be recorded in the config file and doesn't show up in the "infodb" output. The failed "dump" attempt seems to be logged there. Very odd. Here are the scripts we used to do this: ftp://ftp.cariboulake.com/test_olcp/test_olcp.zip You may need to run the script several times to get the error to occur, but thre We used a pretty small system and pretty big system to test this: Small AMD Athon 1GHz single processor, 1.5GB RAM, 70GB IDE disk Small Pentium Xeon 2.3Ghz dual HT processor, 4GB RAM, 160GB USCSI disk Last year we saw this error in Solaris with Ingres II 2.6, which can be mitigated by using the 'correct' version of tar. This was worked through Ingres technical support under CA Startrak Issue 12399005. Here is the output of the checkpoint attempt: Tue Mar 15 18:17:07 2005 CPP: Preparing to checkpoint database: test_olcp Tue Mar 15 18:17:07 2005 CPP: Preparing stall of database, active xact cnt: 0 Tue Mar 15 18:17:07 2005 CPP: Finished stall of database beginning checkpoint to disk /opt/CA/IngresII/ingres/ckp/default/test_olcp of 1 locations Tue Mar 15 18:17:08 2005 CPP: Start checkpoint of location: ii_database to disk: path = '/opt/CA/IngresII/ingres/ckp/default/test_olcp' file = 'c0002001.ckp' executing checkpoint to disk /bin/tar: aaaaaaop.t00: file changed as we read it /bin/tar: Error exit delayed from previous errors Tue Mar 15 18:17:09 2005 E_DM1101_CPP_WRITE_ERROR Error writing checkpoint. Tue Mar 15 18:17:09 2005 E_DM110B_CPP_FAILED Error occurred checkpointing the database. Here is what infodb says: ==================Tue Mar 15 18:19:54 2005 Database Information================= Database : (test_olcp,mal) ID : 0x42376CB8 Collation : default Extents : 5 Last Table Id : 239 Config File Version Id : 0x00060001 Database Version Id : 7 Mode : DDL ALLOWED, ONLINE CHECKPOINT ENABLED Status : VALID,JOURNAL,DUMP,CFG_BACKUP The Database has been Checkpointed. The Database is Journaled. Journals are valid from checkpoint sequence : 1 ----Journal information--------------------------------------------------------- Checkpoint sequence : 2 Journal sequence : 1 Current journal block : 161 Journal block size : 16384 Initial journal size : 4 Target journal size : 512 Last Log Address Journaled : <1110927916:68089:2560 ----Dump information------------------------------------------------------------ Checkpoint sequence : 2 Dump sequence : 1 Current dump block : 0 Dump block size : 16384 Initial dump size : 4 Target dump size : 512 Last Log Address Dumped : <0:0:0 ----Checkpoint History for Journal---------------------------------------------- Date Ckp_sequence First_jnl Last_jnl valid mode ------------------------------------------------------------------------ ---- Tue Mar 15 18:17:06 2005 1 1 1 1 OFFLINE ----Checkpoint History for Dump------------------------------------------------- Date Ckp_sequence First_dmp Last_dmp valid mode ------------------------------------------------------------------------ ---- Tue Mar 15 18:17:07 2005 2 0 0 0 ONLINE ----Cluster Journal History----------------------------------------------------- Node ID Current Journal Current Block Last Log Address ------------------------------------------------------------ None. ----Extent directory------------------------------------------------------------ Location Flags Physical_path ------------------------------------------------------------------ ii_database ROOT,DATA /opt/CA/IngresII/ingres/data/default/test_olcp ii_journal JOURNAL /opt/CA/IngresII/ingres/jnl/default/test_olcp ii_checkpoint CHECKPOINT /opt/CA/IngresII/ingres/ckp/default/test_olcp ii_dump DUMP /opt/CA/IngresII/ingres/dmp/default/test_olcp ii_work WORK /opt/CA/IngresII/ingres/work/default/test_olcp ================================================== ====================== ======== You can see that it doesn't record the status of checkpoint #2, but it does record the dump status of checkpoint #2. Anyone else seen this. Got a better suggestion than using tar for backup on RH 3.0? Cheers, -- Michael Leo Java, J2EE, BEA WebLogic, Caribou Lake LLC Oracle, Open Source, Ingres, mleo (AT) cariboulake (DOT) com Real Enterprise Applications Climate is what we expect. Weather is what we get. - Mark Twain _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres |
#4
| |||
| |||
|
|
Hi Mike, I got into some similar things, on AIX though. Here is what it would be worthwhile trying. In ckptmpl.def, I changed this way BSTD: /bin/echo beginning checkpoint to tape %C of %N locations BSDD: /bin/echo beginning checkpoint to disk %C of %N locations ESTD: /bin/echo ending checkpoint to tape %C of %N locations ESDD: /bin/echo ending checkpoint to disk %C of %N locations PSTD: /bin/echo mount tape %M and press return; read foo; WSTD: cd %D; iitar cbf 20 %C . PSDD: /bin/echo executing checkpoint to disk WSDD: cd %D; iitar cf %A . And in $II_SYSTEM/ingres/utility it is iitar which overcomes a tar stupid message that says file has changed size. #!/usr/bin/ksh # # Script to set the exit code on tar # when a file size change event occurs # # Set locale to C so error will be found # LC_ALL=C LANG=C # # Set source directories for commands # TARSOURCE=/usr/bin/tar GREPSOURCE=/usr/bin/grep # # Create file for errors to be written too. # touch /tmp/errorlog # # Start tar command with errors redirected to log file # $TARSOURCE "$@" 2> /tmp/errorlog if [ -f /tmp/errorlog ] ;then $GREPSOURCE "file changed size" /tmp/errorlog echo $? > /tmp/status if [ -f /tmp/status -a /tmp/status = 0 ] ;then exit 0 fi fi exit Maybe it works for you as well. Cheers, Armand -----Original Message----- From: info-ingres-admin (AT) cariboulake (DOT) com [mailto:info-ingres-admin (AT) cariboulake (DOT) com] On Behalf Of Michael Leo Sent: Tuesday, March 15, 2005 5:40 PM To: info-ingres (AT) cariboulake (DOT) com Subject: [Info-ingres] Problem with online checkpoints for r3 and Linux Gang, Working with one of our clients, we have discovered that Ingres r3 online checkpoints on Linux (RH Enterprise 3.0, supported) can occasionally fail if a table's file is extended while it is being "tarred" up. We are using the default checkpoint template, so we aren't doing anything "funny" in cktmpl.def. Good news here is that you really have to work to make this happen. You have to insert a ton of records into a fairly large table while the online checkpoint is backing up that table. Bad news is the checkpoint doesn't appear to be recorded in the config file and doesn't show up in the "infodb" output. The failed "dump" attempt seems to be logged there. Very odd. Here are the scripts we used to do this: ftp://ftp.cariboulake.com/test_olcp/test_olcp.zip You may need to run the script several times to get the error to occur, but thre We used a pretty small system and pretty big system to test this: Small AMD Athon 1GHz single processor, 1.5GB RAM, 70GB IDE disk Small Pentium Xeon 2.3Ghz dual HT processor, 4GB RAM, 160GB USCSI disk Last year we saw this error in Solaris with Ingres II 2.6, which can be mitigated by using the 'correct' version of tar. This was worked through Ingres technical support under CA Startrak Issue 12399005. Here is the output of the checkpoint attempt: Tue Mar 15 18:17:07 2005 CPP: Preparing to checkpoint database: test_olcp Tue Mar 15 18:17:07 2005 CPP: Preparing stall of database, active xact cnt: 0 Tue Mar 15 18:17:07 2005 CPP: Finished stall of database beginning checkpoint to disk /opt/CA/IngresII/ingres/ckp/default/test_olcp of 1 locations Tue Mar 15 18:17:08 2005 CPP: Start checkpoint of location: ii_database to disk: path = '/opt/CA/IngresII/ingres/ckp/default/test_olcp' file = 'c0002001.ckp' executing checkpoint to disk /bin/tar: aaaaaaop.t00: file changed as we read it /bin/tar: Error exit delayed from previous errors Tue Mar 15 18:17:09 2005 E_DM1101_CPP_WRITE_ERROR Error writing checkpoint. Tue Mar 15 18:17:09 2005 E_DM110B_CPP_FAILED Error occurred checkpointing the database. Here is what infodb says: ==================Tue Mar 15 18:19:54 2005 Database Information================= Database : (test_olcp,mal) ID : 0x42376CB8 Collation : default Extents : 5 Last Table Id : 239 Config File Version Id : 0x00060001 Database Version Id : 7 Mode : DDL ALLOWED, ONLINE CHECKPOINT ENABLED Status : VALID,JOURNAL,DUMP,CFG_BACKUP The Database has been Checkpointed. The Database is Journaled. Journals are valid from checkpoint sequence : 1 ----Journal information--------------------------------------------------------- Checkpoint sequence : 2 Journal sequence : 1 Current journal block : 161 Journal block size : 16384 Initial journal size : 4 Target journal size : 512 Last Log Address Journaled : <1110927916:68089:2560 ----Dump information------------------------------------------------------------ Checkpoint sequence : 2 Dump sequence : 1 Current dump block : 0 Dump block size : 16384 Initial dump size : 4 Target dump size : 512 Last Log Address Dumped : <0:0:0 ----Checkpoint History for Journal---------------------------------------------- Date Ckp_sequence First_jnl Last_jnl valid mode ----------------------------------------------------------------------- - ---- Tue Mar 15 18:17:06 2005 1 1 1 1 OFFLINE ----Checkpoint History for Dump------------------------------------------------- Date Ckp_sequence First_dmp Last_dmp valid mode ----------------------------------------------------------------------- - ---- Tue Mar 15 18:17:07 2005 2 0 0 0 ONLINE ----Cluster Journal History----------------------------------------------------- Node ID Current Journal Current Block Last Log Address ------------------------------------------------------------ None. ----Extent directory------------------------------------------------------------ Location Flags Physical_path ------------------------------------------------------------------ ii_database ROOT,DATA /opt/CA/IngresII/ingres/data/default/test_olcp ii_journal JOURNAL /opt/CA/IngresII/ingres/jnl/default/test_olcp ii_checkpoint CHECKPOINT /opt/CA/IngresII/ingres/ckp/default/test_olcp ii_dump DUMP /opt/CA/IngresII/ingres/dmp/default/test_olcp ii_work WORK /opt/CA/IngresII/ingres/work/default/test_olcp ================================================== ===================== = ======== You can see that it doesn't record the status of checkpoint #2, but it does record the dump status of checkpoint #2. Anyone else seen this. Got a better suggestion than using tar for backup on RH 3.0? Cheers, -- Michael Leo Java, J2EE, BEA WebLogic, Caribou Lake LLC Oracle, Open Source, Ingres, mleo (AT) cariboulake (DOT) com Real Enterprise Applications Climate is what we expect. Weather is what we get. - Mark Twain _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres |
#5
| |||
| |||
|
|
Hi Mike, I got into some similar things, on AIX though. Here is what it would be worthwhile trying. In ckptmpl.def, I changed this way BSTD: /bin/echo beginning checkpoint to tape %C of %N locations BSDD: /bin/echo beginning checkpoint to disk %C of %N locations ESTD: /bin/echo ending checkpoint to tape %C of %N locations ESDD: /bin/echo ending checkpoint to disk %C of %N locations PSTD: /bin/echo mount tape %M and press return; read foo; WSTD: cd %D; iitar cbf 20 %C . PSDD: /bin/echo executing checkpoint to disk WSDD: cd %D; iitar cf %A . And in $II_SYSTEM/ingres/utility it is iitar which overcomes a tar stupid message that says file has changed size. #!/usr/bin/ksh # # Script to set the exit code on tar # when a file size change event occurs # # Set locale to C so error will be found # LC_ALL=C LANG=C # # Set source directories for commands # TARSOURCE=/usr/bin/tar GREPSOURCE=/usr/bin/grep # # Create file for errors to be written too. # touch /tmp/errorlog # # Start tar command with errors redirected to log file # $TARSOURCE "$@" 2> /tmp/errorlog if [ -f /tmp/errorlog ] ;then $GREPSOURCE "file changed size" /tmp/errorlog echo $? > /tmp/status if [ -f /tmp/status -a /tmp/status = 0 ] ;then exit 0 fi fi exit Maybe it works for you as well. Cheers, Armand -----Original Message----- From: info-ingres-admin (AT) cariboulake (DOT) com [mailto:info-ingres-admin (AT) cariboulake (DOT) com] On Behalf Of Michael Leo Sent: Tuesday, March 15, 2005 5:40 PM To: info-ingres (AT) cariboulake (DOT) com Subject: [Info-ingres] Problem with online checkpoints for r3 and Linux Gang, Working with one of our clients, we have discovered that Ingres r3 online checkpoints on Linux (RH Enterprise 3.0, supported) can occasionally fail if a table's file is extended while it is being "tarred" up. We are using the default checkpoint template, so we aren't doing anything "funny" in cktmpl.def. Good news here is that you really have to work to make this happen. You have to insert a ton of records into a fairly large table while the online checkpoint is backing up that table. Bad news is the checkpoint doesn't appear to be recorded in the config file and doesn't show up in the "infodb" output. The failed "dump" attempt seems to be logged there. Very odd. Here are the scripts we used to do this: ftp://ftp.cariboulake.com/test_olcp/test_olcp.zip You may need to run the script several times to get the error to occur, but thre We used a pretty small system and pretty big system to test this: Small AMD Athon 1GHz single processor, 1.5GB RAM, 70GB IDE disk Small Pentium Xeon 2.3Ghz dual HT processor, 4GB RAM, 160GB USCSI disk Last year we saw this error in Solaris with Ingres II 2.6, which can be mitigated by using the 'correct' version of tar. This was worked through Ingres technical support under CA Startrak Issue 12399005. Here is the output of the checkpoint attempt: Tue Mar 15 18:17:07 2005 CPP: Preparing to checkpoint database: test_olcp Tue Mar 15 18:17:07 2005 CPP: Preparing stall of database, active xact cnt: 0 Tue Mar 15 18:17:07 2005 CPP: Finished stall of database beginning checkpoint to disk /opt/CA/IngresII/ingres/ckp/default/test_olcp of 1 locations Tue Mar 15 18:17:08 2005 CPP: Start checkpoint of location: ii_database to disk: path = '/opt/CA/IngresII/ingres/ckp/default/test_olcp' file = 'c0002001.ckp' executing checkpoint to disk /bin/tar: aaaaaaop.t00: file changed as we read it /bin/tar: Error exit delayed from previous errors Tue Mar 15 18:17:09 2005 E_DM1101_CPP_WRITE_ERROR Error writing checkpoint. Tue Mar 15 18:17:09 2005 E_DM110B_CPP_FAILED Error occurred checkpointing the database. Here is what infodb says: ==================Tue Mar 15 18:19:54 2005 Database Information================= Database : (test_olcp,mal) ID : 0x42376CB8 Collation : default Extents : 5 Last Table Id : 239 Config File Version Id : 0x00060001 Database Version Id : 7 Mode : DDL ALLOWED, ONLINE CHECKPOINT ENABLED Status : VALID,JOURNAL,DUMP,CFG_BACKUP The Database has been Checkpointed. The Database is Journaled. Journals are valid from checkpoint sequence : 1 ----Journal information--------------------------------------------------------- Checkpoint sequence : 2 Journal sequence : 1 Current journal block : 161 Journal block size : 16384 Initial journal size : 4 Target journal size : 512 Last Log Address Journaled : <1110927916:68089:2560 ----Dump information------------------------------------------------------------ Checkpoint sequence : 2 Dump sequence : 1 Current dump block : 0 Dump block size : 16384 Initial dump size : 4 Target dump size : 512 Last Log Address Dumped : <0:0:0 ----Checkpoint History for Journal---------------------------------------------- Date Ckp_sequence First_jnl Last_jnl valid mode ------------------------------------------------------------------------ ---- Tue Mar 15 18:17:06 2005 1 1 1 1 OFFLINE ----Checkpoint History for Dump------------------------------------------------- Date Ckp_sequence First_dmp Last_dmp valid mode ------------------------------------------------------------------------ ---- Tue Mar 15 18:17:07 2005 2 0 0 0 ONLINE ----Cluster Journal History----------------------------------------------------- Node ID Current Journal Current Block Last Log Address ------------------------------------------------------------ None. ----Extent directory------------------------------------------------------------ Location Flags Physical_path ------------------------------------------------------------------ ii_database ROOT,DATA /opt/CA/IngresII/ingres/data/default/test_olcp ii_journal JOURNAL /opt/CA/IngresII/ingres/jnl/default/test_olcp ii_checkpoint CHECKPOINT /opt/CA/IngresII/ingres/ckp/default/test_olcp ii_dump DUMP /opt/CA/IngresII/ingres/dmp/default/test_olcp ii_work WORK /opt/CA/IngresII/ingres/work/default/test_olcp ================================================== ====================== ======== You can see that it doesn't record the status of checkpoint #2, but it does record the dump status of checkpoint #2. Anyone else seen this. Got a better suggestion than using tar for backup on RH 3.0? Cheers, -- Michael Leo Java, J2EE, BEA WebLogic, Caribou Lake LLC Oracle, Open Source, Ingres, mleo (AT) cariboulake (DOT) com Real Enterprise Applications Climate is what we expect. Weather is what we get. - Mark Twain _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres |
#6
| |||
| |||
|
|
I think this is what you need TEC317591 Paul -----Original Message----- From: Michael Leo [mailto:mleo (AT) cariboulake (DOT) com] Sent: Wednesday, 16 March 2005 12:04 PM To: Armand Pirvu (yahoo); 'Michael Leo'; info-ingres (AT) cariboulake (DOT) com Subject: RE: [Info-ingres] Problem with online checkpoints for r3 and Linux Armand, That is the perfect solution, except the iitar is not shipped with Ingres II r3 on Linux. Methinks I will need to open a call with CA and see if they can slip that file into an official version so the customer is comfortable it will be supported. Thanks! Mike Leo At 6:03 PM -0600 3/15/05, Armand Pirvu \(yahoo\) wrote: Hi Mike, I got into some similar things, on AIX though. Here is what it would be worthwhile trying. In ckptmpl.def, I changed this way BSTD: /bin/echo beginning checkpoint to tape %C of %N locations BSDD: /bin/echo beginning checkpoint to disk %C of %N locations ESTD: /bin/echo ending checkpoint to tape %C of %N locations ESDD: /bin/echo ending checkpoint to disk %C of %N locations PSTD: /bin/echo mount tape %M and press return; read foo; WSTD: cd %D; iitar cbf 20 %C . PSDD: /bin/echo executing checkpoint to disk WSDD: cd %D; iitar cf %A . And in $II_SYSTEM/ingres/utility it is iitar which overcomes a tar stupid message that says file has changed size. #!/usr/bin/ksh # # Script to set the exit code on tar # when a file size change event occurs # # Set locale to C so error will be found # LC_ALL=C LANG=C # # Set source directories for commands # TARSOURCE=/usr/bin/tar GREPSOURCE=/usr/bin/grep # # Create file for errors to be written too. # touch /tmp/errorlog # # Start tar command with errors redirected to log file # $TARSOURCE "$@" 2> /tmp/errorlog if [ -f /tmp/errorlog ] ;then $GREPSOURCE "file changed size" /tmp/errorlog echo $? > /tmp/status if [ -f /tmp/status -a /tmp/status = 0 ] ;then exit 0 fi fi exit Maybe it works for you as well. Cheers, Armand -----Original Message----- From: info-ingres-admin (AT) cariboulake (DOT) com [mailto:info-ingres-admin (AT) cariboulake (DOT) com] On Behalf Of Michael Leo Sent: Tuesday, March 15, 2005 5:40 PM To: info-ingres (AT) cariboulake (DOT) com Subject: [Info-ingres] Problem with online checkpoints for r3 and Linux Gang, Working with one of our clients, we have discovered that Ingres r3 online checkpoints on Linux (RH Enterprise 3.0, supported) can occasionally fail if a table's file is extended while it is being "tarred" up. We are using the default checkpoint template, so we aren't doing anything "funny" in cktmpl.def. Good news here is that you really have to work to make this happen. You have to insert a ton of records into a fairly large table while the online checkpoint is backing up that table. Bad news is the checkpoint doesn't appear to be recorded in the config file and doesn't show up in the "infodb" output. The failed "dump" attempt seems to be logged there. Very odd. Here are the scripts we used to do this: ftp://ftp.cariboulake.com/test_olcp/test_olcp.zip You may need to run the script several times to get the error to occur, but thre We used a pretty small system and pretty big system to test this: Small AMD Athon 1GHz single processor, 1.5GB RAM, 70GB IDE disk Small Pentium Xeon 2.3Ghz dual HT processor, 4GB RAM, 160GB USCSI disk Last year we saw this error in Solaris with Ingres II 2.6, which can be mitigated by using the 'correct' version of tar. This was worked through Ingres technical support under CA Startrak Issue 12399005. Here is the output of the checkpoint attempt: Tue Mar 15 18:17:07 2005 CPP: Preparing to checkpoint database: test_olcp Tue Mar 15 18:17:07 2005 CPP: Preparing stall of database, active xact cnt: 0 Tue Mar 15 18:17:07 2005 CPP: Finished stall of database beginning checkpoint to disk /opt/CA/IngresII/ingres/ckp/default/test_olcp of 1 locations Tue Mar 15 18:17:08 2005 CPP: Start checkpoint of location: ii_database to disk: path = '/opt/CA/IngresII/ingres/ckp/default/test_olcp' file = 'c0002001.ckp' executing checkpoint to disk /bin/tar: aaaaaaop.t00: file changed as we read it /bin/tar: Error exit delayed from previous errors Tue Mar 15 18:17:09 2005 E_DM1101_CPP_WRITE_ERROR Error writing checkpoint. Tue Mar 15 18:17:09 2005 E_DM110B_CPP_FAILED Error occurred checkpointing the database. Here is what infodb says: ==================Tue Mar 15 18:19:54 2005 Database Information================= Database : (test_olcp,mal) ID : 0x42376CB8 Collation : default Extents : 5 Last Table Id : 239 Config File Version Id : 0x00060001 Database Version Id : 7 Mode : DDL ALLOWED, ONLINE CHECKPOINT ENABLED Status : VALID,JOURNAL,DUMP,CFG_BACKUP The Database has been Checkpointed. The Database is Journaled. Journals are valid from checkpoint sequence : 1 ----Journal information--------------------------------------------------------- Checkpoint sequence : 2 Journal sequence : 1 Current journal block : 161 Journal block size : 16384 Initial journal size : 4 Target journal size : 512 Last Log Address Journaled : <1110927916:68089:2560 ----Dump information------------------------------------------------------------ Checkpoint sequence : 2 Dump sequence : 1 Current dump block : 0 Dump block size : 16384 Initial dump size : 4 Target dump size : 512 Last Log Address Dumped : <0:0:0 ----Checkpoint History for Journal---------------------------------------------- Date Ckp_sequence First_jnl Last_jnl valid mode ------------------------------------------------------------------------ ---- Tue Mar 15 18:17:06 2005 1 1 1 1 OFFLINE ----Checkpoint History for Dump------------------------------------------------- Date Ckp_sequence First_dmp Last_dmp valid mode ------------------------------------------------------------------------ ---- Tue Mar 15 18:17:07 2005 2 0 0 0 ONLINE ----Cluster Journal History----------------------------------------------------- Node ID Current Journal Current Block Last Log Address ------------------------------------------------------------ None. ----Extent directory------------------------------------------------------------ Location Flags Physical_path ------------------------------------------------------------------ ii_database ROOT,DATA /opt/CA/IngresII/ingres/data/default/test_olcp ii_journal JOURNAL /opt/CA/IngresII/ingres/jnl/default/test_olcp ii_checkpoint CHECKPOINT /opt/CA/IngresII/ingres/ckp/default/test_olcp ii_dump DUMP /opt/CA/IngresII/ingres/dmp/default/test_olcp ii_work WORK /opt/CA/IngresII/ingres/work/default/test_olcp ================================================== ====================== ======== You can see that it doesn't record the status of checkpoint #2, but it does record the dump status of checkpoint #2. Anyone else seen this. Got a better suggestion than using tar for backup on RH 3.0? Cheers, -- Michael Leo Java, J2EE, BEA WebLogic, Caribou Lake LLC Oracle, Open Source, Ingres, mleo (AT) cariboulake (DOT) com Real Enterprise Applications Climate is what we expect. Weather is what we get. - Mark Twain _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres |
#7
| |||
| |||
|
|
I think this is what you need TEC317591 Paul -----Original Message----- From: Michael Leo [mailto:mleo (AT) cariboulake (DOT) com] Sent: Wednesday, 16 March 2005 12:04 PM To: Armand Pirvu (yahoo); 'Michael Leo'; info-ingres (AT) cariboulake (DOT) com Subject: RE: [Info-ingres] Problem with online checkpoints for r3 and Linux Armand, That is the perfect solution, except the iitar is not shipped with Ingres II r3 on Linux. Methinks I will need to open a call with CA and see if they can slip that file into an official version so the customer is comfortable it will be supported. Thanks! Mike Leo At 6:03 PM -0600 3/15/05, Armand Pirvu \(yahoo\) wrote: Hi Mike, I got into some similar things, on AIX though. Here is what it would be worthwhile trying. In ckptmpl.def, I changed this way BSTD: /bin/echo beginning checkpoint to tape %C of %N locations BSDD: /bin/echo beginning checkpoint to disk %C of %N locations ESTD: /bin/echo ending checkpoint to tape %C of %N locations ESDD: /bin/echo ending checkpoint to disk %C of %N locations PSTD: /bin/echo mount tape %M and press return; read foo; WSTD: cd %D; iitar cbf 20 %C . PSDD: /bin/echo executing checkpoint to disk WSDD: cd %D; iitar cf %A . And in $II_SYSTEM/ingres/utility it is iitar which overcomes a tar stupid message that says file has changed size. #!/usr/bin/ksh # # Script to set the exit code on tar # when a file size change event occurs # # Set locale to C so error will be found # LC_ALL=C LANG=C # # Set source directories for commands # TARSOURCE=/usr/bin/tar GREPSOURCE=/usr/bin/grep # # Create file for errors to be written too. # touch /tmp/errorlog # # Start tar command with errors redirected to log file # $TARSOURCE "$@" 2> /tmp/errorlog if [ -f /tmp/errorlog ] ;then $GREPSOURCE "file changed size" /tmp/errorlog echo $? > /tmp/status if [ -f /tmp/status -a /tmp/status = 0 ] ;then exit 0 fi fi exit Maybe it works for you as well. Cheers, Armand -----Original Message----- From: info-ingres-admin (AT) cariboulake (DOT) com [mailto:info-ingres-admin (AT) cariboulake (DOT) com] On Behalf Of Michael Leo Sent: Tuesday, March 15, 2005 5:40 PM To: info-ingres (AT) cariboulake (DOT) com Subject: [Info-ingres] Problem with online checkpoints for r3 and Linux Gang, Working with one of our clients, we have discovered that Ingres r3 online checkpoints on Linux (RH Enterprise 3.0, supported) can occasionally fail if a table's file is extended while it is being "tarred" up. We are using the default checkpoint template, so we aren't doing anything "funny" in cktmpl.def. Good news here is that you really have to work to make this happen. You have to insert a ton of records into a fairly large table while the online checkpoint is backing up that table. Bad news is the checkpoint doesn't appear to be recorded in the config file and doesn't show up in the "infodb" output. The failed "dump" attempt seems to be logged there. Very odd. Here are the scripts we used to do this: ftp://ftp.cariboulake.com/test_olcp/test_olcp.zip You may need to run the script several times to get the error to occur, but thre We used a pretty small system and pretty big system to test this: Small AMD Athon 1GHz single processor, 1.5GB RAM, 70GB IDE disk Small Pentium Xeon 2.3Ghz dual HT processor, 4GB RAM, 160GB USCSI disk Last year we saw this error in Solaris with Ingres II 2.6, which can be mitigated by using the 'correct' version of tar. This was worked through Ingres technical support under CA Startrak Issue 12399005. Here is the output of the checkpoint attempt: Tue Mar 15 18:17:07 2005 CPP: Preparing to checkpoint database: test_olcp Tue Mar 15 18:17:07 2005 CPP: Preparing stall of database, active xact cnt: 0 Tue Mar 15 18:17:07 2005 CPP: Finished stall of database beginning checkpoint to disk /opt/CA/IngresII/ingres/ckp/default/test_olcp of 1 locations Tue Mar 15 18:17:08 2005 CPP: Start checkpoint of location: ii_database to disk: path = '/opt/CA/IngresII/ingres/ckp/default/test_olcp' file = 'c0002001.ckp' executing checkpoint to disk /bin/tar: aaaaaaop.t00: file changed as we read it /bin/tar: Error exit delayed from previous errors Tue Mar 15 18:17:09 2005 E_DM1101_CPP_WRITE_ERROR Error writing checkpoint. Tue Mar 15 18:17:09 2005 E_DM110B_CPP_FAILED Error occurred checkpointing the database. Here is what infodb says: ==================Tue Mar 15 18:19:54 2005 Database Information================= Database : (test_olcp,mal) ID : 0x42376CB8 Collation : default Extents : 5 Last Table Id : 239 Config File Version Id : 0x00060001 Database Version Id : 7 Mode : DDL ALLOWED, ONLINE CHECKPOINT ENABLED Status : VALID,JOURNAL,DUMP,CFG_BACKUP The Database has been Checkpointed. The Database is Journaled. Journals are valid from checkpoint sequence : 1 ----Journal information--------------------------------------------------------- Checkpoint sequence : 2 Journal sequence : 1 Current journal block : 161 Journal block size : 16384 Initial journal size : 4 Target journal size : 512 Last Log Address Journaled : <1110927916:68089:2560 ----Dump information------------------------------------------------------------ Checkpoint sequence : 2 Dump sequence : 1 Current dump block : 0 Dump block size : 16384 Initial dump size : 4 Target dump size : 512 Last Log Address Dumped : <0:0:0 ----Checkpoint History for Journal---------------------------------------------- Date Ckp_sequence First_jnl Last_jnl valid mode ------------------------------------------------------------------------ ---- Tue Mar 15 18:17:06 2005 1 1 1 1 OFFLINE ----Checkpoint History for Dump------------------------------------------------- Date Ckp_sequence First_dmp Last_dmp valid mode ------------------------------------------------------------------------ ---- Tue Mar 15 18:17:07 2005 2 0 0 0 ONLINE ----Cluster Journal History----------------------------------------------------- Node ID Current Journal Current Block Last Log Address ------------------------------------------------------------ None. ----Extent directory------------------------------------------------------------ Location Flags Physical_path ------------------------------------------------------------------ ii_database ROOT,DATA /opt/CA/IngresII/ingres/data/default/test_olcp ii_journal JOURNAL /opt/CA/IngresII/ingres/jnl/default/test_olcp ii_checkpoint CHECKPOINT /opt/CA/IngresII/ingres/ckp/default/test_olcp ii_dump DUMP /opt/CA/IngresII/ingres/dmp/default/test_olcp ii_work WORK /opt/CA/IngresII/ingres/work/default/test_olcp ================================================== ====================== ======== You can see that it doesn't record the status of checkpoint #2, but it does record the dump status of checkpoint #2. Anyone else seen this. Got a better suggestion than using tar for backup on RH 3.0? Cheers, -- Michael Leo Java, J2EE, BEA WebLogic, Caribou Lake LLC Oracle, Open Source, Ingres, mleo (AT) cariboulake (DOT) com Real Enterprise Applications Climate is what we expect. Weather is what we get. - Mark Twain _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres |
#8
| |||
| |||
|
|
I think this is what you need TEC317591 Paul -----Original Message----- From: Michael Leo [mailto:mleo (AT) cariboulake (DOT) com] Sent: Wednesday, 16 March 2005 12:04 PM To: Armand Pirvu (yahoo); 'Michael Leo'; info-ingres (AT) cariboulake (DOT) com Subject: RE: [Info-ingres] Problem with online checkpoints for r3 and Linux Armand, That is the perfect solution, except the iitar is not shipped with Ingres II r3 on Linux. Methinks I will need to open a call with CA and see if they can slip that file into an official version so the customer is comfortable it will be supported. Thanks! Mike Leo At 6:03 PM -0600 3/15/05, Armand Pirvu \(yahoo\) wrote: Hi Mike, I got into some similar things, on AIX though. Here is what it would be worthwhile trying. In ckptmpl.def, I changed this way BSTD: /bin/echo beginning checkpoint to tape %C of %N locations BSDD: /bin/echo beginning checkpoint to disk %C of %N locations ESTD: /bin/echo ending checkpoint to tape %C of %N locations ESDD: /bin/echo ending checkpoint to disk %C of %N locations PSTD: /bin/echo mount tape %M and press return; read foo; WSTD: cd %D; iitar cbf 20 %C . PSDD: /bin/echo executing checkpoint to disk WSDD: cd %D; iitar cf %A . And in $II_SYSTEM/ingres/utility it is iitar which overcomes a tar stupid message that says file has changed size. #!/usr/bin/ksh # # Script to set the exit code on tar # when a file size change event occurs # # Set locale to C so error will be found # LC_ALL=C LANG=C # # Set source directories for commands # TARSOURCE=/usr/bin/tar GREPSOURCE=/usr/bin/grep # # Create file for errors to be written too. # touch /tmp/errorlog # # Start tar command with errors redirected to log file # $TARSOURCE "$@" 2> /tmp/errorlog if [ -f /tmp/errorlog ] ;then $GREPSOURCE "file changed size" /tmp/errorlog echo $? > /tmp/status if [ -f /tmp/status -a /tmp/status = 0 ] ;then exit 0 fi fi exit Maybe it works for you as well. Cheers, Armand -----Original Message----- From: info-ingres-admin (AT) cariboulake (DOT) com [mailto:info-ingres-admin (AT) cariboulake (DOT) com] On Behalf Of Michael Leo Sent: Tuesday, March 15, 2005 5:40 PM To: info-ingres (AT) cariboulake (DOT) com Subject: [Info-ingres] Problem with online checkpoints for r3 and Linux Gang, Working with one of our clients, we have discovered that Ingres r3 online checkpoints on Linux (RH Enterprise 3.0, supported) can occasionally fail if a table's file is extended while it is being "tarred" up. We are using the default checkpoint template, so we aren't doing anything "funny" in cktmpl.def. Good news here is that you really have to work to make this happen. You have to insert a ton of records into a fairly large table while the online checkpoint is backing up that table. Bad news is the checkpoint doesn't appear to be recorded in the config file and doesn't show up in the "infodb" output. The failed "dump" attempt seems to be logged there. Very odd. Here are the scripts we used to do this: ftp://ftp.cariboulake.com/test_olcp/test_olcp.zip You may need to run the script several times to get the error to occur, but thre We used a pretty small system and pretty big system to test this: Small AMD Athon 1GHz single processor, 1.5GB RAM, 70GB IDE disk Small Pentium Xeon 2.3Ghz dual HT processor, 4GB RAM, 160GB USCSI disk Last year we saw this error in Solaris with Ingres II 2.6, which can be mitigated by using the 'correct' version of tar. This was worked through Ingres technical support under CA Startrak Issue 12399005. Here is the output of the checkpoint attempt: Tue Mar 15 18:17:07 2005 CPP: Preparing to checkpoint database: test_olcp Tue Mar 15 18:17:07 2005 CPP: Preparing stall of database, active xact cnt: 0 Tue Mar 15 18:17:07 2005 CPP: Finished stall of database beginning checkpoint to disk /opt/CA/IngresII/ingres/ckp/default/test_olcp of 1 locations Tue Mar 15 18:17:08 2005 CPP: Start checkpoint of location: ii_database to disk: path = '/opt/CA/IngresII/ingres/ckp/default/test_olcp' file = 'c0002001.ckp' executing checkpoint to disk /bin/tar: aaaaaaop.t00: file changed as we read it /bin/tar: Error exit delayed from previous errors Tue Mar 15 18:17:09 2005 E_DM1101_CPP_WRITE_ERROR Error writing checkpoint. Tue Mar 15 18:17:09 2005 E_DM110B_CPP_FAILED Error occurred checkpointing the database. Here is what infodb says: ==================Tue Mar 15 18:19:54 2005 Database Information================= Database : (test_olcp,mal) ID : 0x42376CB8 Collation : default Extents : 5 Last Table Id : 239 Config File Version Id : 0x00060001 Database Version Id : 7 Mode : DDL ALLOWED, ONLINE CHECKPOINT ENABLED Status : VALID,JOURNAL,DUMP,CFG_BACKUP The Database has been Checkpointed. The Database is Journaled. Journals are valid from checkpoint sequence : 1 ----Journal information--------------------------------------------------------- Checkpoint sequence : 2 Journal sequence : 1 Current journal block : 161 Journal block size : 16384 Initial journal size : 4 Target journal size : 512 Last Log Address Journaled : <1110927916:68089:2560 ----Dump information------------------------------------------------------------ Checkpoint sequence : 2 Dump sequence : 1 Current dump block : 0 Dump block size : 16384 Initial dump size : 4 Target dump size : 512 Last Log Address Dumped : <0:0:0 ----Checkpoint History for Journal---------------------------------------------- Date Ckp_sequence First_jnl Last_jnl valid mode ------------------------------------------------------------------------ ---- Tue Mar 15 18:17:06 2005 1 1 1 1 OFFLINE ----Checkpoint History for Dump------------------------------------------------- Date Ckp_sequence First_dmp Last_dmp valid mode ------------------------------------------------------------------------ ---- Tue Mar 15 18:17:07 2005 2 0 0 0 ONLINE ----Cluster Journal History----------------------------------------------------- Node ID Current Journal Current Block Last Log Address ------------------------------------------------------------ None. ----Extent directory------------------------------------------------------------ Location Flags Physical_path ------------------------------------------------------------------ ii_database ROOT,DATA /opt/CA/IngresII/ingres/data/default/test_olcp ii_journal JOURNAL /opt/CA/IngresII/ingres/jnl/default/test_olcp ii_checkpoint CHECKPOINT /opt/CA/IngresII/ingres/ckp/default/test_olcp ii_dump DUMP /opt/CA/IngresII/ingres/dmp/default/test_olcp ii_work WORK /opt/CA/IngresII/ingres/work/default/test_olcp ================================================== ====================== ======== You can see that it doesn't record the status of checkpoint #2, but it does record the dump status of checkpoint #2. Anyone else seen this. Got a better suggestion than using tar for backup on RH 3.0? Cheers, -- Michael Leo Java, J2EE, BEA WebLogic, Caribou Lake LLC Oracle, Open Source, Ingres, mleo (AT) cariboulake (DOT) com Real Enterprise Applications Climate is what we expect. Weather is what we get. - Mark Twain _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres _______________________________________________ Info-ingres mailing list Info-ingres (AT) cariboulake (DOT) com http://mailman.cariboulake.com/mailm...py/info-ingres |
#9
| |||
| |||
|
|
Ok, I am curious though. I was under the impression that the writes to the tables were delayed until after the checkpoint was complete. |
![]() |
| Thread Tools | |
| Display Modes | |
| |