![]() | |
![]() |
| | Thread Tools | Display Modes |
#1
| |||
| |||
|
#2
| |||
| |||
|
|
pg_dump: ERROR: unexpected chunk number 0 (expected 1) for toast value 4294879152 pg_dump: SQL command to dump the contents of table "dataset_cache" failed: PQendcopy() failed. pg_dump: Error message from server: ERROR: unexpected chunk number 0 (expected 1) for toast value 4294879152 pg_dump: The command was: COPY public.dataset_cache (checksum, version_no, bind_params, sql_statement, serialized_value, date_created) TO stdout; pg_dump: *** aborted because of error We saw the same message when we tried to cluster the effective table. The problem went away after truncating the table. I've searched the pgsql-bugs archives and found reports of this problem, but haven't seen a solution. Is there a solution to keep this from happening in the future? (Version upgrade maybe?) |
#3
| |||
| |||
|
|
Looks very much like the table was corrupted. Maybe you should try to test your RAM and disks. Not sure how to do that on x86-64 though, unless the test utility at www.memtest86.com has been ported to it. |
#4
| |||
| |||
|
|
Alvaro Herrera <alvherre (AT) alvh (DOT) no-ip.org> 08/10/05 9:03 AM On Mon, Aug 01, 2005 at 06:02:30AM +0100, Aaron Harsh wrote: pg_dump: ERROR: unexpected chunk number 0 (expected 1) for toast value ... Looks very much like the table was corrupted. Maybe you should try to test your RAM and disks. Not sure how to do that on x86-64 though, unless the test utility at www.memtest86.com has been ported to it. |
#5
| |||
| |||
|
|
Alvaro Herrera <alvherre (AT) alvh (DOT) no-ip.org> 08/10/05 9:03 AM On Mon, Aug 01, 2005 at 06:02:30AM +0100, Aaron Harsh wrote: pg_dump: ERROR: unexpected chunk number 0 (expected 1) for toast value ... Looks very much like the table was corrupted. Maybe you should try to test your RAM and disks. Not sure how to do that on x86-64 though, unless the test utility at www.memtest86.com has been ported to it. The server is running off of ECC RAM on a RAID-10 set, so a one-off disk/RAM failure seems unlikely. The server had been running beautifully for 6 months prior to this error, and hasn't been evidencing the problem since, so it seems unlikely that this is due to a bad DIMM or RAID controller. The timing might be a coincidence, but this error happened within a day of our OID counter wrapping around back to 0. (Although Tom Lane mentioned in pgsql-general that he was inclined to consider the timing a coincidence). |
#6
| |||
| |||
|
|
Alvaro Herrera <alvherre (AT) alvh (DOT) no-ip.org> 08/11/05 9:52 AM Anyway I was originally thinking the problem data was 4294879152 (0xFFFEA7B0), not the 0. Have you tried to manually extract the data from the dataset_cache table? You could try figuring out what page contains the bad data, and manually peek into it using pg_filedump. |
![]() |
| Thread Tools | |
| Display Modes | |
| |