List:       postgresql-general
Subject:    Re: WAL File Recovery on Standby Server Stops Before End of WAL Files
From:       Kyotaro Horiguchi <horikyota.ntt () gmail ! com>
Date:       2021-10-29 1:44:46
Message-ID: 20211029.104446.382302278518357208.horikyota.ntt () gmail ! com

At Thu, 28 Oct 2021 13:52:36 +0000, "Ryan, Les" <Les.Ryan@wsp.com> wrote in 
> Hi Kyotaro and Dilip,
> 
> Thank you for getting back to me.
> 
> Kyotaro: I ran pg_waldump and the output was "pg_waldump: fatal: could not
> read file "00000001000004190000005A": read 50 of 8192".  I'm guessing that
> it means that WAL file 00000001000004190000005A is corrupted and that is why
> the recovery process stops there.  Is there any way to fix the file?

There is no way, unless the segment still exists in the primary's
pg_wal. Either your archive storage is unstable or archive_command is
not performing its task reliably enough.
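
As a rough way to look for other damaged segments in the archive, a
sketch like the following could help. It rests on assumptions that are
not stated in this thread: that the archive is a plain directory, that
segments are stored uncompressed under their original names, and that
the cluster uses the default 16 MiB segment size. The function name and
the directory argument are made up for illustration. A short read such
as "read 50 of 8192" is what a truncated file would typically produce,
and a size check can catch that, but it cannot detect damage inside a
full-sized file.

```python
import os
import re
import sys

# Assumption: default 16 MiB WAL segments, archived uncompressed.
WAL_SEGMENT_SIZE = 16 * 1024 * 1024
# WAL segment file names are 24 uppercase hexadecimal digits.
SEGMENT_NAME = re.compile(r"^[0-9A-F]{24}$")


def find_bad_segments(archive_dir, seg_size=WAL_SEGMENT_SIZE):
    """Return (name, size) for each archived segment whose size is not seg_size."""
    bad = []
    for name in sorted(os.listdir(archive_dir)):
        if not SEGMENT_NAME.match(name):
            continue  # skip .history, .backup, .partial and unrelated files
        size = os.path.getsize(os.path.join(archive_dir, name))
        if size != seg_size:
            bad.append((name, size))
    return bad


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python check_archive.py /path/to/archive
    for name, size in find_bad_segments(sys.argv[1]):
        print(f"{name}: {size} bytes, expected {WAL_SEGMENT_SIZE}")
```

Any segment it reports would have to be copied again from the primary's
pg_wal, if it is still there.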

> Dilip: setting the log level to debug2 did not provide any additional
> information.  Here are the log entries:
>
> 2021-10-28 06:51:06.166 MDT [7556] LOG:  restored log file "000000010000041900000059" from archive
> 2021-10-28 06:51:06.464 MDT [7556] DEBUG:  got WAL segment from archive
> 2021-10-28 06:51:06.579 MDT [7556] LOG:  restored log file "00000001000004190000005A" from archive
> 2021-10-28 06:51:06.854 MDT [7556] DEBUG:  got WAL segment from archive
> 2021-10-28 06:51:07.107 MDT [7556] LOG:  consistent recovery state reached at 419/5ABFFFF8
> 2021-10-28 06:51:07.107 MDT [7556] DEBUG:  switched WAL source from archive to stream after failure
> 2021-10-28 06:51:07.109 MDT [7844] LOG:  database system is ready to accept read only connections
> 2021-10-28 06:51:07.152 MDT [7844] DEBUG:  forked new backend, pid=6900 socket=6068
>
> I set the log level to debug5 and here is what I got:
> 
> 2021-10-28 06:25:41.262 MDT [6288] CONTEXT:  WAL redo at 419/5ABFFF60 for Btree/INSERT_LEAF: off 130
> 2021-10-28 06:25:41.262 MDT [6288] DEBUG:  record known xact 33776257 latestObservedXid 33776257
> 2021-10-28 06:25:41.262 MDT [6288] CONTEXT:  WAL redo at 419/5ABFFFA0 for Heap/INSERT: off 95 flags 0x00
> 2021-10-28 06:25:41.262 MDT [6288] LOG:  consistent recovery state reached at 419/5ABFFFF8
> 2021-10-28 06:25:41.263 MDT [6288] DEBUG:  switched WAL source from archive to stream after failure
> 2021-10-28 06:25:41.264 MDT [5512] LOG:  database system is ready to accept read only connections
>
> Does the "switched WAL source from archive to stream after failure"
> indicate a problem with the WAL file?

Actually, the message says that some trouble happened while reading the
file, but that always happens at the end of WAL.  We might need a
somewhat more detailed message about the trouble at higher debug levels.
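
To relate the LSN in those log lines to a segment file, the arithmetic
can be sketched as below. This is only an illustration, assuming the
default 16 MiB segment size and timeline 1 (the leading "00000001" of
the file names in the log); the function names are made up here.

```python
# Assumption: default 16 MiB WAL segment size.
WAL_SEGMENT_SIZE = 16 * 1024 * 1024


def wal_file_name(timeline, lsn, seg_size=WAL_SEGMENT_SIZE):
    """Map an LSN written as 'HI/LO' (hex) to the name of its WAL segment."""
    hi, lo = (int(part, 16) for part in lsn.split("/"))
    segno = lo // seg_size  # segment number within the high 32 bits
    return f"{timeline:08X}{hi:08X}{segno:08X}"


def offset_in_segment(lsn, seg_size=WAL_SEGMENT_SIZE):
    """Return the byte offset of the LSN inside its segment."""
    lo = int(lsn.split("/")[1], 16)
    return lo % seg_size


print(wal_file_name(1, "419/5ABFFFF8"))   # 00000001000004190000005A
print(offset_in_segment("419/5ABFFFF8"))  # 12582904
```

So 419/5ABFFFF8 lies inside 00000001000004190000005A, the same segment
that pg_waldump failed to read.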

> Anyway, it looks like I need to restore the standby server from a new
> backup.  Thank you both for your help.

Unfortunately I think so, too.

> Sincerely,
> -Les

-- 
Kyotaro Horiguchi
NTT Open Source Software Center

