List:       slony1-general
Subject:    Re: [Slony1-general] runaway vacuum
From:       "David Rees" <drees76 () gmail ! com>
Date:       2008-02-21 20:00:10
Message-ID: 72dbd3150802211200v793776f0ucf48d56a3786794 () mail ! gmail ! com

On Thu, Feb 21, 2008 at 9:15 AM, Andrew Sullivan <ajs@crankycanuck.ca> wrote:
> On Thu, Feb 21, 2008 at 08:09:08AM -0800, Craig James wrote:
>  > In a situation like this, some sort of ACTIVE response from Slony would be
>  > nice.  Here's an idea.  When the Slony daemon detects an unrecoverable
>  > error, it should STOP, and send an email to a configurable administration
>  > email address.  Something like this:
>
>  No, no, that should not go in the daemon.  That should go in your monitoring
>  system.  I believe there are Nagios plugins floating about.  They could be
>  smarter, though, particularly about this sort of recoverable/non-recoverable
>  distinction you're mentioning.

Yep, we use Nagios to monitor replication status, and it works quite
well. When things get out of sync (which has actually never happened in
production yet!), we simply go through the slon/pg logs to figure out
what went wrong.
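
For illustration, a Nagios-style lag check could look roughly like the
sketch below. It is not the plugin referred to above. The function name,
the thresholds, and the status text are all made up for the example, and
it assumes the lag in seconds has already been obtained elsewhere (for
instance from the st_lag_time column of the cluster's sl_status view).
The only fixed part is the standard Nagios plugin exit-code convention
(0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```python
# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def classify_lag(lag_seconds, warn=60.0, crit=300.0):
    """Map a replication lag in seconds to (exit_code, status_line).

    warn and crit are hypothetical default thresholds; tune them to
    how far behind a subscriber is allowed to fall.
    """
    if lag_seconds is None or lag_seconds < 0:
        # No usable measurement: report UNKNOWN rather than guessing.
        return UNKNOWN, "SLONY UNKNOWN - no lag value available"
    if lag_seconds >= crit:
        return CRITICAL, "SLONY CRITICAL - lag %.0fs" % lag_seconds
    if lag_seconds >= warn:
        return WARNING, "SLONY WARNING - lag %.0fs" % lag_seconds
    return OK, "SLONY OK - lag %.0fs" % lag_seconds


def main(argv):
    """Hypothetical entry point: the lag in seconds is passed as argv[1]."""
    try:
        lag = float(argv[1])
    except (IndexError, ValueError):
        lag = None
    code, line = classify_lag(lag)
    print(line)   # Nagios shows the first line of plugin output
    return code   # a real plugin would pass this to sys.exit()
```

A wrapper script would call sys.exit(main(sys.argv)) so that Nagios sees
the exit code. Telling recoverable from non-recoverable errors, as
discussed above, would still need the slon logs.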

-Dave