[prev in list] [next in list] [prev in thread] [next in thread] 

List:       bacula-bugs
Subject:    [Bacula-bugs] [bacula 0001714]: mail sending process sometimes gets
From:       Mantis Bug Tracker <nobody () baculabugs ! unixathome ! org>
Date:       2011-03-22 16:42:43
Message-ID: acf55b92e5def14ae5b568cf187b5ee5 () bugs ! bacula ! org
[Download RAW message or body]


Issue 0001714 is now monitored by user mnalis. 
====================================================================== 
http://bugs.bacula.org/view.php?id=1714 
====================================================================== 
Reported By:                mnalis
Assigned To:                
====================================================================== 
Project:                    bacula
Issue ID:                   1714
Category:                   Director
Reproducibility:            sometimes
Severity:                   minor
Priority:                   normal
Status:                     new
====================================================================== 
Date Submitted:             2011-03-22 16:41 GMT
Last Modified:              2011-03-22 16:42 GMT
====================================================================== 
Summary:                    mail sending process sometimes gets killed by bacula
timeout
Description: 
Sometimes, when there are lots of clients being backed up (in my case about a
100), bacula kills some some of the mail processes it spawns (in my case, about
5-20). 

The error looks like:
2011-03-21 02:48:48.936755500 21-Mar 02:48  Message delivery ERROR: Mail program
terminated in error.
2011-03-21 02:48:48.936756500 CMD=/usr/bin/mail -s "Bacula: Backup OK of
forum-fd Incremental" root@localhost
2011-03-21 02:48:48.936757500 ERR=800000f:Child died from signal 15: Termination

repeated 5-20 times for different clients.

The I/O load at the server is somewhat higher when that happens, but not
enormously (loadavg of about 5-6). It happens with bacula bsmtp, as well as
debian squeeze /usr/bin/mail.

I think I've traced it to 120 second timeout in open_mail_pipe(), but even
increasing it to 600 seconds does not seem to help, so it might be some kind of
deadlock when many parallel mail clients are spawned at near same time).

Steps to Reproduce: 
have many clients run (and probably finish) at about the same time, and watch
the stdout output of bacula-director (it does not seem to be logged to bacula
log file)

Additional Information: 
exact bacula version is 5.0.3 with git Branch-5.0 as of 20110301
====================================================================== 

Issue History 
Date Modified    Username       Field                    Change               
====================================================================== 
2011-03-22 16:41 mnalis         New Issue                                    
2011-03-22 16:42 mnalis         Issue Monitored: mnalis                      
======================================================================


------------------------------------------------------------------------------
Enable your software for Intel(R) Active Management Technology to meet the
growing manageability and security demands of your customers. Businesses
are taking advantage of Intel(R) vPro (TM) technology - will your software 
be a part of the solution? Download the Intel(R) Manageability Checker 
today! http://p.sf.net/sfu/intel-dev2devmar
_______________________________________________
Bacula-bugs mailing list
Bacula-bugs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-bugs
[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic