[prev in list] [next in list] [prev in thread] [next in thread] 

List:       lustre-discuss
Subject:    [Lustre-discuss] Parallel fortran program bug.
From:       robin.humble+lustre () anu ! edu ! au (Robin Humble)
Date:       2010-12-30 15:12:22
Message-ID: 20101230151222.GA1159 () grizzly ! cita ! utoronto ! ca
[Download RAW message or body]

On Thu, Dec 23, 2010 at 03:48:50PM +0100, Roy Dragseth wrote:
>On Thursday, December 23, 2010 15:18:13 Rick Grubin wrote:
>> > We have an occasional problem with parallel fortran programs that open
>> > files with status "old" or "unknown" returns errors on open.  This seems
>> Sounds like bug 17545:  https://bugzilla.lustre.org/show_bug.cgi?id=17545
>> The issue is fixed for v1.8.2 and beyond.
>Thanks a lot for your quick reply!  This seems to be it, we will upgrade next 
>week.

if you are using Intel Fortran, then I think your open() failures will
probably continue even with latest Lustre, but at a lower rate. see
  https://bugzilla.lustre.org/show_bug.cgi?id=23978

this bug has flown under the radar a bit as it causes fairly cryptic
app failures, and only Intel fortran hits it with any frequency.
what the user sees usually just looks like a failed open with an oddly
corrupted filename string.

cheers,
robin
--
Dr Robin Humble, HPC Systems Analyst, NCI National Facility


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic