[prev in list] [next in list] [prev in thread] [next in thread] 

List:       lustre-discuss
Subject:    [Lustre-discuss] MDS read-only
From:       wanglu () ihep ! ac ! cn (wanglu)
Date:       2012-10-09 4:23:54
Message-ID: 79D7E25E-1841-43AA-A722-0285AC58EA8E () ihep ! ac ! cn
[Download RAW message or body]

        By the way,  we have also tried to dd the MDT device and mount the replica, \
the problem still exists. Besides, we have not seen any error reported on hardware \
monitor.  It is much more like an ldiskfs error than hardware error.

Lu


? 2012-10-9???12:04? wanglu ???

> Dear all, 
> Two of our MDS have got repeatedly read-only error recently after once e2fsck on \
> lustre 1.8.5.  After the MDT mounted for a while, the kernel will reports errors \
>                 like:
> Oct  8 20:16:44 mainmds kernel: LDISKFS-fs error (device cciss!c0d1): \
> ldiskfs_ext_check_inode: bad header/extent in inode #50736178: invalid magic - \
>                 magic 0, entries 0, max 0(0), depth 0(0)
> Oct  8 20:16:44 mainmds kernel: Aborting journal on device cciss!c0d1-8.
> And make the MDS read-only.  
> This problem has made  about 1PB data, 0.1 billion files unavailable  to access.  \
> We believe there is some structure wrong in the local file system of MDT, so we \
> have tried to use e2fsck to fix it follow the process in lustre manual. However, \
> with the loop always goes like this:  1.  run e2fsck,   fixed or not fixed some \
> errors 2.  mount  MDT, report read-only after some client operations, and the whole \
> system became unusable.  3.  e2fsck again. 
> 
> We have tried with three different version lustre:  1.8.5, 1.8.6, and 1.8.8-wc and \
> their corresponding e2fsprog, the problem still exists.  Currently, We can only use \
> lustre with all the clients mounted in read-only mode,  and tried to copy the whole \
> file system. However, It takes a long period to generate all the directory \
> structure and file list for 0.1 billion files.  
> Can any one give us some suggestions? Thank you very much!
> 
> Lu Wang 
> CC-IHEP
> 
> 
> 
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic