
List:       gpfsug-discuss
Subject:    Re: [gpfsug-discuss] Protection against silent data corruption
From:       Achim Rehor <Achim.Rehor () de ! ibm ! com>
Date:       2022-06-10 8:01:01
Message-ID: da9bc89dec7eeeb201a5f989fde08983c5c3a84c.camel () de ! ibm ! com

[Attachment #2 (text/plain)]

Thanks Stephen,

for clarifying; I misread the initial question. And thanks, Stephan, for raising that \
IDEA. The new address for raising RFEs/IDEAs on GPFS is now: \
https://ibm-sys-storage.ideas.ibm.com/ideas?project=GPFS


--

Mit freundlichen Grüßen / Kind regards

Achim Rehor

-----Original Message-----
From: Stephen Ulmer <ulmer@ulmer.org>
Reply-To: gpfsug main discussion list <gpfsug-discuss@gpfsug.org>
To: gpfsug main discussion list <gpfsug-discuss@gpfsug.org>
Subject: [EXTERNAL] Re: [gpfsug-discuss] Protection against silent data corruption
Date: Thu, 09 Jun 2022 15:47:07 -0400

Just to be clear: any follow-up should be directed to Stephan, who is requesting the \
feature.

I am well aware that Scale does not provide this feature, and was just clarifying \
Stephan's question for Achim, who answered the question with an unrelated reference \
after which Scale support replied to me.

This is also where I notice that for all that is holy, the generated IDEA links point \
to DeveloperWorks and don't even get you to the correct forum thread. Sigh.

--
Stephen



On Jun 9, 2022, at 2:45 PM, IBM Spectrum Scale <scale@us.ibm.com> wrote:


Thanks Stephan.
This will be looked into and accordingly prioritized by the offering manager team. \
In case the IBM team has any further questions on this, we will get back to you.

Regards, The Spectrum Scale (GPFS) team

------------------------------------------------------------------------------------------------------------------
If you feel that your question can benefit other users of Spectrum Scale (GPFS), \
then please post it to the public IBM developerWorks Forum at \
https://www.ibm.com/developerworks/community/forums/html/forum?id=11111111-0000-0000-0000-000000000479.

If your query concerns a potential software error in Spectrum Scale (GPFS) and you \
have an IBM software maintenance contract, please contact 1-800-237-5511 in the \
United States or your local IBM Service Center in other countries.

The forum is informally monitored as time permits and should not be used for priority \
messages to the Spectrum Scale (GPFS) team.


From: "Stephan Graf" <st.graf@fz-juelich.de>
To: <gpfsug-discuss@gpfsug.org>
Date: 09-06-2022 11.31 AM
Subject: [EXTERNAL] Re: [gpfsug-discuss] Protection against silent data corruption
Sent by: "gpfsug-discuss" <gpfsug-discuss-bounces@gpfsug.org>

________________________________



Hi,

I have created an IDEA for it:
https://ibm-sys-storage.ideas.ibm.com/ideas/GPFS-I-851

Stephan


On 08.06.2022 at 20:35, IBM Spectrum Scale wrote:
> Hi Stephen,
> 
> Currently, such a feature is not available in the Spectrum Scale product.
> 
> 
> Regards, The Spectrum Scale (GPFS) team
> 
> From: "Stephen Ulmer" <ulmer@ulmer.org>
> To: "gpfsug main discussion list" <gpfsug-discuss@gpfsug.org>
> Date: 02-06-2022 11.32 PM
> Subject: [EXTERNAL] Re: [gpfsug-discuss] Protection against silent data
> corruption
> Sent by: "gpfsug-discuss" <gpfsug-discuss-bounces@gpfsug.org>
> ------------------------------------------------------------------------
> 
> 
> 
> This only adds a checksum to the NSD wire protocol. The question was
> about detecting data corruption at rest.
> 
> --
> Stephen
> 
> 
> On Jun 2, 2022, at 1:01 PM, Achim Rehor <Achim.Rehor@de.ibm.com> wrote:
>
> Hi Stephan,
>
> there is; see the mmchconfig man page:
> 
> nsdCksumTraditional
> This attribute enables checksum data-integrity checking between a
> traditional NSD client node and its NSD server. Valid values are yes
> and no. The default value is no.
> (Traditional in this context means that the NSD client and server
> are configured with IBM Spectrum Scale rather than with IBM Spectrum
> Scale RAID.
> The latter is a component of IBM Elastic Storage Server (ESS) and of
> IBM GPFS Storage Server (GSS).)
> 
> The checksum procedure detects any corruption by the network of the
> data in the NSD RPCs that are exchanged between the NSD client and the
> server. A checksum error triggers a request to retransmit the message.
> 
> When this attribute is enabled on a client node, the client
> indicates in each of its requests to the server that it is using
> checksums. The server uses checksums only in
> response to client requests in which the indicator is set. A client
> node that accesses a file system that belongs to another cluster can
> use checksums in the same way.
> 
> You can change the value of this attribute for an entire cluster
> without shutting down the mmfsd daemon, or for one or more nodes
> without restarting the nodes.
> 
> Note:
> * Enabling this feature can result in significant I/O performance
> degradation and a considerable increase in CPU usage.
> 
> * To enable checksums for a subset of the nodes in a cluster, issue
> a command like the following one:
> mmchconfig nsdCksumTraditional=yes -i -N <subset-of-nodes>
> 
> The -N flag is valid for this attribute.
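
The detect-and-retransmit behaviour described in the man page excerpt above can be
illustrated with a small sketch. This is not Spectrum Scale code: CRC32, the function
names and the simulated flaky channel are assumptions made purely for illustration,
and the excerpt does not say which checksum algorithm the NSD protocol uses. The
sketch only covers data in flight, which is the distinction Stephen draws in his reply.

```python
import zlib
from typing import Callable, Tuple


def make_flaky_channel(corrupt_first_n: int) -> Callable[[bytes], bytes]:
    """Hypothetical network: corrupts the first `corrupt_first_n` transmissions."""
    state = {"calls": 0}

    def channel(data: bytes) -> bytes:
        state["calls"] += 1
        if data and state["calls"] <= corrupt_first_n:
            # Flip all bits of the first byte to simulate corruption on the wire.
            return bytes([data[0] ^ 0xFF]) + data[1:]
        return data

    return channel


def send_with_retransmit(
    payload: bytes,
    channel: Callable[[bytes], bytes],
    max_attempts: int = 3,
) -> Tuple[bytes, int]:
    """Send payload with a checksum; retransmit when the receiver sees a mismatch.

    Returns the received data and the number of attempts that were needed.
    """
    sent_checksum = zlib.crc32(payload)  # computed by the sender (assumed CRC32)
    for attempt in range(1, max_attempts + 1):
        received = channel(payload)
        if zlib.crc32(received) == sent_checksum:  # verified by the receiver
            return received, attempt
        # Checksum error: fall through and request a retransmission.
    raise IOError("checksum mismatch persisted after %d attempts" % max_attempts)


if __name__ == "__main__":
    data, attempts = send_with_retransmit(b"block of file data", make_flaky_channel(1))
    print(data, attempts)
```

Once the data has been written to disk, nothing in this scheme can tell whether the
stored block later changes, because the checksum is never persisted.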
> 
> --
> Mit freundlichen Grüßen / Kind regards
> 
> Achim Rehor
> 
> Technical Support Specialist Spectrum Scale and ESS (SME)
> Advisory Product Services Professional
> IBM Systems Storage Support - EMEA
>
> Achim.Rehor@de.ibm.com +49-170-4521194
> IBM Deutschland GmbH
> Chairman of the Supervisory Board: Sebastian Krause
> Management: Gregor Pillen (Chairman), Nicole Reimer,
> Gabriele Schwarenthorer, Christine Rupp, Frank Theisen
> Registered office: Ehningen / Registration court: Amtsgericht
> Stuttgart, HRB 14562 / WEEE-Reg.-Nr. DE 99369940
> 
> 
> -----Original Message-----
> From: Stephan Graf <st.graf@fz-juelich.de>
> Reply-To: gpfsug main discussion list <gpfsug-discuss@gpfsug.org>
> To: gpfsug-discuss <gpfsug-discuss@gpfsug.org>
> Subject: [EXTERNAL] [gpfsug-discuss] Protection against silent data corruption
> Date: Thu, 02 Jun 2022 16:31:43 +0200
> 
> Hi,
> 
> I am wondering if there is an option in SS to enable some checking to
> detect silent data corruption.
> 
> From GNR I know that there is end-to-end integrity, so a checksum is
> stored in addition.
>
> The background is that we are facing an issue where, in some files
> (which have data replication = 2), mmrestripefile is reporting that one
> block does not match its copy (the storage cluster is running SS
> without GNR).
> We have validated that the copied block is fine, but the original one is
> broken (and this is what is returned on read access).
> SS right now in our installation is unable to determine which is the
> correct one.
> Is there any option to enable this kind of feature in SS? If not, does
> it make sense to create an "IDEA" for it?
> 
> Stephan
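
The problem described in the question above (two replicas that differ, with no way to
tell which one is right) can be made concrete with a minimal sketch. It does not show
how Spectrum Scale or GNR work internally: SHA-256, the function names and the idea of
keeping one digest per block are assumptions chosen only to illustrate why a stored
checksum is needed to arbitrate between replicas.

```python
import hashlib
from typing import Optional, Sequence


def digest(block: bytes) -> str:
    """Checksum that would be stored alongside the block (SHA-256 is an assumption)."""
    return hashlib.sha256(block).hexdigest()


def pick_good_replica(
    replicas: Sequence[bytes],
    stored_digest: Optional[str] = None,
) -> Optional[int]:
    """Return the index of a replica that can be trusted, or None if undecidable.

    Without a stored digest, replicas can only be compared with each other:
    a mismatch is detectable, but there is no way to say which copy is correct.
    With a stored digest, each replica can be verified independently.
    """
    if stored_digest is None:
        all_equal = all(r == replicas[0] for r in replicas)
        return 0 if all_equal else None
    for index, block in enumerate(replicas):
        if digest(block) == stored_digest:
            return index
    return None


if __name__ == "__main__":
    good = b"A" * 16
    bad = b"B" + good[1:]  # silently corrupted primary copy

    # Comparison only: mismatch detected, but undecidable.
    print(pick_good_replica([bad, good]))
    # With a digest recorded at write time: the second replica is identified as good.
    print(pick_good_replica([bad, good], stored_digest=digest(good)))
```

With replication = 2 and no stored checksum, a majority vote is not possible either,
which matches the situation reported in the question.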
> 
> _______________________________________________
> gpfsug-discuss mailing list
> gpfsug-discuss at gpfsug.org
> http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org
> 

--
Stephan Graf
Juelich Supercomputing Centre

Phone:  +49-2461-61-6578
Fax:    +49-2461-61-6656
E-mail: st.graf@fz-juelich.de
WWW:    http://www.fz-juelich.de/jsc/
---------------------------------------------------------------------------------------------
Forschungszentrum Juelich GmbH
52425 Juelich
Registered office: Juelich
Registered in the commercial register of the Amtsgericht Dueren, No. HR B 3498
Chairman of the Supervisory Board: MinDir Volker Rieke
Board of Directors: Prof. Dr.-Ing. Wolfgang Marquardt (Chairman),
Karsten Beneke (Vice Chairman), Dr. Astrid Lambrecht,
Prof. Dr. Frauke Melchior
---------------------------------------------------------------------------------------------
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at gpfsug.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org


[Attachment #3 (text/html)]

<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: \
after-white-space;" class=""> <div>Thanks Stephen,&nbsp;</div>
<div><br>
</div>
<div>for clarifying; I misread the initial question. And thanks, Stephan, for raising \
that IDEA.<br> The new address for raising RFEs/IDEAs on GPFS is now:&nbsp;<a \
href="https://ibm-sys-storage.ideas.ibm.com/ideas?project=GPFS">https://ibm-sys-storage.ideas.ibm.com/ideas?project=GPFS</a></div>
 <div><br>
</div>
<div><span>
<pre>-- <br></pre>
<div data-evo-paragraph="" class="" style="width: 71ch;" \
data-evo-signature-plain-text-mode=""> Mit freundlichen Grüßen / Kind regards</div>
<div data-evo-paragraph="" class="" style="width: 71ch;"><br>
</div>
<div data-evo-paragraph="" class="" style="width: 71ch;">Achim Rehor</div>
<div data-evo-paragraph="" class="" style="width: 71ch;"><br>
</div>
<div data-evo-paragraph="" class="" style="width: 71ch;">-----Original \
Message-----</div> </span></div>
<div><b>From</b>: Stephen Ulmer &lt;<a \
href="mailto:Stephen%20Ulmer%20%3culmer@ulmer.org%3e">ulmer@ulmer.org</a>&gt;</div> \
<div><b>Reply-To</b>: gpfsug main discussion list &lt;<a \
href="mailto:gpfsug%20main%20discussion%20list%20%3cgpfsug-discuss@gpfsug.org%3e">gpfsug-discuss@gpfsug.org</a>&gt;</div>
 <div><b>To</b>: gpfsug main discussion list &lt;<a \
href="mailto:gpfsug%20main%20discussion%20list%20%3cgpfsug-discuss@gpfsug.org%3e">gpfsug-discuss@gpfsug.org</a>&gt;</div>
 <div><b>Subject</b>: [EXTERNAL] Re: [gpfsug-discuss] Protection against silent data \
corruption</div> <div><b>Date</b>: Thu, 09 Jun 2022 15:47:07 -0400</div>
<div><br>
</div>
Just to be clear: any follow-up should be directed to Stephan, who is requesting the \
feature. <div class=""><br class="">
</div>
<div class="">I am well aware that Scale does not provide this feature, and was just \
clarifying Stephan's question for Achim, who answered the question with an unrelated \
reference after which Scale support replied to me. <div class=""><br class="">
</div>
<div class="">This is also where I notice that for all that is holy, the generated \
IDEA links point to DeveloperWorks and don't even get you to the correct forum \
thread. Sigh.</div> <div class=""><br class="">
<div class="">
<div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: \
start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; \
word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; \
                -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" \
                class="">
--&nbsp;<br class="">
Stephen<br class="">
<br class="">
<br class="">
</div>
</div>
<div><br class="">
<blockquote type="cite" style="margin:0 0 0 .8ex; border-left:2px #729fcf \
solid;padding-left:1ex"> <div class="">On Jun 9, 2022, at 2:45 PM, IBM Spectrum Scale \
&lt;<a href="mailto:scale@us.ibm.com" class="">scale@us.ibm.com</a>&gt; wrote:</div> \
<div><br class="Apple-interchange-newline"> </div>
<div class="">
<div class="">
<p class=""><font size="2" face="sans-serif" class="">Thanks Stephan.</font><br \
class=""> <font size="2" face="sans-serif" class="">This will be looked into and \
accordingly prioritized by the offering manager team. In case the IBM team has any \
further questions on this, we will get back to you.</font><br class=""> <br \
class=""> <font size="2" face="sans-serif" class="">Regards, The Spectrum Scale \
(GPFS) team<br class=""> <br class="">
------------------------------------------------------------------------------------------------------------------<br \
class=""> If you feel that your question can benefit other users of &nbsp;Spectrum \
Scale (GPFS), then please post it to the public IBM developerWorks Forum at <a \
href="https://www.ibm.com/developerworks/community/forums/html/forum?id=11111111-0000-0000-0000-000000000479" \
class="">https://www.ibm.com/developerworks/community/forums/html/forum?id=11111111-0000-0000-0000-000000000479</a>.
 <br class="">
<br class="">
If your query concerns a potential software error in Spectrum Scale (GPFS) and you \
have an IBM software maintenance contract please contact &nbsp;1-800-237-5511 in the \
United States or your local IBM Service Center in other countries. <br class="">
<br class="">
The forum is informally monitored as time permits and should not be used for priority \
messages to the Spectrum Scale (GPFS) team.</font><br class=""> <br class="">
<br \
class=""> <font size="1" color="#5F5F5F" face="sans-serif" class="">From: \
</font><font size="1" face="sans-serif" class="">&quot;Stephan Graf&quot; &lt;<a \
href="mailto:st.graf@fz-juelich.de" class="">st.graf@fz-juelich.de</a>&gt;</font><br \
class=""> <font size="1" color="#5F5F5F" face="sans-serif" class="">To: </font><font \
size="1" face="sans-serif" class="">&lt;<a href="mailto:gpfsug-discuss@gpfsug.org" \
class="">gpfsug-discuss@gpfsug.org</a>&gt;</font><br class=""> <font size="1" \
color="#5F5F5F" face="sans-serif" class="">Date: </font><font size="1" \
face="sans-serif" class="">09-06-2022 11.31 AM</font><br class=""> <font size="1" \
color="#5F5F5F" face="sans-serif" class="">Subject: </font><font size="1" \
face="sans-serif" class="">[EXTERNAL] Re: [gpfsug-discuss] Protection against silent \
data corruption</font><br class=""> <font size="1" color="#5F5F5F" face="sans-serif" \
class="">Sent by: </font><font size="1" face="sans-serif" \
class="">&quot;gpfsug-discuss&quot; &lt;<a \
href="mailto:gpfsug-discuss-bounces@gpfsug.org" \
class="">gpfsug-discuss-bounces@gpfsug.org</a>&gt;</font><br class=""> </p>
<hr width="100%" size="2" align="left" noshade="" style="color:#8091A5; " class="">
<br class="">
<br class="">
<br class="">
<tt class=""><font size="2" class="">Hi,<br class="">
<br class="">
I have created an IDEA for it: <br class="">
</font></tt><tt class=""><font size="2" class=""><a \
href="https://ibm-sys-storage.ideas.ibm.com/ideas/GPFS-I-851" \
class="">https://ibm-sys-storage.ideas.ibm.com/ideas/GPFS-I-851</a></font></tt><tt \
class=""><font size="2" class=""><br class=""> <br class="">
Stephan<br class="">
<br class="">
<br class="">
On 08.06.2022 at 20:35, IBM Spectrum Scale wrote:<br class="">
&gt; Hi Stephen,<br class="">
&gt; <br class="">
&gt; Currently, such a feature is not available in the Spectrum Scale product.<br \
class=""> &gt; <br class="">
&gt; <br class="">
&gt; Regards, The Spectrum Scale (GPFS) team<br class="">
&gt; <br class="">
&gt; From: &quot;Stephen Ulmer&quot; &lt;<a href="mailto:ulmer@ulmer.org" \
class="">ulmer@ulmer.org</a>&gt;<br class=""> &gt; To: &quot;gpfsug main discussion \
list&quot; &lt;<a href="mailto:gpfsug-discuss@gpfsug.org" \
class="">gpfsug-discuss@gpfsug.org</a>&gt;<br class=""> &gt; Date: 02-06-2022 11.32 \
PM<br class=""> &gt; Subject: [EXTERNAL] Re: [gpfsug-discuss] Protection against \
silent data <br class=""> &gt; corruption<br class="">
&gt; Sent by: &quot;gpfsug-discuss&quot; &lt;<a \
href="mailto:gpfsug-discuss-bounces@gpfsug.org" \
class="">gpfsug-discuss-bounces@gpfsug.org</a>&gt;<br class=""> &gt; <br class="">
&gt; ------------------------------------------------------------------------<br \
class=""> &gt; <br class="">
&gt; <br class="">
&gt; <br class="">
&gt; This only adds a checksum to the NSD wire protocol. The question was <br \
class=""> &gt; about detecting data corruption at rest.<br class="">
&gt; <br class="">
&gt; -- <br class="">
&gt; Stephen<br class="">
&gt; <br class="">
&gt; <br class="">
&gt; &nbsp; &nbsp; On Jun 2, 2022, at 1:01 PM, Achim Rehor \
&lt;_Achim.Rehor@de.ibm.com_<br class=""> &gt; &nbsp; &nbsp; &lt;</font></tt><tt \
class=""><font size="2" class=""><a href="mailto:Achim.Rehor@de.ibm.com" \
class="">mailto:Achim.Rehor@de.ibm.com</a></font></tt><tt class=""><font size="2" \
class="">&gt;&gt; wrote:<br class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; Hi Stephan,<br class="">
&gt; <br class="">
&gt; &nbsp; &nbsp; there is; see the mmchconfig man page:<br class="">
&gt; <br class="">
&gt; &nbsp; &nbsp; nsdCksumTraditional<br class="">
&gt; &nbsp; &nbsp; This attribute enables checksum data-integrity checking between \
a<br class=""> &gt; &nbsp; &nbsp; traditional NSD client node and its NSD server. \
Valid values are yes<br class=""> &gt; &nbsp; &nbsp; and no. The default value is \
no.<br class=""> &gt; &nbsp; &nbsp; (Traditional in this context means that the NSD \
client and server<br class=""> &gt; &nbsp; &nbsp; are configured with IBM Spectrum \
Scale rather than with IBM Spectrum<br class=""> &gt; &nbsp; &nbsp; Scale RAID.<br \
class=""> &gt; &nbsp; &nbsp; The latter is a component of IBM Elastic Storage Server \
(ESS) and of<br class=""> &gt; &nbsp; &nbsp; IBM GPFS Storage Server (GSS).)<br \
class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; The checksum procedure detects any corruption by the network of \
the<br class=""> &gt; &nbsp; &nbsp; data in the NSD RPCs that are exchanged between \
the NSD client and the<br class=""> &gt; &nbsp; &nbsp; server. A checksum error \
triggers a request to retransmit the message.<br class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; When this attribute is enabled on a client node, the client<br \
class=""> &gt; &nbsp; &nbsp; indicates in each of its requests to the server that it \
is using<br class=""> &gt; &nbsp; &nbsp; checksums. The server uses checksums only \
in<br class=""> &gt; &nbsp; &nbsp; response to client requests in which the indicator \
is set. A client<br class=""> &gt; &nbsp; &nbsp; node that accesses a file system \
that belongs to another cluster can<br class=""> &gt; &nbsp; &nbsp; use checksums in \
the same way.<br class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; You can change the value of this attribute for an entire \
cluster<br class=""> &gt; &nbsp; &nbsp; without shutting down the mmfsd daemon, or \
for one or more nodes<br class=""> &gt; &nbsp; &nbsp; without restarting the \
nodes.<br class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; Note:<br class="">
&gt; &nbsp; &nbsp; * Enabling this feature can result in significant I/O \
performance<br class=""> &gt; &nbsp; &nbsp; degradation and a considerable increase \
in CPU usage.<br class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; * To enable checksums for a subset of the nodes in a cluster, \
issue<br class=""> &gt; &nbsp; &nbsp; a command like the following one:<br class="">
&gt; &nbsp; &nbsp; &nbsp;&nbsp; &nbsp;mmchconfig nsdCksumTraditional=yes -i -N \
&lt;subset-of-nodes&gt;<br class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; &nbsp;&nbsp; &nbsp;The -N flag is valid for this attribute.<br \
class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; -- <br class="">
&gt; &nbsp; &nbsp; Mit freundlichen Grüßen / Kind regards<br class="">
&gt; <br class="">
&gt; &nbsp; &nbsp; Achim Rehor<br class="">
&gt; <br class="">
&gt; &nbsp; &nbsp; Technical Support Specialist Spectrum Scale and ESS (SME)<br \
class=""> &gt; &nbsp; &nbsp; Advisory Product Services Professional<br class="">
&gt; &nbsp; &nbsp; IBM Systems Storage Support - EMEA<br class="">
&gt; <br class="">
&gt; &nbsp; &nbsp; _Achim.Rehor@de.ibm.com_<br class="">
&gt; &nbsp; &nbsp; &lt;</font></tt><tt class=""><font size="2" class=""><a \
href="mailto:Achim.Rehor@de.ibm.com" \
class="">mailto:Achim.Rehor@de.ibm.com</a></font></tt><tt class=""><font size="2" \
class="">&gt;&nbsp;+49-170-4521194<br class=""> &gt; &nbsp; &nbsp; IBM Deutschland \
GmbH<br class=""> &gt; &nbsp; &nbsp; Chairman of the Supervisory Board: Sebastian \
Krause<br class=""> &gt; &nbsp; &nbsp; Management: Gregor Pillen \
(Chairman), Nicole Reimer,<br class=""> &gt; &nbsp; &nbsp; Gabriele \
Schwarenthorer, Christine Rupp, Frank Theisen<br class=""> &gt; &nbsp; &nbsp; Registered \
office: Ehningen / Registration court: Amtsgericht<br class=""> &gt; &nbsp; \
&nbsp; Stuttgart, HRB 14562 / WEEE-Reg.-Nr. DE 99369940<br class=""> &gt; <br \
class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; -----Original Message-----<br class="">
&gt; &nbsp; &nbsp; *From*: Stephan Graf &lt;_st.graf@fz-juelich.de_<br class="">
&gt; &nbsp; &nbsp; &lt;</font></tt><tt class=""><font size="2" class=""><a \
href="mailto:Stephan%20Graf%20%3cst.graf@fz-juelich.de%3e" \
class="">mailto:Stephan%20Graf%20%3cst.graf@fz-juelich.de%3e</a></font></tt><tt \
class=""><font size="2" class="">&gt;&gt;<br class=""> &gt; &nbsp; &nbsp; *Reply-To*: \
gpfsug main discussion list &lt;_gpfsug-discuss@gpfsug.org_<br class=""> &gt; &nbsp; \
&nbsp; &lt;</font></tt><tt class=""><font size="2" class=""><a \
href="mailto:gpfsug%20main%20discussion%20list%20%3cgpfsug-discuss@gpfsug.org%3e" \
class="">mailto:gpfsug%20main%20discussion%20list%20%3cgpfsug-discuss@gpfsug.org%3e</a></font></tt><tt \
class=""><font size="2" class="">&gt;&gt;<br class=""> &gt; &nbsp; &nbsp; *To*: \
gpfsug-discuss &lt;_gpfsug-discuss@gpfsug.org_<br class=""> &gt; &nbsp; &nbsp; \
&lt;</font></tt><tt class=""><font size="2" class=""><a \
href="mailto:gpfsug-discuss%20%3cgpfsug-discuss@gpfsug.org%3e" \
class="">mailto:gpfsug-discuss%20%3cgpfsug-discuss@gpfsug.org%3e</a></font></tt><tt \
class=""><font size="2" class="">&gt;&gt;<br class=""> &gt; &nbsp; &nbsp; *Subject*: \
[EXTERNAL] [gpfsug-discuss] Protection against silent<br class=""> &gt; &nbsp; &nbsp; \
data corruption<br class=""> &gt; &nbsp; &nbsp; *Date*: Thu, 02 Jun 2022 16:31:43 \
+0200<br class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; Hi,<br class="">
&gt; <br class="">
&gt; &nbsp; &nbsp; I am wondering if there is an option in SS to enable some checking \
to<br class=""> &gt; &nbsp; &nbsp; detect silent data corruption.<br class="">
&gt; <br class="">
&gt; &nbsp; &nbsp; From GNR I know that there is End-to-End integrity. So a checksum \
is<br class=""> &gt; &nbsp; &nbsp; stored in addition.<br class="">
&gt; <br class="">
&gt; &nbsp; &nbsp; The background is that we are facing an issue where in some \
files<br class=""> &gt; &nbsp; &nbsp; (which<br class="">
&gt; &nbsp; &nbsp; have data replication = &nbsp;2) mmrestripefile is reporting \
that one<br class=""> &gt; &nbsp; &nbsp; block does not match its copy (the storage \
cluster is running SS<br class=""> &gt; &nbsp; &nbsp; without GNR).<br class="">
&gt; &nbsp; &nbsp; We have validated that the copied block is fine, but the \
original<br class=""> &gt; &nbsp; &nbsp; one is<br class="">
&gt; &nbsp; &nbsp; broken (and this is what is returned on read access).<br class="">
&gt; &nbsp; &nbsp; SS right now in our installation is unable to determine which is \
the<br class=""> &gt; &nbsp; &nbsp; correct one.<br class="">
&gt; &nbsp; &nbsp; Is there any option to enable this kind of feature in SS? If not, \
does<br class=""> &gt; &nbsp; &nbsp; it make sense to create an &quot;IDEA&quot; for \
it?<br class=""> &gt; <br class="">
&gt; &nbsp; &nbsp; Stephan<br class="">
&gt; <br class="">
&gt; &nbsp; &nbsp; _______________________________________________<br class="">
&gt; &nbsp; &nbsp; gpfsug-discuss mailing list<br class="">
&gt; &nbsp; &nbsp; gpfsug-discuss at _gpfsug.org_ &lt;</font></tt><tt class=""><font \
size="2" class=""><a href="http://gpfsug.org/" \
class="">http://gpfsug.org</a></font></tt><tt class=""><font size="2" \
class="">&gt;<br class=""> &gt; &nbsp; &nbsp; _</font></tt><tt class=""><font \
size="2" class=""><a \
href="http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org_" \
class="">http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org_</a></font></tt><tt \
class=""><font size="2" class=""><br class=""> &gt; &nbsp; &nbsp; &lt;</font></tt><tt \
class=""><font size="2" class=""><a \
href="http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org" \
class="">http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org</a></font></tt><tt \
class=""><font size="2" class="">&gt;<br class=""> &gt; &nbsp; &nbsp; \
_______________________________________________<br class=""> &gt; &nbsp; &nbsp; \
gpfsug-discuss mailing list<br class=""> &gt; &nbsp; &nbsp; gpfsug-discuss at \
_gpfsug.org_ &lt;</font></tt><tt class=""><font size="2" class=""><a \
href="http://gpfsug.org/" class="">http://gpfsug.org</a></font></tt><tt \
class=""><font size="2" class="">&gt;_<br class=""> &gt; &nbsp; &nbsp; \
__</font></tt><tt class=""><font size="2" class=""><a \
href="http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org_" \
class="">http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org_</a></font></tt><tt \
class=""><font size="2" class=""><br class=""> &gt; &nbsp; &nbsp; &lt;</font></tt><tt \
class=""><font size="2" class=""><a \
href="http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org" \
class="">http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org</a></font></tt><tt \
class=""><font size="2" class="">&gt;<br class=""> &gt; <br class="">
&gt; <br class="">
&gt; <br class="">
&gt; <br class="">
&gt; <br class="">
<br class="">
-- <br class="">
Stephan Graf<br class="">
Juelich Supercomputing Centre<br class="">
<br class="">
Phone: &nbsp;+49-2461-61-6578<br class="">
Fax: &nbsp; &nbsp;+49-2461-61-6656<br class="">
<a href="mailto:st.graf@fz-juelich.de" class="">E-mail: st.graf@fz-juelich.de</a><br \
                class="">
WWW: &nbsp; &nbsp;</font></tt><tt class=""><font size="2" class=""><a \
href="http://www.fz-juelich.de/jsc/" \
class="">http://www.fz-juelich.de/jsc/</a></font></tt><tt class=""><font size="2" \
                class=""><br class="">
---------------------------------------------------------------------------------------------<br \
                class="">
---------------------------------------------------------------------------------------------<br \
class=""> Forschungszentrum Juelich GmbH<br class="">
52425 Juelich<br class="">
Sitz der Gesellschaft: Juelich<br class="">
Eingetragen im Handelsregister des Amtsgerichts Dueren Nr. HR B 3498<br class="">
Vorsitzender des Aufsichtsrats: MinDir Volker Rieke<br class="">
Geschaeftsfuehrung: Prof. Dr.-Ing. Wolfgang Marquardt (Vorsitzender),<br class="">
Karsten Beneke (stellv. Vorsitzender), Dr. Astrid Lambrecht,<br class="">
Prof. Dr. Frauke Melchior<br class="">
---------------------------------------------------------------------------------------------<br \
                class="">
---------------------------------------------------------------------------------------------<br \
class=""> [attachment &quot;smime.p7s&quot; deleted by Huzefa H Pancha/India/IBM] \
_______________________________________________<br class=""> gpfsug-discuss mailing \
list<br class=""> gpfsug-discuss at <a href="http://gpfsug.org" \
class="">gpfsug.org</a><br class=""> </font></tt><tt class=""><font size="2" \
class=""><a href="http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org" \
class="">http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org</a></font></tt><tt \
class=""><font size="2" class=""><br class=""> </font></tt><br class="">
<br class="">
<br class="">
<br class="">
</div>
</div>
</blockquote>
</div>
<br class="">
</div>
</div>
<div>_______________________________________________<br>
</div>
<div>gpfsug-discuss mailing list<br>
</div>
<div>gpfsug-discuss at gpfsug.org<br>
</div>
<div><a href="http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org">http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org</a>&nbsp;<br>
 </div>
</body>
</html>



_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at gpfsug.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org

--===============1323868033632235837==--

