[prev in list] [next in list] [prev in thread] [next in thread] 

List:       openmosix-devel
Subject:    [Openmosix-devel] mosix_update_remote_files: not listed
From:       "David Toomey" <dtoomey () rcsi ! ie>
Date:       2005-07-26 11:06:31
Message-ID: B20E514494F77645B9FE20A93B8D3E1F023CE05B () exchstgstaff01 ! rcsi-internal ! ie
[Download RAW message or body]

Hi All

 

I have sent this to the general group but received no response. I hope one
of the developers out there can help.

I have a cluster of 8 nodes all running mosix on a 2.4.26 Gentoo Kernel.
There is a master node from which all commands are executed and NFS is
running on all the nodes. We have a python script running which is causing
the above error but only after hours of running it. At this point the nodes
start to drop off with a panic, one by one over time
"mosix_update_remote_files: not listed". We have noticed that a file that
should have been created by the script appears to be missing. The script has
executed this piece of code thousands of times while it has been running and
we have put checks in to be sure that the file exists before using it later.
Can anyone enlighten me on what this function does and possible reasons we
might get this error? I am not sure at this point if it is a bug with Mosix
or with the script itself. One other point, the master node from which the
script is executed does not seem to fall over which makes me think that the
problem is with Mosix rather than the script. 

 

Thanks

 

Dave Toomey

 


[Attachment #3 (text/html)]

<html xmlns:o="urn:schemas-microsoft-com:office:office" \
xmlns:w="urn:schemas-microsoft-com:office:word" \
xmlns="http://www.w3.org/TR/REC-html40">

<head>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 11 (filtered medium)">
<style>
<!--
 /* Style Definitions */
 p.MsoNormal, li.MsoNormal, div.MsoNormal
	{margin:0cm;
	margin-bottom:.0001pt;
	font-size:12.0pt;
	font-family:"Times New Roman";}
a:link, span.MsoHyperlink
	{color:blue;
	text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
	{color:#606420;
	text-decoration:underline;}
span.EmailStyle17
	{mso-style-type:personal-compose;
	font-family:Arial;
	color:windowtext;}
@page Section1
	{size:612.0pt 792.0pt;
	margin:72.0pt 90.0pt 72.0pt 90.0pt;}
div.Section1
	{page:Section1;}
-->
</style>

</head>

<body lang=EN-GB link=blue vlink="#606420">

<div class=Section1>

<p class=MsoNormal style='text-autospace:none'><font size=2 face="Courier New"><span
style='font-size:10.0pt;font-family:"Courier New"'>Hi \
All<o:p></o:p></span></font></p>

<p class=MsoNormal style='text-autospace:none'><font size=2 face="Courier New"><span
style='font-size:10.0pt;font-family:"Courier \
New"'><o:p>&nbsp;</o:p></span></font></p>

<p class=MsoNormal style='text-autospace:none'><font size=2 face="Courier New"><span
style='font-size:10.0pt;font-family:"Courier New"'>I have sent this to the
general group but received no response. I hope one of the developers out there
can help.<o:p></o:p></span></font></p>

<p class=MsoNormal style='text-autospace:none'><font size=2 face="Courier New"><span
style='font-size:10.0pt;font-family:"Courier New"'>I have a cluster of 8 nodes
all running mosix on a 2.4.26 Gentoo Kernel. There is a master node from which
all commands are executed and NFS is running on all the nodes. We have a python
script running which is causing the above error but only after hours of running
it. At this point the nodes start to drop off with a panic, one by one over
time &#8220;mosix_update_remote_files: not listed&#8221;. We have noticed that
a file that should have been created by the script appears to be missing. The
script has executed this piece of code thousands of times while it has been
running and we have put checks in to be sure that the file exists before using
it later. Can anyone enlighten me on what this function does and possible
reasons we might get this error? I am not sure at this point if it is a bug
with Mosix or with the script itself. One other point, the master node from
which the script is executed does not seem to fall over which makes me think that
the problem is with Mosix rather than the script. <o:p></o:p></span></font></p>

<p class=MsoNormal style='text-autospace:none'><font size=2 face="Courier New"><span
style='font-size:10.0pt;font-family:"Courier \
New"'><o:p>&nbsp;</o:p></span></font></p>

<p class=MsoNormal style='text-autospace:none'><font size=2 face="Courier New"><span
style='font-size:10.0pt;font-family:"Courier \
New"'>Thanks<o:p></o:p></span></font></p>

<p class=MsoNormal style='text-autospace:none'><font size=2 face="Courier New"><span
style='font-size:10.0pt;font-family:"Courier \
New"'><o:p>&nbsp;</o:p></span></font></p>

<p class=MsoNormal style='text-autospace:none'><font size=2 face="Courier New"><span
style='font-size:10.0pt;font-family:"Courier New"'>Dave \
Toomey<o:p></o:p></span></font></p>

<p class=MsoNormal><font size=2 face=Arial><span style='font-size:10.0pt;
font-family:Arial'><o:p>&nbsp;</o:p></span></font></p>

</div>

</body>

</html>


-------------------------------------------------------
SF.Net email is sponsored by: Discover Easy Linux Migration Strategies
from IBM. Find simple to follow Roadmaps, straightforward articles,
informative Webcasts and more! Get everything you need to get up to
speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click
_______________________________________________
openMosix-devel mailing list
openMosix-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/openmosix-devel

[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic