[prev in list] [next in list] [prev in thread] [next in thread] 

List:       beowulf
Subject:    [Beowulf] send back output from local node
From:       Jerry Xu <jerry () oban ! biosc ! lsu ! edu>
Date:       2005-01-21 20:19:14
Message-ID: 1106338754.15340.14.camel () strathmill ! biosc ! lsu ! edu
[Download RAW message or body]

Hi, People:
  I have a small question regarding PBS in beowulf , I setup my walltime
and cput time both 12:00 Hours for my queue. My program itself can run
more than 12 hours and output results periodically. All my nodes are
exact same. But I met situation that some nodes send back results much
much more output than others.Thus it makes my programs very inefficient,
say, I used 32 nodes, but 16 nodes will give me back enough data but 16
nodes only feedback very little results, 
 I know that each computing node actually save the output in local
storage and send back the output to the master node later according to
some protocols (that i donot know). Since these nodes are same, I assume
some nodes hold the results in local and did not send them back. 
  My question is, how can I make sure that all the computing nodes can
send the output that stored in their local storage back to the master
when walltime or cput time is reached...... 

  Is there any people ever met the similar situation and provide some
suggestion?

Thanks,

Jerry

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit \
http://www.beowulf.org/mailman/listinfo/beowulf


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic