[prev in list] [next in list] [prev in thread] [next in thread]
List: lustre-announce
Subject: Re: [Lustre-discuss] MDS & OST Hardware Considerations
From: Donny Cooper <dcooper () atcc ! necsys ! com>
Date: 2003-12-19 15:22:52
[Download RAW message or body]
Thanks for the info Phil. I'll post any of my successes (or failures) with
the 2xGig-E paths into a single 2way OST, in case it's of any interest to
someone else.
Regards,
Donny
On Fri December 19 2003 12:05 am, Phil Schwan wrote:
>Hi Donny--
>
>Donny Cooper wrote:
>>
>> MDS Server Considerations:
>> (1) Any recommended ratio of MDS memory to the number of OSTs.
>> Does it make sense to assume, the greater number of OST's, the more
"muscle"
>> your MDS should have?
>
>The amount of MDS memory is not really related to the number of OSTs,
>no. There are two factors which I would consider in your decision:
>
>- the number of clients
>- the size of your directories, and variety of the load
>
>If you have many hundreds or thousands of clients, the default
>configuration will allow hundreds of thousands of locks to be acquired
>on the scale of the entire cluster. That can be tuned down, but it's
>important to remember that clients can only cache things when they have
>a lock for it -- if they can hold very few locks, they can cache very
>little metadata. So it's worth spending a few hundred extra dollars for
>a couple gigabytes of ram.
>
>If you have large directories (1 million or more files) or a large
>number of directories which see a lot of use, you may benefit
>significantly from having more memory. In an environment where clients
>randomly access one of 10 million files, for example, having extra
>memory for the cache improves performance very significantly.
>
>> OST Server configurations:
>> (1) Any recommended RAM size for the OSTs? Is more better? ...or is there
a
>> point of diminishing return, depending on OST CPU speed and interconnect
>> bandwidth?
>
>Today the OSTs use memory to manage locks (as in the MDS) and for the
>read cache. We do not write through the page cache, so memory size
>should have no effect on write performance.
>
>My opinion is that memory on the OST is useful primarily for locks,
>unless your load will benefit significantly from read caching.
>
>> (2) Any recommended CPU speed for the OST, based on the interconnect type?
>> > (3) Could a dual-processor server be logically used as 2 OST
>stripes, if 2
>> interconnect paths are fed into the system, and the output bandwidth is 2x
of
>> the input? (Like 2x Gig-E in ==> 1x 2GBps Fibre out) Should is scale
>> "similar" to 2 uni-processor systems?
>
>Our experience is that the TCP stack consumes a lot of CPU. We have not
>made any serious attempts yet at 2xGige in/2 Gbit out, but we do have a
>rule of thumb:
>
>1/3 CPU for network
>1/3 CPU for disk backend
>1/3 CPU for Lustre
>
>If you can remain within these constraints, I think it will be ok. If
>you want to make a serious attempt for a production site, then I predict
>that you will need some support beyond what the mailing list can provide.
>
>It is worth noting that Elan in/2 Gbps out works very well, and goes at
>~92% of the raw speed of the disk. I think that the networking issues
>will be the hard ones, not at all the fibrechannel issues.
>
>Hope this helps--
>
>-Phil
>
>
--
Donny Cooper
NEC Solutions (America), Inc.
Advanced Technical Computing Center
ph: +1-281-465-1506
email: Donny.Cooper@NECsam.com
_______________________________________________
Lustre-discuss mailing list
Lustre-discuss@lists.clusterfs.com
https://lists.clusterfs.com/mailman/listinfo/lustre-discuss
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic