LKML Archive on lore.kernel.org help / color / mirror / Atom feed
From: Andrew Morton <akpm@linux-foundation.org> To: Herbert Poetzl <herbert@13thfloor.at> Cc: "Eric W. Biederman" <ebiederm@xmission.com>, containers@lists.osdl.org, linux-kernel@vger.kernel.org, Dave Hansen <hansendc@us.ibm.com> Subject: Re: Linux-VServer example results for sharing vs. separate mappings ... Date: Sat, 24 Mar 2007 20:29:51 -0800 [thread overview] Message-ID: <20070324202951.38f7187a.akpm@linux-foundation.org> (raw) In-Reply-To: <20070325022156.GA12442@MAIL.13thfloor.at> On Sun, 25 Mar 2007 04:21:56 +0200 Herbert Poetzl <herbert@13thfloor.at> wrote: > > a) slice the machine into 128 fake NUMA nodes, use each node as the > > basic block of memory allocation, manage the binding between these > > memory hunks and process groups with cpusets. > > 128 sounds a little small to me, considering that we > already see 300+ Guests on older machines .... > (or am I missing something here?) Yes, you're missing something very significant. I'm talking about resource management (ie: partitioning) and you're talking about virtual servers. They're different applications, with quite a lot in common. For resource management, a few fives or tens of containers is probably an upper bound. An impementation needs to address both requirements. > > This is what google are testing, and it works. > > > > b) Create a new memory abstraction, call it the "software zone", > > which is mostly decoupled from the present "hardware zones". Most of > > the MM is reworked to use "software zones". The "software zones" are > > runtime-resizeable, and obtain their pages via some means from the > > hardware zones. A container uses a software zone. > > > > c) Something else, similar to the above. Various schemes can be > > envisaged, it isn't terribly important for this discussion. > > for me, the most natural approach is the one with > the least impact and smallest number of changes > in the (granted quite complex) system: leave > everything as is, from the 'entire system' point > of view, and do adjustments and decisions with the > additional Guest/Context information in mind ... > > e.g. if we decide to reclaim pages, and the 'normal' > mechanism would end up with 100 'equal' candidates, > the Guest badness can be a good additional criterion > to decide which pages get thrown out ... > > OTOH, the Guest status should never control the > entire system behaviour in a way which harms the > overall performance or resource efficiency On the contrary - if one container exceeds its allotted resource, we want the processes in that container to bear the majority of the cost of that. Ideally, all of the cost. > > > All doable, if we indeed have a demonstrable problem > > which needs to be addressed. > > all in all I seem to be missing the 'original problem' > which basically forces us to do all those things you > describe instead of letting the Linux Memory System > work as it works right now and just get the accounting > right ... The VM presently cannot satisfy resource management requirements, because piggy activity from one job will impact the performance of all other jobs. > > > note that the 'frowned upon' accounting Linux-VServer > > > does seems to work for those cases quite fine .. here > > > the relevant accounting/limits for three guests, the > > > first two unified and started in strict sequence, the > > > third one completely separate > > > > > > Limit current min/max soft/hard hits > > > VM: 41739 0/ 64023 -1/ -1 0 > > > RSS: 8073 0/ 9222 -1/ -1 0 > > > ANON: 3110 0/ 3405 -1/ -1 0 > > > RMAP: 4960 0/ 5889 -1/ -1 0 > > > SHM: 7138 0/ 7138 -1/ -1 0 > > > > > > Limit current min/max soft/hard hits > > > VM: 41738 0/ 64163 -1/ -1 0 > > > RSS: 8058 0/ 9383 -1/ -1 0 > > > ANON: 3108 0/ 3505 -1/ -1 0 > > > RMAP: 4950 0/ 5912 -1/ -1 0 > > > SHM: 7138 0/ 7138 -1/ -1 0 > > > > > > Limit current min/max soft/hard hits > > > VM: 41738 0/ 63912 -1/ -1 0 > > > RSS: 8050 0/ 9211 -1/ -1 0 > > > ANON: 3104 0/ 3399 -1/ -1 0 > > > RMAP: 4946 0/ 5885 -1/ -1 0 > > > SHM: 7138 0/ 7138 -1/ -1 0 > > > > Sorry, I tend to go to sleep when presented with rows and rows of > > numbers. Sure, it's good to show the data but I much prefer it if the > > sender can tell us what the data means: the executive summary. > > sorry, I'm more the technical person and I hate > 'executive summaries' and similar stuff, but the > message is simple and clear: accouting works even > for shared/unified guests, all three guests show > reasonably similar values ... I don't see "accounting" as being useful for resource managment. I mean, so we have a bunch of numbers - so what? The problem is: what do we do when the jobs in a container exceed their allotment? With zone-based physical containers we already have pretty much all the accounting we need, in the existing per-zone accounting.
next prev parent reply other threads:[~2007-03-25 4:30 UTC|newest] Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top 2007-03-23 19:30 Linux-VServer example results for sharing vs. separate mappings Herbert Poetzl 2007-03-24 5:42 ` Andrew Morton 2007-03-24 18:38 ` Herbert Poetzl 2007-03-24 20:19 ` Andrew Morton 2007-03-25 2:21 ` Herbert Poetzl 2007-03-25 4:29 ` Andrew Morton [this message] 2007-03-25 14:40 ` Herbert Poetzl 2007-03-28 1:22 ` Ethan Solomita 2007-03-25 9:50 ` Balbir Singh 2007-03-25 18:51 ` Andrew Morton 2007-03-26 2:36 ` Balbir Singh 2007-03-26 5:26 ` Andrew Morton 2007-03-26 6:05 ` Balbir Singh 2007-03-26 9:26 ` [Devel] " Kirill Korotaev [not found] <81XMo-Fl-19@gated-at.bofh.it> [not found] ` <827iC-6DZ-7@gated-at.bofh.it> [not found] ` <82jjS-89s-11@gated-at.bofh.it> [not found] ` <82l2j-2lv-11@gated-at.bofh.it> [not found] ` <82qEC-2yM-9@gated-at.bofh.it> [not found] ` <83v9d-3kf-1@gated-at.bofh.it> 2007-03-29 7:24 ` Bodo Eggert
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=20070324202951.38f7187a.akpm@linux-foundation.org \ --to=akpm@linux-foundation.org \ --cc=containers@lists.osdl.org \ --cc=ebiederm@xmission.com \ --cc=hansendc@us.ibm.com \ --cc=herbert@13thfloor.at \ --cc=linux-kernel@vger.kernel.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).