LKML Archive on lore.kernel.org help / color / mirror / Atom feed
From: Vlastimil Babka <vbabka@suse.cz> To: "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>, "Kirill A. Shutemov" <kirill@shutemov.name> Cc: Dave Hansen <dave.hansen@intel.com>, Mel Gorman <mgorman@suse.de>, linux-mm@kvack.org, Minchan Kim <minchan@kernel.org>, Andrew Morton <akpm@linux-foundation.org>, linux-kernel@vger.kernel.org, linux-api@vger.kernel.org, linux-man@vger.kernel.org, Hugh Dickins <hughd@google.com> Subject: Re: MADV_DONTNEED semantics? Was: [RFC PATCH] mm: madvise: Ignore repeated MADV_DONTNEED hints Date: Wed, 04 Feb 2015 14:46:00 +0100 [thread overview] Message-ID: <54D22298.3040504@suse.cz> (raw) In-Reply-To: <54D0F56A.9050003@gmail.com> On 02/03/2015 05:20 PM, Michael Kerrisk (man-pages) wrote: > Hello Vlastimil > > Thanks for CCing me into this thread. NP > On 02/03/2015 12:42 PM, Vlastimil Babka wrote: >> On 02/03/2015 11:53 AM, Kirill A. Shutemov wrote: >>> On Tue, Feb 03, 2015 at 09:19:15AM +0100, Vlastimil Babka wrote: >>> >>> It doesn't skip. It fails with -EINVAL. Or I miss something. >> >> No, I missed that. Thanks for pointing out. The manpage also explains EINVAL in >> this case: >> >> * The application is attempting to release locked or shared pages (with >> MADV_DONTNEED). > > Yes, there is that. But the page could be more explicit when discussing > MADV_DONTNEED in the main text. I've done that. > >> - that covers mlocking ok, not sure if the rest fits the "shared pages" case >> though. I dont see any check for other kinds of shared pages in the code. > > Agreed. "shared" here seems confused. I've removed it. And I've > added mention of "Huge TLB pages" for this error. > Thanks. >>>> - The word "will result" did sound as a guarantee at least to me. So here it >>>> could be changed to "may result (unless the advice is ignored)"? >>> >>> It's too late to fix documentation. Applications already depends on the >>> beheviour. >> >> Right, so as long as they check for EINVAL, it should be safe. It appears that >> jemalloc does. > > So, first a brief question: in the cases where the call does not error out, > are we agreed that in the current implementation, MADV_DONTNEED will > always result in zero-filled pages when the region is faulted back in > (when we consider pages that are not backed by a file)? I'd agree at this point. Also we should probably mention anonymously shared pages (shmem). I think they behave the same as file here. >> I still wouldnt be sure just by reading the man page that the clearing is >> guaranteed whenever I dont get an error return value, though, > > I'm not quite sure what you want here. I mean: if there's an error, I was just reiterating that the guarantee is not clear from if you consider all the statements in the man page. > then the DONTNEED action didn't occur, right? Therefore, there won't > be zero-filled pages. But, for what it's worth, I added "If the > operation succeeds" at the start of that sentence beginning "Subsequent > accesses...". Yes, that should clarify it. Thanks! > Now, some history, explaining why the page is a bit of a mess, > and for that matter why I could really use more help on it from MM > folk (especially in the form of actual patches [1], rather than notes > about deficiencies in the documentation), because: > > ***I simply cannot keep up with all of the details***. I see, and expected it would be like this. I would just send patch if the situation was clear, but here we should agree first, and I thought you should be involved from the beginning. > Once upon a time (Linux 2.4), there was madvise() with just 5 flags: > > MADV_NORMAL > MADV_RANDOM > MADV_SEQUENTIAL > MADV_WILLNEED > MADV_DONTNEED > > And already a dozen years ago, *I* added the text about MADV_DONTNEED. > Back then, I believe it was true. I'm not sure if it's still true now, > but I assume for the moment that it is, and await feedback. And the > text saying that the call does not affect the semantics of memory > access dates back even further (and was then true, MADV_DONTNEED aside). > > Those 5 flags have analogs in POSIX's posix_madvise() (albeit, there > is a semantic mismatch between the destructive MADV_DONTNEED and > POSIX's nondestructive POSIX_MADV_DONTNEED). They also appear > on most other implementations. > > Since the original implementation, numerous pieces of cruft^W^W^W > excellent new flags have been overloaded into this one system call. > Some of those certainly violated the "does not change the semantics > of the application" statement, but, sadly, the kernel developers who > implemented MADV_REMOVE or MADV_DONTFORK did not think to send a > patch to the man page for those new flags, one that might have noted > that the semantics of the application are changed by such flags. Equally > sadly, I did overlook to scan the bigger page when *I* added > documentation of these flags to those pages, otherwise I might have > caught that detail. > > So, just to repeat, I could really use more help on it from MM > folk in the form of actual patches to the man page. Thanks for the background. I'll try to remember to check for man-pages part when I review some api changing patch. > Thanks, > > Michael > > [1] https://www.kernel.org/doc/man-pages/patches.html >
next prev parent reply other threads:[~2015-02-04 13:46 UTC|newest] Thread overview: 31+ messages / expand[flat|nested] mbox.gz Atom feed top 2015-02-02 16:55 [RFC PATCH] mm: madvise: Ignore repeated MADV_DONTNEED hints Mel Gorman 2015-02-02 22:05 ` Andrew Morton 2015-02-02 22:18 ` Mel Gorman 2015-02-02 22:35 ` Andrew Morton 2015-02-03 0:26 ` Davidlohr Bueso 2015-02-03 10:50 ` Mel Gorman 2015-02-05 21:44 ` Rik van Riel 2015-02-02 22:22 ` Dave Hansen 2015-02-03 8:19 ` MADV_DONTNEED semantics? Was: " Vlastimil Babka 2015-02-03 10:53 ` Kirill A. Shutemov 2015-02-03 11:42 ` Vlastimil Babka 2015-02-03 16:20 ` Michael Kerrisk (man-pages) 2015-02-04 13:46 ` Vlastimil Babka [this message] 2015-02-04 14:00 ` Michael Kerrisk (man-pages) 2015-02-04 17:02 ` Vlastimil Babka 2015-02-04 19:24 ` Michael Kerrisk (man-pages) 2015-02-05 1:07 ` Minchan Kim 2015-02-06 15:41 ` Michael Kerrisk (man-pages) 2015-02-09 6:46 ` Minchan Kim 2015-02-09 9:13 ` Michael Kerrisk (man-pages) 2015-02-05 15:41 ` Michal Hocko 2015-02-06 15:57 ` Michael Kerrisk (man-pages) 2015-02-06 20:45 ` Michal Hocko 2015-02-09 6:50 ` Minchan Kim 2015-02-04 0:09 ` Minchan Kim 2015-02-03 11:16 ` Mel Gorman 2015-02-03 15:21 ` Michal Hocko 2015-02-03 16:25 ` Michael Kerrisk (man-pages) 2015-02-03 9:47 ` Mel Gorman 2015-02-03 10:47 ` Kirill A. Shutemov 2015-02-03 11:21 ` Mel Gorman
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=54D22298.3040504@suse.cz \ --to=vbabka@suse.cz \ --cc=akpm@linux-foundation.org \ --cc=dave.hansen@intel.com \ --cc=hughd@google.com \ --cc=kirill@shutemov.name \ --cc=linux-api@vger.kernel.org \ --cc=linux-kernel@vger.kernel.org \ --cc=linux-man@vger.kernel.org \ --cc=linux-mm@kvack.org \ --cc=mgorman@suse.de \ --cc=minchan@kernel.org \ --cc=mtk.manpages@gmail.com \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).