LKML Archive on lore.kernel.org
help / color / mirror / Atom feed
* [PATCH v2] ext4: use non-movable memory for superblock readahead
@ 2020-02-29 0:14 Roman Gushchin
2020-02-29 7:49 ` Andreas Dilger
2020-04-10 3:23 ` Theodore Y. Ts'o
0 siblings, 2 replies; 7+ messages in thread
From: Roman Gushchin @ 2020-02-29 0:14 UTC (permalink / raw)
To: linux-fsdevel, linux-ext4, linux-kernel
Cc: Alexander Viro, Andreas Dilger, Roman Gushchin,
Andrew Perepechko, Theodore Ts'o, Gioh Kim, Jan Kara
Since commit a8ac900b8163 ("ext4: use non-movable memory for the
superblock") buffers for ext4 superblock were allocated using
the sb_bread_unmovable() helper which allocated buffer heads
out of non-movable memory blocks. It was necessarily to not block
page migrations and do not cause cma allocation failures.
However commit 85c8f176a611 ("ext4: preload block group descriptors")
broke this by introducing pre-reading of the ext4 superblock.
The problem is that __breadahead() is using __getblk() underneath,
which allocates buffer heads out of movable memory.
It resulted in page migration failures I've seen on a machine
with an ext4 partition and a preallocated cma area.
Fix this by introducing sb_breadahead_unmovable() and
__breadahead_gfp() helpers which use non-movable memory for buffer
head allocations and use them for the ext4 superblock readahead.
v2: found a similar issue in __ext4_get_inode_loc()
Fixes: 85c8f176a611 ("ext4: preload block group descriptors")
Signed-off-by: Roman Gushchin <guro@fb.com>
Cc: Andrew Perepechko <andrew.perepechko@seagate.com>
Cc: Theodore Ts'o <tytso@mit.edu>
Cc: Gioh Kim <gioh.kim@lge.com>
Cc: Jan Kara <jack@suse.cz>
---
fs/buffer.c | 11 +++++++++++
fs/ext4/inode.c | 2 +-
fs/ext4/super.c | 2 +-
include/linux/buffer_head.h | 8 ++++++++
4 files changed, 21 insertions(+), 2 deletions(-)
diff --git a/fs/buffer.c b/fs/buffer.c
index 4299e100a05b..25462edd920e 100644
--- a/fs/buffer.c
+++ b/fs/buffer.c
@@ -1414,6 +1414,17 @@ void __breadahead(struct block_device *bdev, sector_t block, unsigned size)
}
EXPORT_SYMBOL(__breadahead);
+void __breadahead_gfp(struct block_device *bdev, sector_t block, unsigned size,
+ gfp_t gfp)
+{
+ struct buffer_head *bh = __getblk_gfp(bdev, block, size, gfp);
+ if (likely(bh)) {
+ ll_rw_block(REQ_OP_READ, REQ_RAHEAD, 1, &bh);
+ brelse(bh);
+ }
+}
+EXPORT_SYMBOL(__breadahead_gfp);
+
/**
* __bread_gfp() - reads a specified block and returns the bh
* @bdev: the block_device to read from
diff --git a/fs/ext4/inode.c b/fs/ext4/inode.c
index fa0ff78dc033..b131fedc6b77 100644
--- a/fs/ext4/inode.c
+++ b/fs/ext4/inode.c
@@ -4348,7 +4348,7 @@ static int __ext4_get_inode_loc(struct inode *inode,
if (end > table)
end = table;
while (b <= end)
- sb_breadahead(sb, b++);
+ sb_breadahead_unmovable(sb, b++);
}
/*
diff --git a/fs/ext4/super.c b/fs/ext4/super.c
index ff1b764b0c0e..fb2338a5220e 100644
--- a/fs/ext4/super.c
+++ b/fs/ext4/super.c
@@ -4331,7 +4331,7 @@ static int ext4_fill_super(struct super_block *sb, void *data, int silent)
/* Pre-read the descriptors into the buffer cache */
for (i = 0; i < db_count; i++) {
block = descriptor_loc(sb, logical_sb_block, i);
- sb_breadahead(sb, block);
+ sb_breadahead_unmovable(sb, block);
}
for (i = 0; i < db_count; i++) {
diff --git a/include/linux/buffer_head.h b/include/linux/buffer_head.h
index 7b73ef7f902d..b56cc825f64d 100644
--- a/include/linux/buffer_head.h
+++ b/include/linux/buffer_head.h
@@ -189,6 +189,8 @@ struct buffer_head *__getblk_gfp(struct block_device *bdev, sector_t block,
void __brelse(struct buffer_head *);
void __bforget(struct buffer_head *);
void __breadahead(struct block_device *, sector_t block, unsigned int size);
+void __breadahead_gfp(struct block_device *, sector_t block, unsigned int size,
+ gfp_t gfp);
struct buffer_head *__bread_gfp(struct block_device *,
sector_t block, unsigned size, gfp_t gfp);
void invalidate_bh_lrus(void);
@@ -319,6 +321,12 @@ sb_breadahead(struct super_block *sb, sector_t block)
__breadahead(sb->s_bdev, block, sb->s_blocksize);
}
+static inline void
+sb_breadahead_unmovable(struct super_block *sb, sector_t block)
+{
+ __breadahead_gfp(sb->s_bdev, block, sb->s_blocksize, 0);
+}
+
static inline struct buffer_head *
sb_getblk(struct super_block *sb, sector_t block)
{
--
2.24.1
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v2] ext4: use non-movable memory for superblock readahead
2020-02-29 0:14 [PATCH v2] ext4: use non-movable memory for superblock readahead Roman Gushchin
@ 2020-02-29 7:49 ` Andreas Dilger
2020-03-02 16:37 ` Roman Gushchin
` (2 more replies)
2020-04-10 3:23 ` Theodore Y. Ts'o
1 sibling, 3 replies; 7+ messages in thread
From: Andreas Dilger @ 2020-02-29 7:49 UTC (permalink / raw)
To: Roman Gushchin
Cc: Linux FS Devel, linux-ext4, Linux Kernel Mailing List,
Alexander Viro, Andrew Perepechko, Theodore Ts'o, Gioh Kim,
Jan Kara
[-- Attachment #1: Type: text/plain, Size: 4373 bytes --]
On Feb 28, 2020, at 5:14 PM, Roman Gushchin <guro@fb.com> wrote:
>
> Since commit a8ac900b8163 ("ext4: use non-movable memory for the
> superblock") buffers for ext4 superblock were allocated using
> the sb_bread_unmovable() helper which allocated buffer heads
> out of non-movable memory blocks. It was necessarily to not block
> page migrations and do not cause cma allocation failures.
>
> However commit 85c8f176a611 ("ext4: preload block group descriptors")
> broke this by introducing pre-reading of the ext4 superblock.
> The problem is that __breadahead() is using __getblk() underneath,
> which allocates buffer heads out of movable memory.
>
> It resulted in page migration failures I've seen on a machine
> with an ext4 partition and a preallocated cma area.
>
> Fix this by introducing sb_breadahead_unmovable() and
> __breadahead_gfp() helpers which use non-movable memory for buffer
> head allocations and use them for the ext4 superblock readahead.
>
> v2: found a similar issue in __ext4_get_inode_loc()
>
> Fixes: 85c8f176a611 ("ext4: preload block group descriptors")
> Signed-off-by: Roman Gushchin <guro@fb.com>
Reviewed-by: Andreas Dilger <adilger@dilger.ca>
> Cc: Andrew Perepechko <andrew.perepechko@seagate.com>
> Cc: Theodore Ts'o <tytso@mit.edu>
> Cc: Gioh Kim <gioh.kim@lge.com>
> Cc: Jan Kara <jack@suse.cz>
> ---
> fs/buffer.c | 11 +++++++++++
> fs/ext4/inode.c | 2 +-
> fs/ext4/super.c | 2 +-
> include/linux/buffer_head.h | 8 ++++++++
> 4 files changed, 21 insertions(+), 2 deletions(-)
>
> diff --git a/fs/buffer.c b/fs/buffer.c
> index 4299e100a05b..25462edd920e 100644
> --- a/fs/buffer.c
> +++ b/fs/buffer.c
> @@ -1414,6 +1414,17 @@ void __breadahead(struct block_device *bdev, sector_t block, unsigned size)
> }
> EXPORT_SYMBOL(__breadahead);
>
> +void __breadahead_gfp(struct block_device *bdev, sector_t block, unsigned size,
> + gfp_t gfp)
> +{
> + struct buffer_head *bh = __getblk_gfp(bdev, block, size, gfp);
> + if (likely(bh)) {
> + ll_rw_block(REQ_OP_READ, REQ_RAHEAD, 1, &bh);
> + brelse(bh);
> + }
> +}
> +EXPORT_SYMBOL(__breadahead_gfp);
> +
> /**
> * __bread_gfp() - reads a specified block and returns the bh
> * @bdev: the block_device to read from
> diff --git a/fs/ext4/inode.c b/fs/ext4/inode.c
> index fa0ff78dc033..b131fedc6b77 100644
> --- a/fs/ext4/inode.c
> +++ b/fs/ext4/inode.c
> @@ -4348,7 +4348,7 @@ static int __ext4_get_inode_loc(struct inode *inode,
> if (end > table)
> end = table;
> while (b <= end)
> - sb_breadahead(sb, b++);
> + sb_breadahead_unmovable(sb, b++);
> }
>
> /*
> diff --git a/fs/ext4/super.c b/fs/ext4/super.c
> index ff1b764b0c0e..fb2338a5220e 100644
> --- a/fs/ext4/super.c
> +++ b/fs/ext4/super.c
> @@ -4331,7 +4331,7 @@ static int ext4_fill_super(struct super_block *sb, void *data, int silent)
> /* Pre-read the descriptors into the buffer cache */
> for (i = 0; i < db_count; i++) {
> block = descriptor_loc(sb, logical_sb_block, i);
> - sb_breadahead(sb, block);
> + sb_breadahead_unmovable(sb, block);
> }
>
> for (i = 0; i < db_count; i++) {
> diff --git a/include/linux/buffer_head.h b/include/linux/buffer_head.h
> index 7b73ef7f902d..b56cc825f64d 100644
> --- a/include/linux/buffer_head.h
> +++ b/include/linux/buffer_head.h
> @@ -189,6 +189,8 @@ struct buffer_head *__getblk_gfp(struct block_device *bdev, sector_t block,
> void __brelse(struct buffer_head *);
> void __bforget(struct buffer_head *);
> void __breadahead(struct block_device *, sector_t block, unsigned int size);
> +void __breadahead_gfp(struct block_device *, sector_t block, unsigned int size,
> + gfp_t gfp);
> struct buffer_head *__bread_gfp(struct block_device *,
> sector_t block, unsigned size, gfp_t gfp);
> void invalidate_bh_lrus(void);
> @@ -319,6 +321,12 @@ sb_breadahead(struct super_block *sb, sector_t block)
> __breadahead(sb->s_bdev, block, sb->s_blocksize);
> }
>
> +static inline void
> +sb_breadahead_unmovable(struct super_block *sb, sector_t block)
> +{
> + __breadahead_gfp(sb->s_bdev, block, sb->s_blocksize, 0);
> +}
> +
> static inline struct buffer_head *
> sb_getblk(struct super_block *sb, sector_t block)
> {
> --
> 2.24.1
>
Cheers, Andreas
[-- Attachment #2: Message signed with OpenPGP --]
[-- Type: application/pgp-signature, Size: 873 bytes --]
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v2] ext4: use non-movable memory for superblock readahead
2020-02-29 7:49 ` Andreas Dilger
@ 2020-03-02 16:37 ` Roman Gushchin
2020-03-03 22:17 ` Roman Gushchin
2020-04-06 17:20 ` Roman Gushchin
2 siblings, 0 replies; 7+ messages in thread
From: Roman Gushchin @ 2020-03-02 16:37 UTC (permalink / raw)
To: Andreas Dilger
Cc: Linux FS Devel, linux-ext4, Linux Kernel Mailing List,
Alexander Viro, Andrew Perepechko, Theodore Ts'o, Gioh Kim,
Jan Kara
On Sat, Feb 29, 2020 at 12:49:13AM -0700, Andreas Dilger wrote:
> On Feb 28, 2020, at 5:14 PM, Roman Gushchin <guro@fb.com> wrote:
> >
> > Since commit a8ac900b8163 ("ext4: use non-movable memory for the
> > superblock") buffers for ext4 superblock were allocated using
> > the sb_bread_unmovable() helper which allocated buffer heads
> > out of non-movable memory blocks. It was necessarily to not block
> > page migrations and do not cause cma allocation failures.
> >
> > However commit 85c8f176a611 ("ext4: preload block group descriptors")
> > broke this by introducing pre-reading of the ext4 superblock.
> > The problem is that __breadahead() is using __getblk() underneath,
> > which allocates buffer heads out of movable memory.
> >
> > It resulted in page migration failures I've seen on a machine
> > with an ext4 partition and a preallocated cma area.
> >
> > Fix this by introducing sb_breadahead_unmovable() and
> > __breadahead_gfp() helpers which use non-movable memory for buffer
> > head allocations and use them for the ext4 superblock readahead.
> >
> > v2: found a similar issue in __ext4_get_inode_loc()
> >
> > Fixes: 85c8f176a611 ("ext4: preload block group descriptors")
> > Signed-off-by: Roman Gushchin <guro@fb.com>
>
> Reviewed-by: Andreas Dilger <adilger@dilger.ca>
Thank you!
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v2] ext4: use non-movable memory for superblock readahead
2020-02-29 7:49 ` Andreas Dilger
2020-03-02 16:37 ` Roman Gushchin
@ 2020-03-03 22:17 ` Roman Gushchin
2020-04-06 17:20 ` Roman Gushchin
2 siblings, 0 replies; 7+ messages in thread
From: Roman Gushchin @ 2020-03-03 22:17 UTC (permalink / raw)
To: Andreas Dilger
Cc: Linux FS Devel, linux-ext4, Linux Kernel Mailing List,
Alexander Viro, Andrew Perepechko, Theodore Ts'o, Gioh Kim,
Jan Kara
On Sat, Feb 29, 2020 at 12:49:13AM -0700, Andreas Dilger wrote:
> On Feb 28, 2020, at 5:14 PM, Roman Gushchin <guro@fb.com> wrote:
> >
> > Since commit a8ac900b8163 ("ext4: use non-movable memory for the
> > superblock") buffers for ext4 superblock were allocated using
> > the sb_bread_unmovable() helper which allocated buffer heads
> > out of non-movable memory blocks. It was necessarily to not block
> > page migrations and do not cause cma allocation failures.
> >
> > However commit 85c8f176a611 ("ext4: preload block group descriptors")
> > broke this by introducing pre-reading of the ext4 superblock.
> > The problem is that __breadahead() is using __getblk() underneath,
> > which allocates buffer heads out of movable memory.
> >
> > It resulted in page migration failures I've seen on a machine
> > with an ext4 partition and a preallocated cma area.
> >
> > Fix this by introducing sb_breadahead_unmovable() and
> > __breadahead_gfp() helpers which use non-movable memory for buffer
> > head allocations and use them for the ext4 superblock readahead.
> >
> > v2: found a similar issue in __ext4_get_inode_loc()
> >
> > Fixes: 85c8f176a611 ("ext4: preload block group descriptors")
> > Signed-off-by: Roman Gushchin <guro@fb.com>
>
> Reviewed-by: Andreas Dilger <adilger@dilger.ca>
Is it good to go?
Can it go through the ext4 tree?
Thanks!
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v2] ext4: use non-movable memory for superblock readahead
2020-02-29 7:49 ` Andreas Dilger
2020-03-02 16:37 ` Roman Gushchin
2020-03-03 22:17 ` Roman Gushchin
@ 2020-04-06 17:20 ` Roman Gushchin
2 siblings, 0 replies; 7+ messages in thread
From: Roman Gushchin @ 2020-04-06 17:20 UTC (permalink / raw)
To: Theodore Ts'o
Cc: Linux FS Devel, linux-ext4, Linux Kernel Mailing List,
Alexander Viro, Andrew Perepechko, adilger, Gioh Kim, Jan Kara
On Sat, Feb 29, 2020 at 12:49:13AM -0700, Andreas Dilger wrote:
> On Feb 28, 2020, at 5:14 PM, Roman Gushchin <guro@fb.com> wrote:
> >
> > Since commit a8ac900b8163 ("ext4: use non-movable memory for the
> > superblock") buffers for ext4 superblock were allocated using
> > the sb_bread_unmovable() helper which allocated buffer heads
> > out of non-movable memory blocks. It was necessarily to not block
> > page migrations and do not cause cma allocation failures.
> >
> > However commit 85c8f176a611 ("ext4: preload block group descriptors")
> > broke this by introducing pre-reading of the ext4 superblock.
> > The problem is that __breadahead() is using __getblk() underneath,
> > which allocates buffer heads out of movable memory.
> >
> > It resulted in page migration failures I've seen on a machine
> > with an ext4 partition and a preallocated cma area.
> >
> > Fix this by introducing sb_breadahead_unmovable() and
> > __breadahead_gfp() helpers which use non-movable memory for buffer
> > head allocations and use them for the ext4 superblock readahead.
> >
> > v2: found a similar issue in __ext4_get_inode_loc()
> >
> > Fixes: 85c8f176a611 ("ext4: preload block group descriptors")
> > Signed-off-by: Roman Gushchin <guro@fb.com>
>
> Reviewed-by: Andreas Dilger <adilger@dilger.ca>
Hello, Theodore!
Can you, please, pick this patch?
We've some changes on the mm side (more actively using a cma area for movable
allocations), which might bring a regression without this ext4 change.
Thank you!
Roman
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v2] ext4: use non-movable memory for superblock readahead
2020-02-29 0:14 [PATCH v2] ext4: use non-movable memory for superblock readahead Roman Gushchin
2020-02-29 7:49 ` Andreas Dilger
@ 2020-04-10 3:23 ` Theodore Y. Ts'o
2020-04-10 16:12 ` Roman Gushchin
1 sibling, 1 reply; 7+ messages in thread
From: Theodore Y. Ts'o @ 2020-04-10 3:23 UTC (permalink / raw)
To: Roman Gushchin
Cc: linux-fsdevel, linux-ext4, linux-kernel, Alexander Viro,
Andreas Dilger, Andrew Perepechko, Gioh Kim, Jan Kara
On Fri, Feb 28, 2020 at 04:14:11PM -0800, Roman Gushchin wrote:
> Since commit a8ac900b8163 ("ext4: use non-movable memory for the
> superblock") buffers for ext4 superblock were allocated using
> the sb_bread_unmovable() helper which allocated buffer heads
> out of non-movable memory blocks. It was necessarily to not block
> page migrations and do not cause cma allocation failures.
>
> However commit 85c8f176a611 ("ext4: preload block group descriptors")
> broke this by introducing pre-reading of the ext4 superblock.
> The problem is that __breadahead() is using __getblk() underneath,
> which allocates buffer heads out of movable memory.
>
> It resulted in page migration failures I've seen on a machine
> with an ext4 partition and a preallocated cma area.
>
> Fix this by introducing sb_breadahead_unmovable() and
> __breadahead_gfp() helpers which use non-movable memory for buffer
> head allocations and use them for the ext4 superblock readahead.
Applied, thanks. Apologies for not picking this up earlier.
- Ted
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v2] ext4: use non-movable memory for superblock readahead
2020-04-10 3:23 ` Theodore Y. Ts'o
@ 2020-04-10 16:12 ` Roman Gushchin
0 siblings, 0 replies; 7+ messages in thread
From: Roman Gushchin @ 2020-04-10 16:12 UTC (permalink / raw)
To: Theodore Y. Ts'o
Cc: linux-fsdevel, linux-ext4, linux-kernel, Alexander Viro,
Andreas Dilger, Andrew Perepechko, Gioh Kim, Jan Kara
On Thu, Apr 09, 2020 at 11:23:44PM -0400, Theodore Y. Ts'o wrote:
> On Fri, Feb 28, 2020 at 04:14:11PM -0800, Roman Gushchin wrote:
> > Since commit a8ac900b8163 ("ext4: use non-movable memory for the
> > superblock") buffers for ext4 superblock were allocated using
> > the sb_bread_unmovable() helper which allocated buffer heads
> > out of non-movable memory blocks. It was necessarily to not block
> > page migrations and do not cause cma allocation failures.
> >
> > However commit 85c8f176a611 ("ext4: preload block group descriptors")
> > broke this by introducing pre-reading of the ext4 superblock.
> > The problem is that __breadahead() is using __getblk() underneath,
> > which allocates buffer heads out of movable memory.
> >
> > It resulted in page migration failures I've seen on a machine
> > with an ext4 partition and a preallocated cma area.
> >
> > Fix this by introducing sb_breadahead_unmovable() and
> > __breadahead_gfp() helpers which use non-movable memory for buffer
> > head allocations and use them for the ext4 superblock readahead.
>
> Applied, thanks. Apologies for not picking this up earlier.
>
> - Ted
Thank you!
Roman
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2020-04-10 16:12 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-02-29 0:14 [PATCH v2] ext4: use non-movable memory for superblock readahead Roman Gushchin
2020-02-29 7:49 ` Andreas Dilger
2020-03-02 16:37 ` Roman Gushchin
2020-03-03 22:17 ` Roman Gushchin
2020-04-06 17:20 ` Roman Gushchin
2020-04-10 3:23 ` Theodore Y. Ts'o
2020-04-10 16:12 ` Roman Gushchin
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).