LKML Archive on lore.kernel.org help / color / mirror / Atom feed
From: Andrea Reale <ar@linux.vnet.ibm.com> To: joeyli <jlee@suse.com> Cc: linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, m.bielski@virtualopensystems.com, arunks@qti.qualcomm.com, mark.rutland@arm.com, scott.branden@broadcom.com, will.deacon@arm.com, qiuxishi@huawei.com, catalin.marinas@arm.com, mhocko@suse.com, rafael.j.wysocki@intel.com, linux-acpi@vger.kernel.org Subject: Re: [PATCH v2 2/5] mm: memory_hotplug: Remove assumption on memory state before hotremove Date: Mon, 4 Dec 2017 11:28:55 +0000 [thread overview] Message-ID: <20171204112855.GA6373@samekh> (raw) In-Reply-To: <20171129015229.GD1469@linux-l9pv.suse> Hi Joey, and thanks for your comments. Response inline: On Wed 29 Nov 2017, 09:52, joeyli wrote: > On Wed, Nov 29, 2017 at 08:49:13AM +0800, joeyli wrote: > > Hi Andrea, > > > > On Fri, Nov 24, 2017 at 10:22:35AM +0000, Andrea Reale wrote: > > > Resending the patch adding linux-acpi in CC, as suggested by Rafael. > > > Everyone else: apologies for the noise. > > > > > > Commit 242831eb15a0 ("Memory hotplug / ACPI: Simplify memory removal") > > > introduced an assumption whereas when control > > > reaches remove_memory the corresponding memory has been already > > > offlined. In that case, the acpi_memhotplug was making sure that > > > the assumption held. > > > This assumption, however, is not necessarily true if offlining > > > and removal are not done by the same "controller" (for example, > > > when first offlining via sysfs). > > > > > > Removing this assumption for the generic remove_memory code > > > and moving it in the specific acpi_memhotplug code. This is > > > a dependency for the software-aided arm64 offlining and removal > > > process. > > > > > > Signed-off-by: Andrea Reale <ar@linux.vnet.ibm.com> > > > Signed-off-by: Maciej Bielski <m.bielski@linux.vnet.ibm.com> > > > --- > > > drivers/acpi/acpi_memhotplug.c | 2 +- > > > include/linux/memory_hotplug.h | 9 ++++++--- > > > mm/memory_hotplug.c | 13 +++++++++---- > > > 3 files changed, 16 insertions(+), 8 deletions(-) > > > > > > diff --git a/drivers/acpi/acpi_memhotplug.c b/drivers/acpi/acpi_memhotplug.c > > > index 6b0d3ef..b0126a0 100644 > > > --- a/drivers/acpi/acpi_memhotplug.c > > > +++ b/drivers/acpi/acpi_memhotplug.c > > > @@ -282,7 +282,7 @@ static void acpi_memory_remove_memory(struct acpi_memory_device *mem_device) > > > nid = memory_add_physaddr_to_nid(info->start_addr); > > > > > > acpi_unbind_memory_blocks(info); > > > - remove_memory(nid, info->start_addr, info->length); > > > + BUG_ON(remove_memory(nid, info->start_addr, info->length)); > > > list_del(&info->list); > > > kfree(info); > > > } > > > diff --git a/include/linux/memory_hotplug.h b/include/linux/memory_hotplug.h > > > index 58e110a..1a9c7b2 100644 > > > --- a/include/linux/memory_hotplug.h > > > +++ b/include/linux/memory_hotplug.h > > > @@ -295,7 +295,7 @@ static inline bool movable_node_is_enabled(void) > > > extern bool is_mem_section_removable(unsigned long pfn, unsigned long nr_pages); > > > extern void try_offline_node(int nid); > > > extern int offline_pages(unsigned long start_pfn, unsigned long nr_pages); > > > -extern void remove_memory(int nid, u64 start, u64 size); > > > +extern int remove_memory(int nid, u64 start, u64 size); > > > > > > #else > > > static inline bool is_mem_section_removable(unsigned long pfn, > > > @@ -311,7 +311,10 @@ static inline int offline_pages(unsigned long start_pfn, unsigned long nr_pages) > > > return -EINVAL; > > > } > > > > > > -static inline void remove_memory(int nid, u64 start, u64 size) {} > > > +static inline int remove_memory(int nid, u64 start, u64 size) > > > +{ > > > + return -EINVAL; > > > +} > > > #endif /* CONFIG_MEMORY_HOTREMOVE */ > > > > > > extern int walk_memory_range(unsigned long start_pfn, unsigned long end_pfn, > > > @@ -323,7 +326,7 @@ extern void move_pfn_range_to_zone(struct zone *zone, unsigned long start_pfn, > > > unsigned long nr_pages); > > > extern int offline_pages(unsigned long start_pfn, unsigned long nr_pages); > > > extern bool is_memblock_offlined(struct memory_block *mem); > > > -extern void remove_memory(int nid, u64 start, u64 size); > > > +extern int remove_memory(int nid, u64 start, u64 size); > > > extern int sparse_add_one_section(struct pglist_data *pgdat, unsigned long start_pfn); > > > extern void sparse_remove_one_section(struct zone *zone, struct mem_section *ms, > > > unsigned long map_offset); > > > diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c > > > index d4b5f29..d5f15af 100644 > > > --- a/mm/memory_hotplug.c > > > +++ b/mm/memory_hotplug.c > > > @@ -1892,7 +1892,7 @@ EXPORT_SYMBOL(try_offline_node); > > > * and online/offline operations before this call, as required by > > > * try_offline_node(). > > > */ > > > -void __ref remove_memory(int nid, u64 start, u64 size) > > > +int __ref remove_memory(int nid, u64 start, u64 size) > > > { > > > int ret; > > > > > > @@ -1908,18 +1908,23 @@ void __ref remove_memory(int nid, u64 start, u64 size) > > > ret = walk_memory_range(PFN_DOWN(start), PFN_UP(start + size - 1), NULL, > > > check_memblock_offlined_cb); > > > if (ret) > > > - BUG(); > > > + goto end_remove; > > > + > > > + ret = arch_remove_memory(start, size); > > Should not include arch_remove_memory() to BUG(). arch_remove_memory might also fail in some cases. In the arm64 implementation of this patchset, for example, it might fail in the (very rare) case when we would have to split a P[UM]D mapped section for removal (and we do not support that - see email thread here: https://lkml.org/lkml/2017/11/23/456). > > > + > > > + if (ret) > > > + goto end_remove; > > > > The original code triggers BUG() when any memblock is not offlined. Why > > the new logic includes the result of arch_remove_memory()? > > > > But I agreed the we don't need BUG(). Returning a error is better. > > Actually, I lost one thing. > > The BUG() have caught a issue about the offline state doesn't sync between > memory_block and device object. like: > mem->dev.offline != (mem->state == MEM_OFFLINE) > > So, the BUG() is useful to capture state issue in memory subsystem. But, I > understood your concern about the two steps offline/remove from userland. > > Maybe we should move the BUG() to somewhere but not just remove it. Or if > we think that the BUG() is too intense, at least we should print out a error > message, and ACPI should checks the return value from subsystem to > interrupt memory-hotplug process. In this patchset, BUG() is moved to acpi_memory_remove_memory(), the caller of arch_remove_memory(). However, I agree with Michal, that we should not BUG() here but rather halt the hotremove process and print some errors. Is there any state in ACPI that should be undone in case of hotremove errors or we can just stop the process "halfway"? > Thanks a lot! > Joey Lee Thanks, Andrea > -- > To unsubscribe from this list: send the line "unsubscribe linux-acpi" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html >
next prev parent reply other threads:[~2017-12-04 11:29 UTC|newest] Thread overview: 49+ messages / expand[flat|nested] mbox.gz Atom feed top 2017-11-23 11:13 [PATCH v2 0/5] Memory hotplug support for arm64 - complete patchset v2 Andrea Reale 2017-11-23 11:13 ` [PATCH v2 1/5] mm: memory_hotplug: Memory hotplug (add) support for arm64 Maciej Bielski 2017-11-24 5:55 ` Arun KS 2017-11-24 9:42 ` Andrea Reale 2017-11-24 10:53 ` Maciej Bielski 2017-11-26 6:58 ` Arun KS 2017-11-27 15:19 ` Robin Murphy 2017-11-27 16:39 ` Maciej Bielski 2017-11-27 17:11 ` Andrea Reale 2017-11-23 11:14 ` [PATCH v2 2/5] mm: memory_hotplug: Remove assumption on memory state before hotremove Andrea Reale 2017-11-23 22:18 ` Rafael J. Wysocki 2017-11-24 14:39 ` Rafael J. Wysocki 2017-11-24 14:49 ` Andrea Reale 2017-11-24 15:43 ` Michal Hocko 2017-11-24 15:54 ` Andrea Reale 2017-11-24 18:17 ` Michal Hocko 2017-11-29 1:20 ` joeyli 2017-11-30 9:47 ` Michal Hocko 2017-11-27 15:20 ` Robin Murphy 2017-11-27 17:44 ` Andrea Reale 2017-11-29 0:49 ` joeyli 2017-11-29 1:52 ` joeyli 2017-12-04 11:28 ` Andrea Reale [this message] 2017-12-04 14:05 ` Rafael J. Wysocki 2017-11-23 11:14 ` [PATCH v2 3/5] mm: memory_hotplug: memblock to track partially removed vmemmap mem Andrea Reale 2017-11-27 15:20 ` Robin Murphy 2017-11-27 17:38 ` Andrea Reale 2017-11-30 14:51 ` Michal Hocko 2017-12-04 11:49 ` Andrea Reale 2017-12-04 12:32 ` Michal Hocko 2017-12-04 12:42 ` Andrea Reale 2017-12-04 12:48 ` Michal Hocko 2017-11-23 11:14 ` [PATCH v2 4/5] mm: memory_hotplug: Add memory hotremove probe device Andrea Reale 2017-11-24 10:35 ` zhong jiang 2017-11-24 10:44 ` Andrea Reale 2017-11-24 12:17 ` zhong jiang 2017-11-24 14:29 ` Andrea Reale 2017-12-04 17:50 ` Reza Arbab 2017-11-27 15:33 ` Robin Murphy 2017-11-27 17:14 ` Andrea Reale 2017-11-30 14:49 ` Michal Hocko 2017-12-04 11:51 ` Andrea Reale 2017-12-04 12:33 ` Michal Hocko 2017-12-04 12:44 ` Andrea Reale 2017-11-23 11:15 ` [PATCH v2 5/5] mm: memory-hotplug: Add memory hot remove support for arm64 Andrea Reale 2017-11-23 16:02 ` [PATCH v2 0/5] Memory hotplug support for arm64 - complete patchset v2 Michal Hocko 2017-11-23 17:33 ` Andrea Reale 2017-11-30 14:57 ` Michal Hocko 2017-12-04 11:34 ` Andrea Reale
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=20171204112855.GA6373@samekh \ --to=ar@linux.vnet.ibm.com \ --cc=arunks@qti.qualcomm.com \ --cc=catalin.marinas@arm.com \ --cc=jlee@suse.com \ --cc=linux-acpi@vger.kernel.org \ --cc=linux-arm-kernel@lists.infradead.org \ --cc=linux-kernel@vger.kernel.org \ --cc=linux-mm@kvack.org \ --cc=m.bielski@virtualopensystems.com \ --cc=mark.rutland@arm.com \ --cc=mhocko@suse.com \ --cc=qiuxishi@huawei.com \ --cc=rafael.j.wysocki@intel.com \ --cc=scott.branden@broadcom.com \ --cc=will.deacon@arm.com \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).