[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Qemu-devel] [Qemu-ppc] [RFC PATCH v1] spapr: Memory hot-unplug supp
From: |
David Gibson |
Subject: |
Re: [Qemu-devel] [Qemu-ppc] [RFC PATCH v1] spapr: Memory hot-unplug support |
Date: |
Wed, 11 Nov 2015 12:36:30 +1100 |
User-agent: |
Mutt/1.5.23 (2015-06-09) |
On Mon, Oct 26, 2015 at 03:23:05PM +0530, Bharata B Rao wrote:
> Add support to hot remove pc-dimm memory devices.
Sorry it's taken me so long to look at this.
> TODO: In response to memory hot removal operation on a DIMM device,
> guest kernel might refuse to offline a few LMBs that are part of that device.
> In such cases, we will have a DIMM device that has some LMBs online and some
> LMBs offline. To avoid this situation, drmgr could be enhanced to support
> a command line option that results in removal of all the LMBs or none.
Hm.. what would be the end result of such a situation? We want to
handle it as gracefully as we can, even if the guest has old tools.
Is there some way we can detect this failure condition, and re-connect
the DIMM?
It does highlight the fact that the PAPR hotplug interface and the
pc-dimm model don't work together terribly well. I think we have to
try to support it for the sake of management layers, but I do wonder
if we ought to thinkg about an alternative "lmb-pool" backend, where
the precise location of memory blocks isn't so important. With some
thought such a backend might also be useful for paravirt x86.
Which also makes me think, I wonder if it would be possible to wire up
a PAPR compatible interface to qemu's balloon backend, since in some
ways the PAPR memory hotplug model acts more like a balloon (in that
the guest physical address of removed LMBs isn't usually important to
the host).
Still, we need to get the dimm backed model working first, I guess.
Apart from those overall considerations, the patch looks good.
> Signed-off-by: Bharata B Rao <address@hidden>
> ---
> Changes in v1:
> - Got rid of the patch that introduced a field in PCDIMMDevice to track
> DIMM marked for removal since we can track that using within DRC
> object.
> - Removed the patch that added return value to rtas_set_indicator()
> since the required changes are already pushed by Michael Roth.
>
> v0:
>
> hw/ppc/spapr.c | 90
> +++++++++++++++++++++++++++++++++++++++++++++++++++++-
> hw/ppc/spapr_drc.c | 18 +++++++++++
> 2 files changed, 107 insertions(+), 1 deletion(-)
>
> diff --git a/hw/ppc/spapr.c b/hw/ppc/spapr.c
> index e1202ce..f5b1ac2 100644
> --- a/hw/ppc/spapr.c
> +++ b/hw/ppc/spapr.c
> @@ -2174,6 +2174,85 @@ out:
> error_propagate(errp, local_err);
> }
>
> +typedef struct sPAPRDIMMState {
> + uint32_t nr_lmbs;
> +} sPAPRDIMMState;
> +
> +static void spapr_lmb_release(DeviceState *dev, void *opaque)
> +{
> + sPAPRDIMMState *ds = (sPAPRDIMMState *)opaque;
> + HotplugHandler *hotplug_ctrl = NULL;
> + Error *local_err = NULL;
> +
> + if (--ds->nr_lmbs) {
> + return;
> + }
> +
> + g_free(ds);
> +
> + /*
> + * Now that all the LMBs have been removed by the guest, call the
> + * pc-dimm unplug handler to cleanup up the pc-dimm device.
> + */
> + hotplug_ctrl = qdev_get_hotplug_handler(dev);
> + hotplug_handler_unplug(hotplug_ctrl, dev, &local_err);
> +}
> +
> +static void spapr_del_lmbs(DeviceState *dev, uint64_t addr, uint64_t size,
> + Error **errp)
> +{
> + sPAPRDRConnector *drc;
> + sPAPRDRConnectorClass *drck;
> + uint32_t nr_lmbs = size/SPAPR_MEMORY_BLOCK_SIZE;
> + Error *local_err = NULL;
> + int i;
> + sPAPRDIMMState *ds = g_malloc0(sizeof(sPAPRDIMMState));
> +
> + ds->nr_lmbs = nr_lmbs;
> + for (i = 0; i < nr_lmbs; i++) {
> + drc = spapr_dr_connector_by_id(SPAPR_DR_CONNECTOR_TYPE_LMB,
> + addr/SPAPR_MEMORY_BLOCK_SIZE);
> + g_assert(drc);
> +
> + drck = SPAPR_DR_CONNECTOR_GET_CLASS(drc);
> + drck->detach(drc, dev, spapr_lmb_release, ds, &local_err);
> + addr += SPAPR_MEMORY_BLOCK_SIZE;
> + }
> + spapr_hotplug_req_remove_by_count(SPAPR_DR_CONNECTOR_TYPE_LMB, nr_lmbs);
> +}
> +
> +static void spapr_memory_unplug(HotplugHandler *hotplug_dev, DeviceState
> *dev,
> + Error **errp)
> +{
> + sPAPRMachineState *ms = SPAPR_MACHINE(hotplug_dev);
> + PCDIMMDevice *dimm = PC_DIMM(dev);
> + PCDIMMDeviceClass *ddc = PC_DIMM_GET_CLASS(dimm);
> + MemoryRegion *mr = ddc->get_memory_region(dimm);
> +
> + pc_dimm_memory_unplug(dev, &ms->hotplug_memory, mr);
> + object_unparent(OBJECT(dev));
> +}
> +
> +static void spapr_memory_unplug_request(HotplugHandler *hotplug_dev,
> + DeviceState *dev, Error **errp)
> +{
> + Error *local_err = NULL;
> + PCDIMMDevice *dimm = PC_DIMM(dev);
> + PCDIMMDeviceClass *ddc = PC_DIMM_GET_CLASS(dimm);
> + MemoryRegion *mr = ddc->get_memory_region(dimm);
> + uint64_t size = memory_region_size(mr);
> + uint64_t addr;
> +
> + addr = object_property_get_int(OBJECT(dimm), PC_DIMM_ADDR_PROP,
> &local_err);
> + if (local_err) {
> + goto out;
> + }
> +
> + spapr_del_lmbs(dev, addr, size, &local_err);
> +out:
> + error_propagate(errp, local_err);
> +}
> +
> static void spapr_machine_device_plug(HotplugHandler *hotplug_dev,
> DeviceState *dev, Error **errp)
> {
> @@ -2221,7 +2300,15 @@ static void spapr_machine_device_unplug(HotplugHandler
> *hotplug_dev,
> DeviceState *dev, Error **errp)
> {
> if (object_dynamic_cast(OBJECT(dev), TYPE_PC_DIMM)) {
> - error_setg(errp, "Memory hot unplug not supported by sPAPR");
> + spapr_memory_unplug(hotplug_dev, dev, errp);
> + }
> +}
> +
> +static void spapr_machine_device_unplug_request(HotplugHandler *hotplug_dev,
> + DeviceState *dev, Error
> **errp)
> +{
> + if (object_dynamic_cast(OBJECT(dev), TYPE_PC_DIMM)) {
> + spapr_memory_unplug_request(hotplug_dev, dev, errp);
> }
> }
>
> @@ -2263,6 +2350,7 @@ static void spapr_machine_class_init(ObjectClass *oc,
> void *data)
> hc->plug = spapr_machine_device_plug;
> hc->unplug = spapr_machine_device_unplug;
> mc->cpu_index_to_socket_id = spapr_cpu_index_to_socket_id;
> + hc->unplug_request = spapr_machine_device_unplug_request;
>
> smc->dr_lmb_enabled = false;
> fwc->get_dev_path = spapr_get_fw_dev_path;
> diff --git a/hw/ppc/spapr_drc.c b/hw/ppc/spapr_drc.c
> index 5d6ea7c..59b6ea9 100644
> --- a/hw/ppc/spapr_drc.c
> +++ b/hw/ppc/spapr_drc.c
> @@ -11,6 +11,7 @@
> */
>
> #include "hw/ppc/spapr_drc.h"
> +#include "hw/ppc/spapr.h"
> #include "qom/object.h"
> #include "hw/qdev.h"
> #include "qapi/visitor.h"
> @@ -77,6 +78,23 @@ static uint32_t set_isolation_state(sPAPRDRConnector *drc,
> }
> }
>
> + /*
> + * Fail any request to ISOLATE the LMB DRC if this LMB doesn't
> + * belong to a DIMM device that is marked for removal.
> + *
> + * Currently the guest userspace tool drmgr that drives the memory
> + * hotplug/unplug will just try to remove a set of 'removable' LMBs
> + * in response to a hot unplug request that is based on drc-count.
> + * If the LMB being removed doesn't belong to a DIMM device that is
> + * actually being unplugged, fail the isolation request here.
> + */
> + if (drc->type == SPAPR_DR_CONNECTOR_TYPE_LMB) {
> + if ((state == SPAPR_DR_ISOLATION_STATE_ISOLATED) &&
> + !drc->awaiting_release) {
> + return RTAS_OUT_HW_ERROR;
> + }
> + }
> +
> drc->isolation_state = state;
>
> if (drc->isolation_state == SPAPR_DR_ISOLATION_STATE_ISOLATED) {
--
David Gibson | I'll have my music baroque, and my code
david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_
| _way_ _around_!
http://www.ozlabs.org/~dgibson
signature.asc
Description: PGP signature
- Re: [Qemu-devel] [Qemu-ppc] [RFC PATCH v1] spapr: Memory hot-unplug support,
David Gibson <=