

From: Michael Qiu
Subject: Re: [PATCH v2] vdpa: reset the backend device in stage of stop last vhost device
Date: Thu, 31 Mar 2022 12:02:00 +0800
User-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.7.0



On 2022/3/31 8:15, Si-Wei Liu wrote:


On 3/30/2022 3:02 AM, 08005325@163.com wrote:
From: Michael Qiu <qiudayu@archeros.com>

Currently, when the VM powers off, the vdpa device (such as a Mellanox
BlueField-2 VF) is reset many times (with one data path queue pair and
one control queue, the reset is triggered three times), which leads to
the issue below:

vhost VQ 2 ring restore failed: -22: Invalid argument (22)

This is because vhost_net_stop() stops every vhost device bound to the
virtio device, and in vhost_dev_stop() QEMU tries to stop the device and
then stop the queues via vhost_virtqueue_stop().

vhost_dev_stop() resets the device, which clears some flags in the
low-level driver; in the next loop iteration (stopping the other vhost
backends), QEMU tries to stop the queues of that vhost backend, and the
driver finds the VQ invalid. This is the root cause.

To solve the issue, vdpa should set the vring unready, and the reset op
should be removed from device stop (vhost_dev_start(hdev, false)).

A new function vhost_dev_reset is implemented that resets the backend
device only when the last vhost device is stopped.

Signed-off-by: Michael Qiu <qiudayu@archeros.com>
Acked-by: Jason Wang <jasowang@redhat.com>
---
v2 --> v1:
    implement a new function vhost_dev_reset,
    reset the backend kernel device at last.

---
  hw/net/vhost_net.c        | 22 +++++++++++++++++++---
  hw/virtio/vhost-vdpa.c    |  8 ++++----
  hw/virtio/vhost.c         | 16 +++++++++++++++-
  include/hw/virtio/vhost.h |  1 +
  4 files changed, 39 insertions(+), 8 deletions(-)

diff --git a/hw/net/vhost_net.c b/hw/net/vhost_net.c
index 30379d2..3cdf6a4 100644
--- a/hw/net/vhost_net.c
+++ b/hw/net/vhost_net.c
@@ -299,7 +299,7 @@ fail_notifiers:
  }
  static void vhost_net_stop_one(struct vhost_net *net,
-                               VirtIODevice *dev)
+                               VirtIODevice *dev, bool reset)
  {
      struct vhost_vring_file file = { .fd = -1 };
@@ -313,6 +313,11 @@ static void vhost_net_stop_one(struct vhost_net *net,
          net->nc->info->poll(net->nc, true);
      }
      vhost_dev_stop(&net->dev, dev);
+
+    if (reset) {
+        vhost_dev_reset(&net->dev);
+    }
+
      vhost_dev_disable_notifiers(&net->dev, dev);
  }
@@ -391,7 +396,12 @@ int vhost_net_start(VirtIODevice *dev, NetClientState *ncs,
  err_start:
      while (--i >= 0) {
          peer = qemu_get_peer(ncs , i);
-        vhost_net_stop_one(get_vhost_net(peer), dev);
+
+        if (i == 0) {
+            vhost_net_stop_one(get_vhost_net(peer), dev, true);
+        } else {
+            vhost_net_stop_one(get_vhost_net(peer), dev, false);
+        }
      }
      e = k->set_guest_notifiers(qbus->parent, total_notifiers, false);
      if (e < 0) {
@@ -420,7 +430,13 @@ void vhost_net_stop(VirtIODevice *dev, NetClientState *ncs,
          } else {
              peer = qemu_get_peer(ncs, n->max_queue_pairs);
          }
-        vhost_net_stop_one(get_vhost_net(peer), dev);
+
+        /* We only reset backend device during the last vhost */
+        if (i == nvhosts - 1) {
I wonder if there's any specific reason to position the device reset inside the for loop, given that there's no virtqueue-level reset? Wouldn't it be cleaner to reset the device at the end of vhost_net_stop() before returning? You could use qemu_get_peer(ncs, 0) without hassle. Note that the vhost_ops->vhost_reset_device op is per device rather than per VQ.

OK, resetting at the end of vhost_net_stop() is a good approach (roughly as in the sketch after this hunk); I will change it in the next version.


+            vhost_net_stop_one(get_vhost_net(peer), dev, true);
+        } else {
+            vhost_net_stop_one(get_vhost_net(peer), dev, false);
+        }
      }
      r = k->set_guest_notifiers(qbus->parent, total_notifiers, false);
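
(Sketch only, not part of the posted patch: if the reset moves out of the loop, the tail of vhost_net_stop() might look roughly like the snippet below; reaching the backend through qemu_get_peer(ncs, 0) and the omitted error handling are assumptions.)

    /* stop every vhost backend first; no reset inside the loop, so
     * vhost_net_stop_one() would no longer need a reset flag */
    for (i = 0; i < nvhosts; i++) {
        ...
        vhost_net_stop_one(get_vhost_net(peer), dev);
    }

    /* vhost_reset_device is per device, so one reset via peer 0 is enough */
    vhost_dev_reset(&get_vhost_net(qemu_get_peer(ncs, 0))->dev);

    r = k->set_guest_notifiers(qbus->parent, total_notifiers, false);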
diff --git a/hw/virtio/vhost-vdpa.c b/hw/virtio/vhost-vdpa.c
index c5ed7a3..d858b4f 100644
--- a/hw/virtio/vhost-vdpa.c
+++ b/hw/virtio/vhost-vdpa.c
@@ -719,14 +719,14 @@ static int vhost_vdpa_get_vq_index(struct vhost_dev *dev, int idx)
      return idx;
  }
-static int vhost_vdpa_set_vring_ready(struct vhost_dev *dev)
+static int vhost_vdpa_set_vring_ready(struct vhost_dev *dev, unsigned int ready)
  {
      int i;
      trace_vhost_vdpa_set_vring_ready(dev);
      for (i = 0; i < dev->nvqs; ++i) {
          struct vhost_vring_state state = {
              .index = dev->vq_index + i,
-            .num = 1,
+            .num = ready,
          };
          vhost_vdpa_call(dev, VHOST_VDPA_SET_VRING_ENABLE, &state);
      }
@@ -1088,8 +1088,9 @@ static int vhost_vdpa_dev_start(struct vhost_dev *dev, bool started)
          if (unlikely(!ok)) {
              return -1;
          }
-        vhost_vdpa_set_vring_ready(dev);
+        vhost_vdpa_set_vring_ready(dev, 1);
      } else {
+        vhost_vdpa_set_vring_ready(dev, 0);
          ok = vhost_vdpa_svqs_stop(dev);
          if (unlikely(!ok)) {
              return -1;
@@ -1105,7 +1106,6 @@ static int vhost_vdpa_dev_start(struct vhost_dev *dev, bool started)
          memory_listener_register(&v->listener, &address_space_memory);
          return vhost_vdpa_add_status(dev, VIRTIO_CONFIG_S_DRIVER_OK);
      } else {
-        vhost_vdpa_reset_device(dev);
          vhost_vdpa_add_status(dev, VIRTIO_CONFIG_S_ACKNOWLEDGE |
                                     VIRTIO_CONFIG_S_DRIVER);
Here's another issue (a regression) that has to be addressed: the added S_ACKNOWLEDGE | S_DRIVER bits will be cleared right away by the follow-up reset in vhost_net_stop_one(..., true), which in turn will cause virtio to fail to initialize, e.g. vhost_vdpa_set_features() will fail to set VIRTIO_CONFIG_S_FEATURES_OK.

Ideally the status bit should be set whenever the corresponding status bit is set by virtio_net from virtio_net_vhost_status(), or practically it can be done at the very beginning of vhost_dev_start(), e.g. as the first call before vhost_dev_set_features(). For this purpose, you may consider adding another vhost_init_device op, symmetric to vhost_ops->vhost_reset_device in the vhost_net_stop() path (a rough sketch follows this hunk).


It seems that only the vdpa device needs a reset after stop. Although the virtio spec says a reset is needed, the kernel vhost backend doesn't do one, and if it did there would be an issue re-probing virtio-net in the guest. So we should probably add the reset only if it is a vDPA device, for example by gating the reset on the backend type as sketched below; for the kernel and other datapaths we just keep the same behavior as before.
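
(Sketch only; the exact condition and placement are open to discussion.) The reset in vhost_net_stop_one() could be gated on the backend type:

    /* only the vhost-vdpa backend wants an explicit reset after stop */
    if (reset && net->nc->info->type == NET_CLIENT_DRIVER_VHOST_VDPA) {
        vhost_dev_reset(&net->dev);
    }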

Thanks,
Michael

Thanks,
-Siwei

          memory_listener_unregister(&v->listener);
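
(Illustrative sketch of the suggestion above: a vhost_init_device op does not exist in QEMU today, so the name and call site are assumptions. Invoked at the start of vhost_dev_start(), before feature negotiation, it could look roughly like this.)

    /* hypothetical hook, symmetric to vhost_ops->vhost_reset_device:
     * lets the backend re-arm ACKNOWLEDGE|DRIVER before features are set */
    if (hdev->vhost_ops->vhost_init_device) {
        r = hdev->vhost_ops->vhost_init_device(hdev);
        if (r < 0) {
            return r;
        }
    }

    r = vhost_dev_set_features(hdev, hdev->log_enabled);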
diff --git a/hw/virtio/vhost.c b/hw/virtio/vhost.c
index b643f42..6d9b4a3 100644
--- a/hw/virtio/vhost.c
+++ b/hw/virtio/vhost.c
@@ -1820,7 +1820,7 @@ fail_features:
  void vhost_dev_stop(struct vhost_dev *hdev, VirtIODevice *vdev)
  {
      int i;
-
+    printf("vhost_dev_stop test\n");
      /* should only be called after backend is connected */
      assert(hdev->vhost_ops);
@@ -1854,3 +1854,17 @@ int vhost_net_set_backend(struct vhost_dev *hdev,
      return -ENOSYS;
  }
+
+int vhost_dev_reset(struct vhost_dev *hdev)
+{
+    int ret = 0;
+
+    /* should only be called after backend is connected */
+    assert(hdev->vhost_ops);
+
+    if (hdev->vhost_ops->vhost_reset_device) {
+        ret = hdev->vhost_ops->vhost_reset_device(hdev);
+    }
+
+    return ret;
+}
diff --git a/include/hw/virtio/vhost.h b/include/hw/virtio/vhost.h
index 58a73e7..b8b7c20 100644
--- a/include/hw/virtio/vhost.h
+++ b/include/hw/virtio/vhost.h
@@ -114,6 +114,7 @@ int vhost_dev_init(struct vhost_dev *hdev, void *opaque,
  void vhost_dev_cleanup(struct vhost_dev *hdev);
  int vhost_dev_start(struct vhost_dev *hdev, VirtIODevice *vdev);
  void vhost_dev_stop(struct vhost_dev *hdev, VirtIODevice *vdev);
+int vhost_dev_reset(struct vhost_dev *hdev);
  int vhost_dev_enable_notifiers(struct vhost_dev *hdev, VirtIODevice *vdev);
  void vhost_dev_disable_notifiers(struct vhost_dev *hdev, VirtIODevice *vdev);
