[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [PATCH v5 0/9] block: Add retry for werror=/rerror= mechanism
From: |
Jiahui Cen |
Subject: |
Re: [PATCH v5 0/9] block: Add retry for werror=/rerror= mechanism |
Date: |
Wed, 10 Feb 2021 09:22:38 +0800 |
User-agent: |
Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.2.2 |
Kindly ping.
Any comments and reviews are wellcome :)
Thanks,
Jiahui
On 2021/2/5 18:13, Jiahui Cen wrote:
> A VM in the cloud environment may use a virutal disk as the backend storage,
> and there are usually filesystems on the virtual block device. When backend
> storage is temporarily down, any I/O issued to the virtual block device
> will cause an error. For example, an error occurred in ext4 filesystem would
> make the filesystem readonly. In production environment, a cloud backend
> storage can be soon recovered. For example, an IP-SAN may be down due to
> network failure and will be online soon after network is recovered. However,
> the error in the filesystem may not be recovered unless a device reattach
> or system restart. Thus an I/O retry mechanism is in need to implement a
> self-healing system.
>
> This patch series propose to extend the werror=/rerror= mechanism to add
> a 'retry' feature. It can automatically retry failed I/O requests on error
> without sending error back to guest, and guest can get back running smoothly
> when I/O is recovred.
>
> v4->v5:
> * Add document for 'retry' in qapi.
> * Support werror=/rerror=retry for scsi-disk.
> * Pause retry when draining.
>
> v3->v4:
> * Adapt to werror=/rerror= mechanism.
>
> v2->v3:
> * Add a doc to describe I/O hang.
>
> v1->v2:
> * Rebase to fix compile problems.
> * Fix incorrect remove of rehandle list.
> * Provide rehandle pause interface.
>
> REF: https://lists.gnu.org/archive/html/qemu-devel/2020-10/msg06560.html
>
> Jiahui Cen (9):
> qapi/block-core: Add retry option for error action
> block-backend: Introduce retry timer
> block-backend: Add device specific retry callback
> block-backend: Enable retry action on errors
> block-backend: Add timeout support for retry
> block: Add error retry param setting
> virtio_blk: Add support for retry on errors
> scsi-bus: Refactor the code that retries requests
> scsi-disk: Add support for retry on errors
>
> block/block-backend.c | 68 ++++++++++++++++++++
> blockdev.c | 52 +++++++++++++++
> hw/block/block.c | 10 +++
> hw/block/virtio-blk.c | 21 +++++-
> hw/scsi/scsi-bus.c | 16 +++--
> hw/scsi/scsi-disk.c | 16 +++++
> include/hw/block/block.h | 7 +-
> include/hw/scsi/scsi.h | 1 +
> include/sysemu/block-backend.h | 10 +++
> qapi/block-core.json | 9 ++-
> 10 files changed, 199 insertions(+), 11 deletions(-)
>
- Re: [PATCH v5 1/9] qapi/block-core: Add retry option for error action, (continued)
- [PATCH v5 7/9] virtio_blk: Add support for retry on errors, Jiahui Cen, 2021/02/05
- [PATCH v5 5/9] block-backend: Add timeout support for retry, Jiahui Cen, 2021/02/05
- [PATCH v5 9/9] scsi-disk: Add support for retry on errors, Jiahui Cen, 2021/02/05
- [PATCH v5 3/9] block-backend: Add device specific retry callback, Jiahui Cen, 2021/02/05
- [PATCH v5 6/9] block: Add error retry param setting, Jiahui Cen, 2021/02/05
- [PATCH v5 4/9] block-backend: Enable retry action on errors, Jiahui Cen, 2021/02/05
- Re: [PATCH v5 0/9] block: Add retry for werror=/rerror= mechanism,
Jiahui Cen <=
- Re: [PATCH v5 0/9] block: Add retry for werror=/rerror= mechanism, Stefan Hajnoczi, 2021/02/22
- Re: [PATCH v5 0/9] block: Add retry for werror=/rerror= mechanism, Stefan Hajnoczi, 2021/02/23