[Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_syn

From:	Avi Kivity
Subject:	[Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes"
Date:	Wed, 25 Aug 2010 10:14:59 +0300
User-agent:	Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.8) Gecko/20100806 Fedora/3.1.2-1.fc13 Thunderbird/3.1.2

 On 08/24/2010 04:56 PM, Anthony Liguori wrote:

One doesn't follow from the other (though I'm no fan of internalsnapshots, myself).
It does.  Let's consider the failure scenarios:

1) guest submits write request
2) allocate extent
3) write data to disk (a)
4) write (a) completes
5) update reference count table for new extent (b)
6) write (b) completes
7) write extent table (c)
8) write (c) completes
9) complete guest write request
If this all happened in order and we lost power, the worst case erroris that we leak a block which isn't terrible.
But we're not guaranteed that this happens in order.
If (b) or (c) happen before (a), then the image is not corrupted butdata gets lost. That's okay because it's part of the guest contract.
If (c) happens before (b), then we've created an extent that'sattached to a table with a zero reference count. This is a corruptimage.

If the only issue is new block allocation, it can be easily solved.Instead of allocating exactly the needed amount of blocks, allocate alarge extent and hold them in memory. The next allocation can then befilled from memory, so the allocation sync is amortized over manyblocks. A power fail will leak the preallocated blocks, losing somemegabytes of address space, but not real disk space.

Let's consider if we eliminate the reference count table which meanseliminating internal snapshots.
1) guest submits write request
2) allocate extent
3) write data to disk (a)
4) write (a) completes
5) write extent table (c)
6) write (c) completes
7) complete guest write request
If this all happens in order and we lose power, we just leak a block.It means we need a periodic fsck.
If (c) completes before (a), then it means that the image is notcorrupted but data gets lost. This is okay based on the guest contract.
And that's it.  There is no scenario where the disk is corrupted.


_if_ that's the only failure mode.


--
I have a truly marvellous patch that fixes the bug which this
signature is too narrow to contain.

[Prev in Thread]

Current Thread

[Next in Thread]

[Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", (continued)
- [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Juan Quintela, 2010/08/24
- [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Anthony Liguori, 2010/08/24
  - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Kevin Wolf, 2010/08/24
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Anthony Liguori, 2010/08/24
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Avi Kivity, 2010/08/24
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Anthony Liguori, 2010/08/24
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Avi Kivity, 2010/08/24
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Anthony Liguori, 2010/08/24
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Avi Kivity, 2010/08/24
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Anthony Liguori, 2010/08/24
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Avi Kivity <=
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Anthony Liguori, 2010/08/25
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Avi Kivity, 2010/08/25
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Anthony Liguori, 2010/08/25
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Avi Kivity, 2010/08/25
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Anthony Liguori, 2010/08/25
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Avi Kivity, 2010/08/25
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Anthony Liguori, 2010/08/25
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Avi Kivity, 2010/08/25
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Anthony Liguori, 2010/08/25
    - [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes", Avi Kivity, 2010/08/25

Prev by Date: [Qemu-devel] Da lau khong hoi tham suc khoe!
Next by Date: [Qemu-devel] [Bug 595438] Re: KVM segmentation fault, using SCSI+writeback and linux 2.4 guest
Previous by thread: [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes"
Next by thread: [Qemu-devel] Re: [RFC][STABLE 0.13] Revert "qcow2: Use bdrv_(p)write_sync for metadata writes"
Index(es):
- Date
- Thread