qemu-stable
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v1 2/2] virtio-balloon: disallow postcopy with VIRTIO_BALLOON


From: David Hildenbrand
Subject: Re: [PATCH v1 2/2] virtio-balloon: disallow postcopy with VIRTIO_BALLOON_F_FREE_PAGE_HINT
Date: Thu, 8 Jul 2021 09:23:09 +0200
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.11.0

On 07.07.21 23:22, Alexander Duyck wrote:
On Wed, Jul 7, 2021 at 1:08 PM Peter Xu <peterx@redhat.com> wrote:

On Wed, Jul 07, 2021 at 08:57:29PM +0200, David Hildenbrand wrote:
On 07.07.21 20:02, Peter Xu wrote:
On Wed, Jul 07, 2021 at 04:06:55PM +0200, David Hildenbrand wrote:
As it never worked properly, let's disable it via the postcopy notifier on
the destination. Trying to set "migrate_set_capability postcopy-ram on"
on the destination now results in "virtio-balloon: 'free-page-hint' does
not support postcopy Error: Postcopy is not supported".

Would it be possible to do this in reversed order?  Say, dynamically disable
free-page-hinting if postcopy capability is set when migration starts? Perhaps
it can also be re-enabled automatically when migration completes?

I remember that this might be quite racy. We would have to make sure that no
hinting happens before we enable the capability.

As soon as we messed with the dirty bitmap (during precopy), postcopy is no
longer safe. As noted in the patch, the only runtime alternative is to
disable postcopy as soon as we actually do clear a bit. Alternatively, we
could ignore any hints if the postcopy capability was enabled.

Logically migration capabilities are applied at VM starts, and these
capabilities should be constant during migration (I didn't check if there's a
hard requirement; easy to add that if we want to assure it), and in most cases
for the lifecycle of the vm.

Would it make sense to maybe just look at adding a postcopy value to
the PrecopyNotifyData that you could populate with
migration_in_postcopy() in precopy_notify()?

Then all you would need to do is check for that value and if it is set
you shut down the page hinting or don't start it since I suspect it
wouldn't likely add any value anyway since I would think flagging
unused pages doesn't add much value in a postcopy environment anyway.

I don't think that's true. With free page hinting you reduce the effective VM size you have to migrate. Any page that has to be migrated will consume bandwidth.

1. Although postcopy transfers only the currently requested pages, the background thread will keep pushing pages, making postcopy eventually run longer. While in postcopy (well, and in precopy) we are faced with a clear performance degradation, so we want to minimize the overall time spent.

2. Usually you let precopy run for a while before switching to postcopy. With free page hinting you might be able to greatly reduce the number of pages you'll have to migrate later in the same amount of time.


So there would be value, but at least I am not too interested in making it work in combination perfectly if it results in significant migration code changes; my goal is to not silently break guests when used in combination -- once there is the actual requirement to optimize this setup, we can work on that optimization (as discussed with MST here).

So I'll explore going the migrate_postcopy_ram() way to silently (or at least warn) disable free page hinting. Thanks.

--
Thanks,

David / dhildenb




reply via email to

[Prev in Thread] Current Thread [Next in Thread]