From: Thomas Treutner
Subject: Re: [Qemu-devel] [PATCH] A small patch to introduce stop conditions to the live migration.
Date: Thu, 15 Sep 2011 10:27:45 +0200
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.21) Gecko/20110831 Thunderbird/3.1.13

On 14.09.2011 17:45, Anthony Liguori wrote:
> On 09/14/2011 08:18 AM, Thomas Treutner wrote:
>> Currently, it is possible that a live migration never finishes, when
>> the dirty page rate is high compared to the scan/transfer rate. The
>> exact values for MAX_MEMORY_ITERATIONS and
>> MAX_TOTAL_MEMORY_TRANSFER_FACTOR are arguable, but there should be
>> *some* limit to force the final iteration of a live migration that
>> does not converge.

> No, there shouldn't be.

I think there should be. The iterative pre-copy mechanism depends entirely on the assumption of convergence. Currently, the very real chance that this assumption does not hold is simply ignored, which to me is burying one's head in the sand.
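
Just to make concrete what the patch amounts to: it boils down to a check like the rough sketch below. The constant names are the ones from the patch description; their values and the surrounding bookkeeping are only illustrative assumptions, not the actual qemu migration code.

  #include <stdint.h>

  #define MAX_MEMORY_ITERATIONS            30
  #define MAX_TOTAL_MEMORY_TRANSFER_FACTOR  3

  /* Illustrative sketch, not qemu's actual ram migration loop. */
  static int force_final_iteration(uint64_t iterations,
                                   uint64_t bytes_transferred,
                                   uint64_t ram_bytes_total)
  {
      /* Stop iterating if we have looped over RAM too many times ... */
      if (iterations > MAX_MEMORY_ITERATIONS) {
          return 1;
      }
      /* ... or have already sent several times the guest's RAM size. */
      if (bytes_transferred >
          MAX_TOTAL_MEMORY_TRANSFER_FACTOR * ram_bytes_total) {
          return 1;
      }
      return 0;
  }

When either limit is hit, the migration would simply enter the final stop-and-copy phase instead of iterating forever.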

> A management app

I do not know of any management app that takes care of this. Can you give an example where management app developers actually knew about this issue and took care of it? I didn't see any big warning regarding migration; I just stumbled upon it by coincidence. libvirt nowadays just seems to program around MAX_THROTTLE, which is another PITA. As a user, I can and have to assume that a certain function actually does what it promises, and if it can't for whatever reason, it throws an error. Would you be happy with a function that promises to write a file, but if the given location is not writable, just sits there and waits forever until you somehow notice, manually, why and what the remedy is?

> can always stop a guest to force convergence.

What do you mean by stop exactly? Pausing the guest? Is it then automatically unpaused by qemu again at the destination host?


> If you make migration have unbounded downtime by default
> then you're making migration unsafe for smarter consumers.

I'd prefer that to having the common case unsafe. If migration doesn't converge, it currently finishes eventually, at some distant point in time, only because the VM's service suffers so severely from the migration that it can do less and less page dirtying. In reality, users would quickly stop using the service, as response times go through the roof and they run into network timeouts. Having a single, longer downtime is better than a potentially everlasting unresponsive VM.

> You can already set things like maximum downtime to force convergence.

The maximum downtime parameter seems to be a nice switch, but it is another example of surprise. The value you choose is not even within an order of magnitude of what actually happens, as the "bandwidth" used for the calculation seems to be a buffer bandwidth, not the real network bandwidth. Even with extremely aggressive bridge timings, there is a factor of ~20 between the default 30ms setting and the actual result.
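
The check itself is conceptually just a division, roughly as in the sketch below; the variable names are mine, not qemu's, and the only point is where the bandwidth figure comes from.

  #include <stdbool.h>
  #include <stdint.h>

  /* Rough sketch of the convergence check, not qemu's actual code.
   * If 'bandwidth_bytes_per_sec' is measured against the migration
   * buffer instead of the wire, expected_downtime is far too optimistic
   * and the real stop-and-copy phase overshoots max_downtime_sec by a
   * large factor. */
  static bool expected_downtime_ok(uint64_t remaining_dirty_bytes,
                                   double bandwidth_bytes_per_sec,
                                   double max_downtime_sec)
  {
      double expected_downtime =
          remaining_dirty_bytes / bandwidth_bytes_per_sec;
      return expected_downtime <= max_downtime_sec;
  }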

I know the policy - arguable, in my pov - is "just give progress info when requested (although our algorithm strictly requires steady progress, but we do not want to hear that when things go hot) and let mgmt apps decide", but that is not implemented correctly either. First, because of the bandwidth/downtime issue above; second, because of incorrect memory transfer amounts, where duplicate (unused?) pages are accounted as 1 byte of transfer. That may be correct from the physical point of view, but from a logical, management app point of view the migration has progressed by a full page, not just 1 byte. It is hard to argue that mgmt apps should take care of things working out nicely when the information given to them is not consistent with itself, and the switches presented to them do something, but not in any way what they promise.
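
To illustrate the accounting point (the helper names here are made up, only the 1-byte vs. full-page bookkeeping matters):

  #include <stdint.h>

  #define TARGET_PAGE_SIZE 4096

  /* A page consisting of a single repeated byte (e.g. an all-zero page)
   * can be sent as one byte on the wire. */
  static int is_dup_page(const uint8_t *page)
  {
      int i;
      for (i = 1; i < TARGET_PAGE_SIZE; i++) {
          if (page[i] != page[0]) {
              return 0;
          }
      }
      return 1;
  }

  /* Physically correct accounting, but a management app watching
   * bytes_transferred sees almost no progress, even though a full page
   * of guest memory has in fact been migrated. */
  static void account_page(const uint8_t *page, uint64_t *bytes_transferred)
  {
      if (is_dup_page(page)) {
          *bytes_transferred += 1;                /* one byte on the wire */
      } else {
          *bytes_transferred += TARGET_PAGE_SIZE; /* the full page */
      }
  }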

> If you wanted to have some logic like an exponentially increasing
> maximum downtime given a fixed timeout, that would be okay provided it
> was an optional feature.

I'm already doing a similar thing using libvirt (roughly as sketched below). I'm only coming back to this because such an approach causes lots of pain and cluttered code, while the original issue can be solved with 3-4 changed lines of code in qemu.
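
Roughly, the libvirt side looks like the sketch below. virDomainMigrateSetMaxDowntime() is a real libvirt call (taking milliseconds); the flag, the thread setup and the constants are simplified assumptions.

  #include <libvirt/libvirt.h>
  #include <unistd.h>

  /* Sketch: a second thread keeps doubling the allowed downtime while a
   * blocking virDomainMigrate() runs in another thread. */
  static volatile int migrating;  /* set/cleared around virDomainMigrate() */

  static void *bump_max_downtime(void *arg)
  {
      virDomainPtr dom = arg;
      unsigned long long downtime_ms = 30;  /* qemu's 30ms default */

      while (migrating) {
          sleep(10);                        /* crude polling interval */
          downtime_ms *= 2;                 /* exponentially increasing */
          virDomainMigrateSetMaxDowntime(dom, downtime_ms, 0);
      }
      return NULL;
  }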

AFAIK, there is neither a way to synchronize on the actual start of the migration (so you can start polling and set a custom downtime value) nor to synchronize on its end (so you know when to stop polling). As a result, one is playing around with crude sleeps, hoping that the migration, although of course already triggered, has actually started, and then trying in vain not to step on any invalidated data structures while monitoring the progress in a second thread, as no one knows when the main thread with the blocking live migration will pull the rug out from under the monitoring thread's feet. Lots of code is then needed to clean up this holy mess, and a SEGV happens regularly: http://pastebin.com/jT6sXubu

I don't know of any way to reliably and cleanly solve this issue within "a management app", as I don't see any mechanism by which the main thread could signal a monitoring thread to stop monitoring *before* it pulls the rug. Sending the signal directly after the migration call unblocks is not enough; I've tried that, and the result is linked above. There is still a window in which both threads are in the critical section.
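
Schematically, the window I mean (virDomainGetJobInfo() is the real libvirt call; everything else is simplified):

  #include <libvirt/libvirt.h>
  #include <unistd.h>

  static volatile int migrating;  /* cleared by the main thread right
                                     after virDomainMigrate() returns */

  static void *monitor_progress(void *arg)
  {
      virDomainPtr dom = arg;
      virDomainJobInfo info;

      while (migrating) {
          /* Race: 'migrating' can be cleared between this check and the
           * call below, while the main thread is already tearing down
           * the state behind 'dom'. */
          if (virDomainGetJobInfo(dom, &info) == 0) {
              /* look at info.dataRemaining, info.memRemaining, ... */
          }
          sleep(1);
      }
      return NULL;
  }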


regards,
thomas


