From: Anthony Liguori
Subject: Re: [Qemu-devel] [RFC PATCH 00/20] Kemari for KVM v0.1
Date: Fri, 23 Apr 2010 08:20:21 -0500
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.5) Gecko/20091209 Fedora/3.0-4.fc12 Lightning/1.0pre Thunderbird/3.0
On 04/22/2010 08:53 PM, Yoshiaki Tamura wrote:
> Anthony Liguori wrote:
>> On 04/22/2010 08:16 AM, Yoshiaki Tamura wrote:
>>> 2010/4/22 Dor Laor <address@hidden>:
>>>> On 04/22/2010 01:35 PM, Yoshiaki Tamura wrote:
>>>>> Dor Laor wrote:
>>>>>> On 04/21/2010 08:57 AM, Yoshiaki Tamura wrote:
>>>>>>> Hi all,
>>>>>>>
>>>>>>> We have been implementing the prototype of Kemari for KVM, and we're
>>>>>>> sending this message to share what we have now and our TODO lists.
>>>>>>> Hopefully, we would like to get early feedback to keep us in the
>>>>>>> right direction. Although the advanced approaches in the TODO lists
>>>>>>> are fascinating, we would like to run this project step by step
>>>>>>> while absorbing comments from the community. The current code is
>>>>>>> based on qemu-kvm.git 2b644fd0e737407133c88054ba498e772ce01f27.
>>>>>>>
>>>>>>> For those who are new to Kemari for KVM, please take a look at the
>>>>>>> following RFC which we posted last year.
>>>>>>>
>>>>>>> http://www.mail-archive.com/address@hidden/msg25022.html
>>>>>>>
>>>>>>> The transmission/transaction protocol and most of the control logic
>>>>>>> are implemented in QEMU. However, we needed a hack in KVM to prevent
>>>>>>> rip from proceeding before synchronizing VMs. It may also need some
>>>>>>> plumbing on the kernel side to guarantee replayability of certain
>>>>>>> events and instructions, to integrate the RAS capabilities of newer
>>>>>>> x86 hardware with the HA stack, and for optimization purposes, for
>>>>>>> example.
>>>>>>>
>>>>>>> [snap]
>>>>>>>
>>>>>>> The rest of this message describes the TODO lists grouped by topic.
>>>>>>>
>>>>>>> === event tapping ===
>>>>>>>
>>>>>>> Event tapping is the core component of Kemari, and it decides on
>>>>>>> which event the primary should synchronize with the secondary. The
>>>>>>> basic assumption here is that outgoing I/O operations are
>>>>>>> idempotent, which is usually true for disk I/O and reliable network
>>>>>>> protocols such as TCP.
>>>>>>
>>>>>> IMO any type of network event should be stalled too. What if the VM
>>>>>> runs a non-TCP protocol and the packet that the master node sent
>>>>>> reached some remote client, and the master failed before the sync to
>>>>>> the slave?
>>>>>
>>>>> In the current implementation, it is actually stalling any type of
>>>>> network that goes through virtio-net. However, if the application was
>>>>> using unreliable protocols, it should have its own recovery
>>>>> mechanism, or it should be completely stateless.
>>>>
>>>> Why do you treat TCP differently? You can damage the entire VM this
>>>> way - think of a DHCP request that was dropped at the moment you
>>>> switched between the master and the slave.
>>>
>>> I'm not trying to say that we should treat TCP differently, but just
>>> that it's severe. In the case of a DHCP request, the client would have
>>> a chance to retry after failover, correct? BTW, in the current
>>> implementation,
>>
>> I'm slightly confused about the current implementation vs. my
>> recollection of the original paper with Xen. I had thought that all
>> disk and network I/O was buffered in such a way that at each
>> checkpoint, the I/O operations would be released in a burst. Otherwise,
>> you would have to synchronize after every I/O operation, which is what
>> it seems the current implementation does.
>
> Yes, you're almost right. It's synchronizing before QEMU starts
> emulating I/O at each device model.
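To spell out the two policies being contrasted here — synchronizing before every outgoing I/O operation versus buffering outputs and releasing them in a burst at each checkpoint — the following is a minimal, purely illustrative Python sketch. All names are invented for illustration; this is not Kemari or QEMU code.

```python
# Illustrative sketch only -- not Kemari/QEMU code. It contrasts two
# checkpoint policies for a stream of outgoing I/O events:
#   sync_per_io : checkpoint before every output (what the thread says
#                 the current implementation does)
#   epoch_buffer: buffer outputs and release them in a burst at each
#                 periodic checkpoint (the Xen/Remus-style scheme)

def sync_per_io(events):
    """Checkpoint once before every outgoing I/O event."""
    checkpoints = 0
    released = []
    for ev in events:
        checkpoints += 1        # primary/secondary sync before each output
        released.append(ev)     # output leaves the machine immediately
    return checkpoints, released

def epoch_buffer(events, epoch_len):
    """Buffer outputs; release them only when a checkpoint epoch commits."""
    checkpoints = 0
    buffered, released = [], []
    for i, ev in enumerate(events, start=1):
        buffered.append(ev)           # output held back, not yet visible
        if i % epoch_len == 0:        # checkpoint boundary reached
            checkpoints += 1
            released.extend(buffered)  # burst release after the sync
            buffered.clear()
    return checkpoints, released, buffered

if __name__ == "__main__":
    ios = [f"pkt{i}" for i in range(10)]
    cp_a, _ = sync_per_io(ios)
    cp_b, _, _ = epoch_buffer(ios, epoch_len=5)
    print(cp_a, cp_b)  # 10 synchronizations vs. 2
```

The point of the sketch is the cost model: per-I/O synchronization scales with the number of outputs, while epoch buffering amortizes one synchronization over many outputs at the price of added output latency.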
If NodeA is the master and NodeB is the slave, and NodeA sends a network packet, you'll checkpoint before the packet is actually sent; then, if a failure occurs before the next checkpoint, won't that result in both NodeA and NodeB sending out a duplicate version of the packet?
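The failure scenario in that question can be sketched as a toy timeline. Again, this is illustrative only, with invented names, not Kemari code: it shows why "checkpoint before send" can put the same packet on the wire twice, while "buffer until the checkpoint commits" releases at most one copy.

```python
# Illustrative timeline sketch only -- invented names, not Kemari code.

def checkpoint_before_send(fail_before_next_checkpoint):
    """NodeA checkpoints (capturing the pending send), then transmits."""
    wire = []                       # packets observed on the network
    checkpointed_state = ["pkt"]    # the pending send is in the checkpoint
    wire.append("pkt")              # NodeA actually transmits it
    if fail_before_next_checkpoint:
        # NodeB resumes from the checkpoint and replays the pending send,
        # so the network sees the packet a second time.
        wire.extend(checkpointed_state)
    return wire

def buffer_until_checkpoint(fail_before_next_checkpoint):
    """NodeA holds the packet back until the next checkpoint commits."""
    wire = []
    buffered = ["pkt"]              # held back on NodeA, not yet visible
    if fail_before_next_checkpoint:
        # The epoch never committed: NodeA's buffered copy is discarded,
        # and NodeB re-executes and transmits the packet exactly once.
        wire.append("pkt")
    else:
        wire.extend(buffered)       # released once, at the checkpoint
    return wire
```

Under this toy model, `checkpoint_before_send(True)` yields two copies of the packet on the wire, while `buffer_until_checkpoint` yields exactly one in either outcome.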
Regards,

Anthony Liguori