Re: [PATCH 0/2] read kvmclock from guest memory if !correct_tsc

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH 0/2] read kvmclock from guest memory if !correct_tsc_shift

From:	Paolo Bonzini
Subject:	Re: [PATCH 0/2] read kvmclock from guest memory if !correct_tsc_shift
Date:	Fri, 20 Jan 2023 09:54:03 +0100
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.5.1

On 1/20/23 02:11, Marcelo Tosatti wrote:

Before kernel commit 78db6a5037965429c04d708281f35a6e5562d31b,
kvm_guest_time_update() would use vcpu->virtual_tsc_khz to calculate
tsc_shift value in the vcpus pvclock structure written to guest memory.


To clarify, the problem is that kvm_guest_time_update() uses the guest

TSC frequency *that userspace desired* instead of the *actual* TSCfrequency. Because, within the 250 ppm tolerance, TSC scaling is notenabled, the guest kvmclock is incorrect; KVM_GET_CLOCK instead returnsthe correct value, and the bug occurs when migrating from a host that ispublishing a buggy kvmclock to the guest.

For those kernels, if vcpu->virtual_tsc_khz != tsc_khz (which can be the
case when guest state is restored via migration, or if tsc-khz option is
passed to QEMU), and TSC scaling is not enabled (which happens if the
difference between the frequency requested via KVM_SET_TSC_KHZ and the
host TSC KHZ is smaller than 250ppm), then there can be a difference
between what KVM_GET_CLOCK would return and what the guest reads as
kvmclock value.

In practice, to trigger the bug you need to do two migrations from asix-year-old kernel; I just can't see too many people stumbling uponthis in the wild, and I don't think it makes sense to hobble _all_migrations from a kernel that is less than six years old for such anedge case. New versions of QEMU do not even support running with suchold kernels (it will for example complain about no support for certainKVM PV features).

It is not a huge request for the user to know if they are in theproblematic case. It is easiest to use a custom QEMU on thedestination, and always compute the kvmclock value from memory if thepage is valid.

Once you do a migration to the custom QEMU + a fixed kernel, the bug isgone for good and there is no need to introduce new user API for that.


Paolo

The effect is that the guest sees a jump in kvmclock value
(either forwards or backwards) in such case.

To fix incoming migration from pre-78db6a5037965 hosts,
read kvmclock value from guest memory.

Unless the KVM_CLOCK_CORRECT_TSC_SHIFT bit indicates
that the value retrieved by KVM_GET_CLOCK on the source
is safe to be used.

[Prev in Thread]

Current Thread

[Next in Thread]

[PATCH 0/2] read kvmclock from guest memory if !correct_tsc_shift, Marcelo Tosatti, 2023/01/19
- [PATCH 1/2] linux-headers: sync KVM_CLOCK_CORRECT_TSC_SHIFT flag, Marcelo Tosatti, 2023/01/19
- [PATCH 2/2] hw/i386/kvm/clock.c: read kvmclock from guest memory if !correct_tsc_shift, Marcelo Tosatti, 2023/01/19
- Re: [PATCH 0/2] read kvmclock from guest memory if !correct_tsc_shift, Paolo Bonzini <=

Prev by Date: Re: [PATCH v2 02/11] tests/qtest/boot-serial-test: Simplify test_machine() a bit
Next by Date: [PATCH 3/3] migration: save/delete migration thread info
Previous by thread: [PATCH 2/2] hw/i386/kvm/clock.c: read kvmclock from guest memory if !correct_tsc_shift
Next by thread: [PULL 00/12] Header cleanup patches for 2023-01-20
Index(es):
- Date
- Thread