[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Qemu-devel] [PATCH v10 20/23] target-arm/powerctl: defer cpu reset
From: |
Alex Bennée |
Subject: |
Re: [Qemu-devel] [PATCH v10 20/23] target-arm/powerctl: defer cpu reset work to CPU context |
Date: |
Tue, 07 Feb 2017 16:52:14 +0000 |
User-agent: |
mu4e 0.9.19; emacs 25.2.1 |
Peter Maydell <address@hidden> writes:
> On 6 February 2017 at 15:31, Alex Bennée <address@hidden> wrote:
>> When switching a new vCPU on we want to complete a bunch of the setup
>> work before we start scheduling the vCPU thread. To do this cleanly we
>> defer vCPU setup to async work which will run the vCPUs execution
>> context as the thread is woken up. The scheduling of the work will kick
>> the vCPU awake.
>>
>> This avoids potential races in MTTCG system emulation.
>>
>> Signed-off-by: Alex Bennée <address@hidden>
>> Reviewed-by: Richard Henderson <address@hidden>
>
>> @@ -77,7 +159,7 @@ int arm_set_cpu_on(uint64_t cpuid, uint64_t entry,
>> uint64_t context_id,
>> }
>>
>> target_cpu = ARM_CPU(target_cpu_state);
>> - if (!target_cpu->powered_off) {
>> + if (!atomic_mb_read(&target_cpu->powered_off)) {
>> qemu_log_mask(LOG_GUEST_ERROR,
>> "[ARM]%s: CPU %" PRId64 " is already on\n",
>> __func__, cpuid);
>
>> + /*
>> + * If another CPU has powered the target on we are in the state
>> + * ON_PENDING and additional attempts to power on the CPU should
>> + * fail (see 6.6 Implementation CPU_ON/CPU_OFF races in the PSCI
>> + * spec)
>> + */
>> + if (atomic_cmpxchg(&target_cpu->powering_on, false, true)) {
>> + qemu_log_mask(LOG_GUEST_ERROR,
>> + "[ARM]%s: CPU %" PRId64 " is already powering on\n",
>> + __func__, cpuid);
>> + return QEMU_ARM_POWERCTL_ON_PENDING;
>> }
>
> I find this too hard to reason about :-( We store the current
> CPU state in not one but two CPU structure fields, and then
> we check and manipulate it not by doing "take lock; access
> and update; drop lock" but by doing successive atomic
> operations on the various different flags. This seems unnecessarily
> tricky, and the patch doesn't provide any documentation about
> what's going on and what the constraints are to avoid races.
I was trying to avoid changing the powered_off state to a new variable
(maybe pcsi_power_state?) because it meant I didn't have to touch the
kvm code that treated powered_off as a simple bool. However I can unify
the state into one variable if you wish.
As for locking do we have a handy per-CPU lock already or would it be
enough to use the BQL for this? Although that may be problematic as the
cpu_reset/on can come from two places (the initial realization and the
runtime switching of power state.)
>
>>
>> - /* Start the new CPU at the requested address */
>> - cpu_set_pc(target_cpu_state, entry);
>> + /* To avoid racing with a CPU we are just kicking off we do the
>> + * final bit of preparation for the work in the target CPUs
>> + * context.
>> + */
>> + info = g_new(struct cpu_on_info, 1);
>> + info->entry = entry;
>> + info->context_id = context_id;
>> + info->target_el = target_el;
>> + info->target_aa64 = target_aa64;
>>
>> - qemu_cpu_kick(target_cpu_state);
>> + async_run_on_cpu(target_cpu_state, arm_set_cpu_on_async_work,
>> + RUN_ON_CPU_HOST_PTR(info));
>>
>> /* We are good to go */
>> return QEMU_ARM_POWERCTL_RET_SUCCESS;
>> }
>
>> @@ -221,8 +284,9 @@ int arm_reset_cpu(uint64_t cpuid)
>> return QEMU_ARM_POWERCTL_IS_OFF;
>> }
>>
>> - /* Reset the cpu */
>> - cpu_reset(target_cpu_state);
>> + /* Queue work to run under the target vCPUs context */
>> + async_run_on_cpu(target_cpu_state, arm_reset_cpu_async_work,
>> + RUN_ON_CPU_NULL);
>
> The imx6 datasheet says that the RST bit in the SCR register
> is set by the guest to trigger a reset, and then the bit
> remains set until the reset has finished (at which point it
> self-clears). At the moment the imx6_src.c code does this:
>
> if (EXTRACT(change_mask, CORE0_RST)) {
> arm_reset_cpu(0);
> clear_bit(CORE0_RST_SHIFT, ¤t_value);
> }
>
> ie it assumes that arm_reset_cpu() is synchronous.
Well for this case the imx6 could queue the "clear_bit" function as
async work and then it will be correctly reported after the reset has
occurred.
>
> PS: if the code does an arm_set_cpu_on() and then
> arm_reset_cpu() do we do the two lots of async work
> in the right order?
Yes. The actual run queue is protected by a lock and run on a FIFO
basis.
>
>> return QEMU_ARM_POWERCTL_RET_SUCCESS;
>> }
>> diff --git a/target/arm/arm-powerctl.h b/target/arm/arm-powerctl.h
>> index 98ee04989b..04353923c0 100644
>> --- a/target/arm/arm-powerctl.h
>> +++ b/target/arm/arm-powerctl.h
>> @@ -17,6 +17,7 @@
>> #define QEMU_ARM_POWERCTL_INVALID_PARAM QEMU_PSCI_RET_INVALID_PARAMS
>> #define QEMU_ARM_POWERCTL_ALREADY_ON QEMU_PSCI_RET_ALREADY_ON
>> #define QEMU_ARM_POWERCTL_IS_OFF QEMU_PSCI_RET_DENIED
>> +#define QEMU_ARM_POWERCTL_ON_PENDING QEMU_PSCI_RET_ON_PENDING
>>
>> /*
>> * arm_get_cpu_by_id:
>> @@ -43,6 +44,7 @@ CPUState *arm_get_cpu_by_id(uint64_t cpuid);
>> * Returns: QEMU_ARM_POWERCTL_RET_SUCCESS on success.
>> * QEMU_ARM_POWERCTL_INVALID_PARAM if bad parameters are provided.
>> * QEMU_ARM_POWERCTL_ALREADY_ON if the CPU was already started.
>> + * QEMU_ARM_POWERCTL_ON_PENDING if the CPU is still powering up
>> */
>> int arm_set_cpu_on(uint64_t cpuid, uint64_t entry, uint64_t context_id,
>> uint32_t target_el, bool target_aa64);
>> diff --git a/target/arm/cpu.h b/target/arm/cpu.h
>> index 39bff86daf..b8f82d5d20 100644
>> --- a/target/arm/cpu.h
>> +++ b/target/arm/cpu.h
>> @@ -582,8 +582,16 @@ struct ARMCPU {
>>
>> /* Should CPU start in PSCI powered-off state? */
>> bool start_powered_off;
>> - /* CPU currently in PSCI powered-off state */
>> + /* CPU PSCI state.
>> + *
>> + * For TCG these can be cross-vCPU accesses and should be done
>> + * atomically to avoid races.
>> + *
>> + * - powered_off indicates the vCPU state
>> + * - powering_on true if the vCPU has had a CPU_ON but not yet up
>> + */
>> bool powered_off;
>> + bool powering_on; /* PSCI ON_PENDING */
>> /* CPU has virtualization extension */
>> bool has_el2;
>> /* CPU has security extension */
>> --
>> 2.11.0
>>
>
> thanks
> -- PMM
--
Alex Bennée
- [Qemu-devel] [PATCH v10 15/23] cputlb: introduce tlb_flush_* async work., (continued)
- [Qemu-devel] [PATCH v10 15/23] cputlb: introduce tlb_flush_* async work., Alex Bennée, 2017/02/06
- [Qemu-devel] [PATCH v10 18/23] cputlb: atomically update tlb fields used by tlb_reset_dirty, Alex Bennée, 2017/02/06
- [Qemu-devel] [PATCH v10 16/23] cputlb and arm/sparc targets: convert mmuidx flushes from varg to bitmap, Alex Bennée, 2017/02/06
- [Qemu-devel] [PATCH v10 19/23] cputlb: introduce tlb_flush_*_all_cpus[_synced], Alex Bennée, 2017/02/06
- [Qemu-devel] [PATCH v10 23/23] tcg: enable MTTCG by default for ARM on x86 hosts, Alex Bennée, 2017/02/06
- [Qemu-devel] [PATCH v10 21/23] target-arm: don't generate WFE/YIELD calls for MTTCG, Alex Bennée, 2017/02/06
- [Qemu-devel] [PATCH v10 22/23] target-arm: ensure all cross vCPUs TLB flushes complete, Alex Bennée, 2017/02/06
- [Qemu-devel] [PATCH v10 20/23] target-arm/powerctl: defer cpu reset work to CPU context, Alex Bennée, 2017/02/06
- Re: [Qemu-devel] [PATCH v10 00/23] MTTCG Base enabling patches with ARM enablement, Pranith Kumar, 2017/02/06