From: Emilio G. Cota
Subject: Re: [Qemu-devel] [PATCH 00/10] cputlb: track dirty tlbs and general cleanup
Date: Tue, 23 Oct 2018 13:11:14 -0400
User-agent: Mutt/1.9.4 (2018-02-28)

On Tue, Oct 23, 2018 at 08:02:42 +0100, Richard Henderson wrote:
> The motivation here is reducing the total overhead.
> 
> Before a few patches went into target-arm.next, I measured total
> tlb flush overhead for aarch64 at 25%.  This appears to reduce the
> total overhead to about 5% (I do need to re-run the control tests,
> not just watch perf top as I'm doing now).

I'd like to see those absolute perf numbers; I ran a few Ubuntu aarch64
boots and the noise is just too high to draw any conclusions (I'm
using your tlb-dirty branch on github).

When booting the much smaller Debian image, though, these patches are
performance-neutral. So,
  Reviewed-by: Emilio G. Cota <address@hidden>
for the series.

(On a pedantic note: consider s/miniscule/minuscule/ in patches 6-7)

> The final patch is somewhat of an RFC.  I'd like to know what
> benchmark was used when putting in pending_tlb_flushes, and I
> have not done any archaeology to find out.  I suspect that it
> does not make any measurable difference beyond tlb_c.dirty, and I
> think the code is a bit cleaner without it.

I suspect that pending_tlb_flushes was a premature optimization.
Avoiding a redundant async job sounds like a good idea, since the job
is very expensive for the remote vCPU. In most cases, however, we take
the lock (a full barrier in the original code) and still end up queuing
the async job, because two flushes racing on the same vCPU is unlikely;
the cycles spent on the lock (formerly the barrier) are therefore
wasted.
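
To make the trade-off concrete, here is a minimal sketch (not the actual
cputlb code; vcpu, pending_flush_mask and queue_async_flush are made-up
names, with queue_async_flush merely standing in for something like
async_run_on_cpu()). Pattern A pays an atomic read-modify-write on every
cross-vCPU flush and only saves the async job when two flushes race on
the same vCPU; pattern B always queues and skips the bookkeeping:

/* Illustrative sketch only, not QEMU code. */
#include <stdatomic.h>
#include <stdint.h>
#include <stdio.h>

struct vcpu {
    _Atomic uint16_t pending_flush_mask; /* mmu indexes with a flush queued */
};

/* Stand-in for scheduling async work on the remote vCPU; here it only logs. */
static void queue_async_flush(struct vcpu *cpu, uint16_t idxmap)
{
    (void)cpu;
    printf("queued async flush for idxmap 0x%x\n", idxmap);
}

/* Pattern A: every caller pays for the atomic read-modify-write, but the
 * early return only helps when two flushes race on the same vCPU. */
static void flush_remote_checked(struct vcpu *cpu, uint16_t idxmap)
{
    uint16_t old = atomic_fetch_or(&cpu->pending_flush_mask, idxmap);
    if ((old & idxmap) == idxmap) {
        return; /* a flush covering these indexes is already pending */
    }
    queue_async_flush(cpu, idxmap);
}

/* Pattern B: skip the bookkeeping and always queue; the common
 * (non-racing) case avoids the synchronization cost entirely. */
static void flush_remote_simple(struct vcpu *cpu, uint16_t idxmap)
{
    queue_async_flush(cpu, idxmap);
}

int main(void)
{
    struct vcpu cpu = { .pending_flush_mask = 0 };
    flush_remote_checked(&cpu, 0x3); /* queues */
    flush_remote_checked(&cpu, 0x3); /* suppressed as already pending */
    flush_remote_simple(&cpu, 0x3);  /* always queues */
    return 0;
}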

Thanks,

                Emilio


