Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to u

qemu-ppc

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to u

From:	BALATON Zoltan
Subject:	Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations
Date:	Tue, 11 Dec 2018 04:03:36 +0100 (CET)
User-agent:	Alpine 2.21.9999 (BSF 287 2018-06-16)

On Tue, 11 Dec 2018, David Gibson wrote:

On Mon, Dec 10, 2018 at 09:54:51PM +0100, BALATON Zoltan wrote:

Yes, I don't really know what these tests use but I think "lame" test is
mostly floating point but tried with "lame_vmx" which should at least use
some vector ops and "mplayer -benchmark" test is more vmx dependent based on
my previous profiling and testing with hardfloat but I'm not sure. (When
testing these with hardfloat I've found that lame was benefiting from
hardfloat but mplayer wasn't and more VMX related functions showed up with
mplayer so I assumed it's more VMX bound.)


I should clarify here.  When I say "floating point" above, I'm not
meaning things using the regular FPU instead of the vector unit.  I'm
saying *anything* involving floating point calculations whether
they're done in the FPU or the vector unit.

OK that clarifies it. I admit I was only testing these but didn't havetime to look what changed exactly.

The patches here don't convert all VMX instructions to use vector TCG
ops - they only convert a few, and those few are about using the
vector unit for integer (and logical) operations.  VMX instructions
involving floating point calculations are unaffected and will still
use soft-float.

What I've said above about lame test being more FPU and mplayer more VMXintensive probably still holds as I've retried now on a Haswell i5 and got1-2% difference with lame_vmx and ~6% with mplayer. That's very littleimprovement but if only some VMX instructions should be faster then thismay make sense.

These tests are not the best, maybe there are better ways to measure thisbut I don't know of any,

Maybe the PPC softmmu should be reviewed and optimised by someone who knows
it...


I'm not sure there is anyone who knows it at this point.  I probably
know it as well as anybody, and the ppc32 code scares me.  It's a
crufty mess and it would be nice to clean up, but that requires
someone with enough time and interest.

At least this seems to be a big bottleneck in PPC emulation and one that'snot being worked on (others like hardfloat and VMX while not finished andstill lot to do but already there are some results but no one is lookingat softmmu). I was just trying to direct some attention to that softmmumay also need some optimisation and hope someone would notice this. I havesome interest but not much time these days and if it scares you whatshould I say. I don't even understand most of it so it would take a lot oftime to even get how it works and what would need to be done. So I hopesomeone with more time or knowledge shows up and maybe at least providessome hints on what may need to be done.


Regards,
BALATON Zoltan

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Qemu-ppc] [RFC PATCH 6/6] target/ppc: convert vaddu[b, h, w, d] and vsubu[b, h, w, d] over to use vector operations, (continued)
- [Qemu-ppc] [RFC PATCH 3/6] target/ppc: introduce get_cpu_vsr{l, h}() and set_cpu_vsr{l, h}() helpers for VSR register access, Mark Cave-Ayland, 2018/12/07
  - Re: [Qemu-ppc] [RFC PATCH 3/6] target/ppc: introduce get_cpu_vsr{l, h}() and set_cpu_vsr{l, h}() helpers for VSR register access, Richard Henderson, 2018/12/10
    - Re: [Qemu-ppc] [Qemu-devel] [RFC PATCH 3/6] target/ppc: introduce get_cpu_vsr{l, h}() and set_cpu_vsr{l, h}() helpers for VSR register access, Mark Cave-Ayland, 2018/12/11
- Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, BALATON Zoltan, 2018/12/09
  - Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, David Gibson, 2018/12/09
    - Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, BALATON Zoltan, 2018/12/10
    - Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, Richard Henderson, 2018/12/10
    - Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, BALATON Zoltan, 2018/12/10
    - Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, David Gibson, 2018/12/10
    - Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, BALATON Zoltan <=
    - Re: [Qemu-ppc] [Qemu-devel] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, Mark Cave-Ayland, 2018/12/11
    - Re: [Qemu-ppc] [Qemu-devel] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, Richard Henderson, 2018/12/11
- Re: [Qemu-ppc] [Qemu-devel] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, Aleksandar Markovic, 2018/12/10
  - Re: [Qemu-ppc] [Qemu-devel] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations, Mark Cave-Ayland, 2018/12/11

Prev by Date: Re: [Qemu-ppc] [PATCH v7 15/19] spapr/xive: enable XIVE MMIOs at reset
Next by Date: Re: [Qemu-ppc] [PATCH qemu] ppc/spapr: Receive and store device tree blob from SLOF
Previous by thread: Re: [Qemu-ppc] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations
Next by thread: Re: [Qemu-ppc] [Qemu-devel] [RFC PATCH 0/6] target/ppc: convert VMX instructions to use TCG vector operations
Index(es):
- Date
- Thread