[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Qemu-devel] [PATCH v11 00/20] tcg: generic vector operations
From: |
Alex Bennée |
Subject: |
Re: [Qemu-devel] [PATCH v11 00/20] tcg: generic vector operations |
Date: |
Tue, 06 Feb 2018 11:24:38 +0000 |
User-agent: |
mu4e 1.0-alpha3; emacs 26.0.91 |
Richard Henderson <address@hidden> writes:
> Changes since v11:
> * Use dup_const more.
> * Cleanup some gvec 2i and 2s routines.
> * Use more helpers and less gotos in target/arm/translate-a64.c.
I think this series is good to go.
A quick word on performance. I saw a slight dip for the string sort in
Emilio's dbt-bench/nbench:
https://i.imgur.com/K5AFr1u.png
And:
NBench score; higher is better
140 +-+-----+------+-------+-------+------+-------+-------+------+-----+-+
| **** |
| * *## development |
120 +-+.........................*..*.#....................master.......+-+
| * * # |
100 +-+............####.........*..*.#.................................+-+
| # # * * # |
| *** # * * # |
80 +-+..........*.*..#.........*..*.#.........****###.................+-+
| * * # * * # * * # |
60 +-+..........*.*..#.........*..*.#..***###.*..*..#.........***###..+-+
| * * # ****### * * # * * # * * # * * # |
| * * # * * # * * # * * # * * # ****## * * # |
40 +-+..........*.*..#.*..*..#.*..*.#..*.*..#.*..*..#.*..*.#..*.*..#..+-+
| * * # * * # * * # * * # * * # * * # * * # |
20 +-+..........*.*..#.*..*..#.*..*.#..*.*..#.*..*..#.*..*.#..*.*..#..+-+
| ****## * * # * * # * * # * * # * * # * * # * * # |
| * * # * * # * * # * * # * * # * * # * * # * * # |
0 +-+--****##--***###-****###-****##--***###-****###-****##--***###--+-+
NUMERIC STRING SOBITFIEFP EMULAASSIGNMENT IDEA HUFFMAN gmean
We think this is likely the strajust function which hits a loop
utilising a single vector. We already know a single vector-op is a worse
case given the latency but this improves if the code is -funrolled or
ultimately re-built with support for bigger vectors ;-)
I certainly don't think it's a blocker to merging given the other
benchmarks look pretty good including slight wins on others.
>
>
> Richard Henderson (20):
> tcg: Allow multiple word entries into the constant pool
> tcg: Add types and basic operations for host vectors
> tcg: Standardize integral arguments to expanders
> tcg: Add generic vector expanders
> tcg: Add generic vector ops for constant shifts
> tcg: Add generic vector ops for comparisons
> tcg: Add generic vector ops for multiplication
> tcg: Add generic helpers for saturating arithmetic
> tcg: Add generic vector helpers with a scalar operand
> tcg/optimize: Handle vector opcodes during optimize
> target/arm: Align vector registers
> target/arm: Use vector infrastructure for aa64 add/sub/logic
> target/arm: Use vector infrastructure for aa64 mov/not/neg
> target/arm: Use vector infrastructure for aa64 dup/movi
> target/arm: Use vector infrastructure for aa64 constant shifts
> target/arm: Use vector infrastructure for aa64 compares
> target/arm: Use vector infrastructure for aa64 multiplies
> target/arm: Use vector infrastructure for aa64 orr/bic immediate
> tcg/i386: Add vector operations
> tcg/aarch64: Add vector operations
>
> Makefile.target | 4 +-
> accel/tcg/tcg-runtime.h | 118 +++
> target/arm/cpu.h | 2 +-
> tcg/aarch64/tcg-target.h | 25 +-
> tcg/aarch64/tcg-target.opc.h | 3 +
> tcg/i386/tcg-target.h | 41 +-
> tcg/i386/tcg-target.opc.h | 13 +
> tcg/tcg-gvec-desc.h | 49 +
> tcg/tcg-op-gvec.h | 306 ++++++
> tcg/tcg-op.h | 52 +-
> tcg/tcg-opc.h | 46 +
> tcg/tcg.h | 87 ++
> accel/tcg/tcg-runtime-gvec.c | 997 +++++++++++++++++++
> target/arm/translate-a64.c | 979 ++++++++++++++-----
> tcg/aarch64/tcg-target.inc.c | 588 ++++++++++-
> tcg/i386/tcg-target.inc.c | 987 ++++++++++++++++++-
> tcg/optimize.c | 150 +--
> tcg/tcg-op-gvec.c | 2215
> ++++++++++++++++++++++++++++++++++++++++++
> tcg/tcg-op-vec.c | 389 ++++++++
> tcg/tcg-op.c | 42 +-
> tcg/tcg-pool.inc.c | 115 ++-
> tcg/tcg.c | 125 ++-
> accel/tcg/Makefile.objs | 2 +-
> configure | 48 +
> tcg/README | 86 ++
> 25 files changed, 6973 insertions(+), 496 deletions(-)
> create mode 100644 tcg/aarch64/tcg-target.opc.h
> create mode 100644 tcg/i386/tcg-target.opc.h
> create mode 100644 tcg/tcg-gvec-desc.h
> create mode 100644 tcg/tcg-op-gvec.h
> create mode 100644 accel/tcg/tcg-runtime-gvec.c
> create mode 100644 tcg/tcg-op-gvec.c
> create mode 100644 tcg/tcg-op-vec.c
--
Alex Bennée
- Re: [Qemu-devel] [PATCH v11 00/20] tcg: generic vector operations,
Alex Bennée <=