[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v2] target/arm: Speed up aarch64 TBL/TBX

From: Alex Bennée
Subject: Re: [PATCH v2] target/arm: Speed up aarch64 TBL/TBX
Date: Thu, 25 Feb 2021 09:52:51 +0000
User-agent: mu4e 1.5.8; emacs 28.0.50

Richard Henderson <richard.henderson@linaro.org> writes:

> Always perform one call instead of two for 16-byte operands.
> Use byte loads/stores directly into the vector register file
> instead of extractions and deposits to a 64-bit local variable.
> In order to easily receive pointers into the vector register file,
> convert the helper to the gvec out-of-line signature.  Move the
> helper into vec_helper.c, where it can make use of H1 and clear_tail.
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
Tested-by: Alex Bennée <alex.bennee@linaro.org>

Looked marginally slower on the (1!) run I did but probably in the noise
and the generated code looks nicer.

Alex Bennée

reply via email to

[Prev in Thread] Current Thread [Next in Thread]