[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Qemu-riscv] [Qemu-devel] [RFC v1 12/23] riscv: tcg-target: Add the
From: |
Richard Henderson |
Subject: |
Re: [Qemu-riscv] [Qemu-devel] [RFC v1 12/23] riscv: tcg-target: Add the mov and movi instruction |
Date: |
Fri, 16 Nov 2018 09:55:42 +0100 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.3.0 |
On 11/15/18 11:35 PM, Alistair Francis wrote:
> Signed-off-by: Alistair Francis <address@hidden>
> Signed-off-by: Michael Clark <address@hidden>
> ---
> tcg/riscv/tcg-target.inc.c | 62 ++++++++++++++++++++++++++++++++++++++
> 1 file changed, 62 insertions(+)
>
> diff --git a/tcg/riscv/tcg-target.inc.c b/tcg/riscv/tcg-target.inc.c
> index 475feca906..0e891e24c9 100644
> --- a/tcg/riscv/tcg-target.inc.c
> +++ b/tcg/riscv/tcg-target.inc.c
> @@ -422,6 +422,68 @@ static void patch_reloc(tcg_insn_unit *code_ptr, int
> type,
> }
> }
>
> +/*
> + * TCG intrinsics
> + */
> +
> +static void tcg_out_mov(TCGContext *s, TCGType type, TCGReg ret, TCGReg arg)
> +{
> + if (ret == arg) {
> + return;
> + }
> + switch (type) {
> + case TCG_TYPE_I32:
> + case TCG_TYPE_I64:
> + tcg_out_opc_imm(s, OPC_ADDI, ret, arg, 0);
> + break;
> + default:
> + g_assert_not_reached();
> + }
> +}
> +
> +static void tcg_out_movi(TCGContext *s, TCGType type, TCGReg rd,
> + tcg_target_long val)
> +{
> + tcg_target_long lo = sextract32(val, 0, 12);
sextract64, otherwise you'll make wrong decisions for rv64.
(Although it might be worthwhile to add a local alias so that rv32 doesn't do
more work than necessary.)
> + tcg_target_long hi = val - lo;
> +
> + RISCVInsn add32_op = TCG_TARGET_REG_BITS == 64 ? OPC_ADDIW : OPC_ADDI;
> +
> +#if TCG_TARGET_REG_BITS == 64
> + ptrdiff_t offset = tcg_pcrel_diff(s, (void *)val);
> +#endif
> +
> + if (val == lo) {
> + tcg_out_opc_imm(s, OPC_ADDI, rd, TCG_REG_ZERO, val);
return;
}
Should match
if (TCG_TARGET_REG_BITS == 32 || val == (int32_t)val) {
tcg_out_opc_upper(s, OPC_LUI, rd, hi);
if (lo != 0) {
tcg_out_opc_imm(s, add32_op, rd, rd, lo);
}
return;
}
here.
(1) Almost all values requested are 32-bit constants, so check the most common
cases first. (2) You know hi != 0 because you just eliminated val == lo. (3)
This handles the cases where LUI alone can load the constant, e.g. 0x1000,
which would otherwise have been matched by your power-of-two test.
> + } else if (val && !(val & (val - 1))) {
> + /* power of 2 */
> + tcg_out_opc_imm(s, OPC_ADDI, rd, TCG_REG_ZERO, 1);
> + tcg_out_opc_imm(s, OPC_SLLI, rd, rd, ctz64(val));
There's no reason to restrict this to powers of 2 and a shift of the constant 1:
shift = ctz64(val);
tmp = val >> shift;
if (tmp == sextract64(tmp, 0, 12))
> + } else if (TCG_TARGET_REG_BITS == 64 &&
> + !(val >> 31 == 0 || val >> 31 == -1)) {
> + int shift = 12 + ctz64(hi >> 12);
This is just ctz64(hi), since you've already cleared the lo 12 bits.
> + hi >>= shift;
> + tcg_out_movi(s, type, rd, hi);
> + tcg_out_opc_imm(s, OPC_SLLI, rd, rd, shift);
> + if (lo != 0) {
> + tcg_out_opc_imm(s, OPC_ADDI, rd, rd, lo);
> + }
> +#if TCG_TARGET_REG_BITS == 64
> + } else if (offset == sextract32(offset, 1, 31) << 1) {
sextract64.
> + tcg_out_opc_upper(s, OPC_AUIPC, rd, 0);
> + tcg_out_opc_imm(s, OPC_ADDI, rd, rd, 0);
> + reloc_call(s->code_ptr - 2, (tcg_insn_unit *)val);
> +#endif
Move this pc-rel case above the fully general case and then you can make the
fully general case unconditional. Also, that preserves an invariant of
increasing order of complexity of the cases. No need for the ifdef, since this
code should be removed as dead for rv32 (which saw the lui+addi case as
unconditional).
r~
- Re: [Qemu-riscv] [Qemu-devel] [RFC v1 09/23] riscv: tcg-target: Add the immediate encoders, (continued)
[Qemu-riscv] [RFC v1 10/23] riscv: tcg-target: Add the instruction emitters, Alistair Francis, 2018/11/15
[Qemu-riscv] [RFC v1 12/23] riscv: tcg-target: Add the mov and movi instruction, Alistair Francis, 2018/11/15
- Re: [Qemu-riscv] [Qemu-devel] [RFC v1 12/23] riscv: tcg-target: Add the mov and movi instruction,
Richard Henderson <=
[Qemu-riscv] [RFC v1 13/23] riscv: tcg-target: Add the extract instructions, Alistair Francis, 2018/11/15
[Qemu-riscv] [RFC v1 14/23] riscv: tcg-target: Add the out load and store instructions, Alistair Francis, 2018/11/15
[Qemu-riscv] [RFC v1 15/23] riscv: tcg-target: Add branch and jump instructions, Alistair Francis, 2018/11/15
[Qemu-riscv] [RFC v1 16/23] riscv: tcg-target: Add slowpath load and store instructions, Alistair Francis, 2018/11/15