qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH] Target-arm: Add the THUMB_DSP feature


From: Peter Crosthwaite
Subject: Re: [Qemu-devel] [PATCH] Target-arm: Add the THUMB_DSP feature
Date: Mon, 1 Jun 2015 11:36:15 -0700

On Mon, Jun 1, 2015 at 10:54 AM, Peter Maydell <address@hidden> wrote:
> On 1 June 2015 at 15:30, Aurelio C. Remonda <address@hidden> wrote:
>
> Thanks for sending this patch. I have a few comments below.
>
>> I created an ARM_FEATURE_THUMB_DSP to be added to any non-M
>> thumb2-compatible CPU that uses DSP instructions.
>> There are 85 DSP instructions (all of them thumb2). On disas_thumb2_insn
>> the DSP feature is tested before the instruction is generated; if it's not
>> enabled then its an illegal op.
>
> Our general style for commit messages tends to be a bit more
> impersonal, so for instance "Create an ARM_FEATURE_THUMB_DSP"
> rather than "I created...".
>
>> Signed-off-by: Aurelio C. Remonda <address@hidden>
>
> As I noted in the other patch, the code that sets the feature bit
> for non-M-profile Thumb2 CPUs needs to go in this patch.
>
>> ---
>>  target-arm/cpu.h       |   1 +
>>  target-arm/translate.c | 110 
>> +++++++++++++++++++++++++++++++++++++++++++++++--
>>  2 files changed, 107 insertions(+), 4 deletions(-)
>>
>> diff --git a/target-arm/cpu.h b/target-arm/cpu.h
>> index 21b5b8e..2e03d8e 100644
>> --- a/target-arm/cpu.h
>> +++ b/target-arm/cpu.h
>> @@ -890,6 +890,7 @@ enum arm_features {
>>      ARM_FEATURE_V8_SHA1, /* implements SHA1 part of v8 Crypto Extensions */
>>      ARM_FEATURE_V8_SHA256, /* implements SHA256 part of v8 Crypto 
>> Extensions */
>>      ARM_FEATURE_V8_PMULL, /* implements PMULL part of v8 Crypto Extensions 
>> */
>> +    ARM_FEATURE_THUMB_DSP, /* DSP insns supported in the Thumb encodings */
>>  };
>>
>>  static inline int arm_feature(CPUARMState *env, int feature)
>> diff --git a/target-arm/translate.c b/target-arm/translate.c
>> index 39692d7..2d14a2c 100644
>> --- a/target-arm/translate.c
>> +++ b/target-arm/translate.c
>> @@ -9444,6 +9444,10 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>
>>          op = (insn >> 21) & 0xf;
>>          if (op == 6) {
>> +            if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                /* pkhtb, pkfbt are DSP instructions  */
>
> These comments aren't really necessary I think -- it's obvious from
> the feature bit we're testing and from the code below what's
> going on.
>
>> +                goto illegal_op;
>> +            }
>>              /* Halfword pack.  */
>>              tmp = load_reg(s, rn);
>>              tmp2 = load_reg(s, rm);
>> @@ -9518,13 +9522,35 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>              switch (op) {
>>              case 0: gen_sxth(tmp);   break;
>>              case 1: gen_uxth(tmp);   break;
>> -            case 2: gen_sxtb16(tmp); break;
>> -            case 3: gen_uxtb16(tmp); break;
>> +            case 2:
>> +                if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                    /* sxtab16, sxtb16 are DSP instructions  */
>> +                    tcg_temp_free_i32(tmp);
>> +                    goto illegal_op;
>> +                }
>> +                gen_sxtb16(tmp);
>> +                break;
>> +            case 3:
>> +                if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                    /* uxtb16, uxtab16 are DSP instructions */
>> +                    tcg_temp_free_i32(tmp);
>> +                    goto illegal_op;
>> +                }
>> +                gen_uxtb16(tmp);
>> +                break;
>>              case 4: gen_sxtb(tmp);   break;
>>              case 5: gen_uxtb(tmp);   break;
>>              default: goto illegal_op;
>>              }
>>              if (rn != 15) {
>> +                if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                    /* sxtab, sxtah, uxtab, uxtah are DSP instructions.
>> +                     * sxtb, sxth, uxtb, uxth are not DSP according to
>> +                     * ARMv7-M Architecture Reference Manual
>> +                     */
>> +                    tcg_temp_free_i32(tmp);
>> +                    goto illegal_op;
>> +                }
>
> It looks like it would be fairly easy to hoist the illegal_op checks
> up above the load_reg() call, which then means we don't need to do
> a temp_free. You just need an extra switch on op like
>
>     switch (op) {
>     case 0: /* SXTAH, SXTH */
>     case 1: /* UXTAH, UXTH */
>     case 4: /* SXTAB, SXTB */
>     case 5: /* UXTAB, UXTB */
>         break;
>     case 2: /* SXTAB16, SXTB16 */
>     case 3: /* UXTAB16, UXTB16 */
>         if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>             goto illegal_op;
>         }
>         break;
>     default:
>         goto illegal_op;
>     }
> (The default: case in the original switch then becomes a
> g_assert_not_reached();)
>
> You can check for rn != 15 up here too.
>
>
>>                  tmp2 = load_reg(s, rn);
>>                  if ((op >> 1) == 1) {
>>                      gen_add16(tmp, tmp2);
>> @@ -9537,6 +9563,12 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>              break;
>>          case 2: /* SIMD add/subtract.  */
>>              op = (insn >> 20) & 7;
>> +            if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                /* add16, sub16, asx, sax, add8, sub8 (with q, s, sh, u, uh,
>> +                 * and uq variants) and usad8, usada8
>> +                 */
>
> If we want to document the instructions handled by this case then
> the comment belongs at the start of it, before the assignment to 'op'.
>
>> +                goto illegal_op;
>> +            }
>>              shift = (insn >> 4) & 7;
>>              if ((op & 3) == 3 || (shift & 3) == 3)
>>                  goto illegal_op;
>> @@ -9550,6 +9582,10 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>              op = ((insn >> 17) & 0x38) | ((insn >> 4) & 7);
>>              if (op < 4) {
>>                  /* Saturating add/subtract.  */
>> +                if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                    /* qsub, qadd, qdadd, qdsub are DSP instructions. */
>> +                    goto illegal_op;
>> +                }
>>                  tmp = load_reg(s, rn);
>>                  tmp2 = load_reg(s, rm);
>>                  if (op & 1)
>> @@ -9575,6 +9611,11 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>                      gen_revsh(tmp);
>>                      break;
>>                  case 0x10: /* sel */
>> +                    if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                    /* sel is a DSP instruction. */
>> +                    tcg_temp_free_i32(tmp);
>> +                    goto illegal_op;
>> +                    }
>
> This check could also be hoisted up to above the allocation of tmp.
>
> Your indentation on this section seems to have gone wrong.
>
> If you run your patches through scripts/checkpatch.pl they will catch
> this kind of style error for you.
>
>>                      tmp2 = load_reg(s, rm);
>>                      tmp3 = tcg_temp_new_i32();
>>                      tcg_gen_ld_i32(tmp3, cpu_env, offsetof(CPUARMState, 
>> GE));
>> @@ -9640,6 +9681,14 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>                  }
>>                  break;
>>              case 1: /* 16 x 16 -> 32 */
>> +                if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                    /* smlabb, smlabt, smlatb, smlatt, smulbb, smulbt, 
>> smultt
>> +                     * and smultb are DSP instructions
>> +                     */
>> +                    tcg_temp_free_i32(tmp);
>> +                    tcg_temp_free_i32(tmp2);
>> +                    goto illegal_op;
>> +                }
>>                  gen_mulxy(tmp, tmp2, op & 2, op & 1);
>>                  tcg_temp_free_i32(tmp2);
>>                  if (rs != 15) {
>> @@ -9650,6 +9699,12 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>                  break;
>>              case 2: /* Dual multiply add.  */
>>              case 4: /* Dual multiply subtract.  */
>> +                if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                    /* smlad, smladx, smlsd, smusd are DSP instructions */
>> +                    tcg_temp_free_i32(tmp);
>> +                    tcg_temp_free_i32(tmp2);
>> +                    goto illegal_op;
>> +                }
>>                  if (op)
>>                      gen_swap_half(tmp2);
>>                  gen_smul_dual(tmp, tmp2);
>> @@ -9672,6 +9727,12 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>                    }
>>                  break;
>>              case 3: /* 32 * 16 -> 32msb */
>> +                if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                    /* smlawb, smlawt, smulwt, smulwb are DSP instructions 
>> */
>> +                    tcg_temp_free_i32(tmp);
>> +                    tcg_temp_free_i32(tmp2);
>> +                    goto illegal_op;
>> +                }
>>                  if (op)
>>                      tcg_gen_sari_i32(tmp2, tmp2, 16);
>>                  else
>> @@ -9689,6 +9750,14 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>                    }
>>                  break;
>>              case 5: case 6: /* 32 * 32 -> 32msb (SMMUL, SMMLA, SMMLS) */
>> +                if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                    /* smmla, smmls, smmul, smuad, smmlar,
>> +                     * smmlsr, smmulr are DSP instructions
>> +                     */
>> +                    tcg_temp_free_i32(tmp);
>> +                    tcg_temp_free_i32(tmp2);
>> +                    goto illegal_op;
>> +                }
>>                  tmp64 = gen_muls_i64_i32(tmp, tmp2);
>>                  if (rs != 15) {
>>                      tmp = load_reg(s, rs);
>
> Again, all these checks inside this switch() would be more cleanly handled
> by having an initial switch() before we allocate the temporaries that deals
> with the illegal_op cases.
>
>> @@ -9735,6 +9804,12 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>                  store_reg(s, rd, tmp);
>>              } else if ((op & 0xe) == 0xc) {
>>                  /* Dual multiply accumulate long.  */
>> +                if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                    /* smlald, smlsld are DSP instructions */
>> +                    tcg_temp_free_i32(tmp);
>> +                    tcg_temp_free_i32(tmp2);
>> +                    goto illegal_op;
>> +                }
>>                  if (op & 1)
>>                      gen_swap_half(tmp2);
>>                  gen_smul_dual(tmp, tmp2);
>
>> @@ -9758,6 +9833,14 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>                  } else {
>>                      if (op & 8) {
>>                          /* smlalxy */
>> +                        if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                            /* smlalbb, smlalbt, smlaltb, smlaltt
>> +                             * are DSP instructions
>> +                             */
>> +                            tcg_temp_free_i32(tmp2);
>> +                            tcg_temp_free_i32(tmp);
>> +                            goto illegal_op;
>> +                        }
>>                          gen_mulxy(tmp, tmp2, op & 2, op & 1);
>>                          tcg_temp_free_i32(tmp2);
>>                          tmp64 = tcg_temp_new_i64();
>
> Doing the illegal_op checks early for these two would require
> more painful surgery to the code so I think they're OK like this.
>
>> @@ -9770,6 +9853,11 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>                  }
>>                  if (op & 4) {
>>                      /* umaal */
>> +                    if (!arm_dc_feature(s, ARM_FEATURE_THUMB_DSP)) {
>> +                        /* ummal is a DSP instruction */
>> +                        tcg_temp_free_i64(tmp64);
>> +                        goto illegal_op;
>> +                    }
>>                      gen_addq_lo(s, tmp64, rs);
>>                      gen_addq_lo(s, tmp64, rd);
>>                  } else if (op & 0x40) {
>> @@ -10034,14 +10122,28 @@ static int disas_thumb2_insn(CPUARMState *env, 
>> DisasContext *s, uint16_t insn_hw
>>                          tmp2 = tcg_const_i32(imm);
>>                          if (op & 4) {
>>                              /* Unsigned.  */
>> -                            if ((op & 1) && shift == 0)
>> +                            if ((op & 1) && shift == 0){
>
> You need a space before the '{' (again, checkpatch will tell you this).
>
>> +                                if (!arm_dc_feature(s, 
>> ARM_FEATURE_THUMB_DSP)) {
>> +                                    /* usat16 is a DSP instruction */
>> +                                    tcg_temp_free_i32(tmp);
>> +                                    tcg_temp_free_i32(tmp2);
>> +                                    goto illegal_op;
>> +                                }
>>                                  gen_helper_usat16(tmp, cpu_env, tmp, tmp2);
>> +                            }
>>                              else
>>                                  gen_helper_usat(tmp, cpu_env, tmp, tmp2);
>
> ...and since you're touching the if() statement you need to also add
> braces on the else half of it. (Checkpatch again.)
>
>>                          } else {
>>                              /* Signed.  */
>> -                            if ((op & 1) && shift == 0)
>> +                            if ((op & 1) && shift == 0){
>> +                                if (!arm_dc_feature(s, 
>> ARM_FEATURE_THUMB_DSP)) {
>> +                                    /* ssat16 is a DSP instruction */
>> +                                    tcg_temp_free_i32(tmp);
>> +                                    tcg_temp_free_i32(tmp2);
>> +                                    goto illegal_op;
>> +                                }
>>                                  gen_helper_ssat16(tmp, cpu_env, tmp, tmp2);
>> +                            }
>>                              else

Also note that the } is on same line as the else. End result will be:

} else {

Regards,
Peter

>>                                  gen_helper_ssat(tmp, cpu_env, tmp, tmp2);
>>                          }
>
> Same comments apply for this if..else.
>
>> 1.9.1
>>
>
> thanks
> -- PMM
>



reply via email to

[Prev in Thread] Current Thread [Next in Thread]