qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH 1/2] ARM: add neon helpers for VQSHLU


From: Aurelien Jarno
Subject: Re: [Qemu-devel] [PATCH 1/2] ARM: add neon helpers for VQSHLU
Date: Tue, 4 Jan 2011 23:04:51 +0100
User-agent: Mutt/1.5.20 (2009-06-14)

On Mon, Jan 03, 2011 at 04:20:47PM +0000, Peter Maydell wrote:
> Add neon helper functions to implement VQSHLU, which is a
> signed-to-unsigned version of VQSHL available only as an
> immediate form.
> 
> Signed-off-by: Juha Riihimäki <address@hidden>
> Reviewed-by: Peter Maydell <address@hidden>
> ---
>  target-arm/helpers.h     |    4 +++
>  target-arm/neon_helper.c |   47 
> ++++++++++++++++++++++++++++++++++++++++++++++
>  2 files changed, 51 insertions(+), 0 deletions(-)
> 
> diff --git a/target-arm/helpers.h b/target-arm/helpers.h
> index 0d1bc47..b88ebae 100644
> --- a/target-arm/helpers.h
> +++ b/target-arm/helpers.h
> @@ -249,6 +249,10 @@ DEF_HELPER_3(neon_qshl_u32, i32, env, i32, i32)
>  DEF_HELPER_3(neon_qshl_s32, i32, env, i32, i32)
>  DEF_HELPER_3(neon_qshl_u64, i64, env, i64, i64)
>  DEF_HELPER_3(neon_qshl_s64, i64, env, i64, i64)
> +DEF_HELPER_3(neon_qshlu_s8, i32, env, i32, i32);
> +DEF_HELPER_3(neon_qshlu_s16, i32, env, i32, i32);
> +DEF_HELPER_3(neon_qshlu_s32, i32, env, i32, i32);
> +DEF_HELPER_3(neon_qshlu_s64, i64, env, i64, i64);
>  DEF_HELPER_3(neon_qrshl_u8, i32, env, i32, i32)
>  DEF_HELPER_3(neon_qrshl_s8, i32, env, i32, i32)
>  DEF_HELPER_3(neon_qrshl_u16, i32, env, i32, i32)
> diff --git a/target-arm/neon_helper.c b/target-arm/neon_helper.c
> index dae063e..20f3c16 100644
> --- a/target-arm/neon_helper.c
> +++ b/target-arm/neon_helper.c
> @@ -632,6 +632,53 @@ uint64_t HELPER(neon_qshl_s64)(CPUState *env, uint64_t 
> valop, uint64_t shiftop)
>      return val;
>  }
>  
> +#define NEON_FN(dest, src1, src2) do { \
> +    if (src1 & (1 << (sizeof(src1) * 8 - 1))) { \
> +        SET_QC(); \
> +        dest = 0; \
> +    } else { \
> +        int8_t tmp; \
> +        tmp = (int8_t)src2; \
> +        if (tmp >= (ssize_t)sizeof(src1) * 8) { \
> +            if (src1) { \
> +                SET_QC(); \
> +                dest = ~0; \
> +            } else { \
> +                dest = 0; \
> +            } \
> +        } else if (tmp <= -(ssize_t)sizeof(src1) * 8) { \
> +            dest = 0; \
> +        } else if (tmp < 0) { \
> +            dest = src1 >> -tmp; \
> +        } else { \
> +            dest = src1 << tmp; \
> +            if ((dest >> tmp) != src1) { \
> +                SET_QC(); \
> +                dest = ~0; \
> +            } \
> +        } \
> +    }} while (0)
> +NEON_VOP_ENV(qshlu_s8, neon_u8, 4)
> +NEON_VOP_ENV(qshlu_s16, neon_u16, 2)
> +#undef NEON_FN
> +
> +uint32_t HELPER(neon_qshlu_s32)(CPUState *env, uint32_t valop, uint32_t 
> shiftop)
> +{
> +    if ((int32_t)valop < 0) {
> +        SET_QC();
> +        return 0;
> +    }
> +    return helper_neon_qshl_u32(env, valop, shiftop);
> +}

I guess defining this function using helper_neon_qshl_u32() instead of
using NEON_VOP_ENV(qshlu_s32, neon_u32, 1) is to optimize the generated
code. Note that it only make 48 bytes difference, with one jump.

> +uint64_t HELPER(neon_qshlu_s64)(CPUState *env, uint64_t valop, uint64_t 
> shiftop)
> +{
> +    if ((int64_t)valop < 0) {
> +        SET_QC();
> +        return 0;
> +    }
> +    return helper_neon_qshl_u64(env, valop, shiftop);
> +}
>  
>  /* FIXME: This is wrong.  */
>  #define NEON_FN(dest, src1, src2) do { \

Reviewed-by: Aurelien Jarno <address@hidden>

-- 
Aurelien Jarno                          GPG: 1024D/F1BCDB73
address@hidden                 http://www.aurel32.net



reply via email to

[Prev in Thread] Current Thread [Next in Thread]