qemu-devel

Re: [PATCH v3 04/14] tcg/riscv: Add riscv vset{i}vli support


From: Richard Henderson
Date: Mon, 9 Sep 2024 21:34:58 -0700
User-agent: Mozilla Thunderbird

On 9/9/24 19:46, LIU Zhiwei wrote:
>>     lmul = type - riscv_lg2_vlenb;
>>     if (lmul < -3) {
>>         /* Host VLEN >= 1024 bits. */
>>         vlmul = VLMUL_M1;
>
> I am not sure if we should use VLMUL_MF8,

Perhaps.  See below.

>>     } else if (lmul < 3) {
>>         /* 1/8 ... 1 ... 8 */
>>         vlmul = lmul & 7;
>>         lmul_eq_avl = true;
>>     } else {
>>         /* Guaranteed by Zve64x. */
>>         g_assert_not_reached();
>>     }
>>
>>     avl = tcg_type_size(type) >> vsew;
>>     vtype = encode_vtype(true, true, vsew, vlmul);
>>
>>     if (avl < 32) {
>>         insn = encode_i(OPC_VSETIVLI, TCG_REG_ZERO, avl, vtype);
>
> Which may benefit here? We usually use the smallest lmul we can for
> macro-op splitting.

lmul is unchanged, just explicitly setting AVL as well.
The "benefit" is that AVL is visible in the disassembly,
and that we are able to discard the result.

There doesn't appear to be a downside.  Is there one?

>>     } else if (lmul_eq_avl) {
>>         /* rd != 0 and rs1 == 0 uses vlmax */
>>         insn = encode_i(OPC_VSETVLI, TCG_REG_TMP0, TCG_REG_ZERO, vtype);

As opposed to here, where we must clobber a register.
It is a scratch reg, sure, and probably affects nothing
in any microarch which does register renaming.

>>     } else {
>>         tcg_out_opc_imm(s, OPC_ADDI, TCG_REG_TMP0, TCG_REG_ZERO, avl);
>>         insn = encode_i(OPC_VSETVLI, TCG_REG_ZERO, TCG_REG_TMP0, vtype);
>
> And perhaps here.

Here, lmul does *not* equal avl, so we must set avl explicitly; and because we are not using VSETIVLI, we also know that it does not fit in uimm5.

But here's a follow-up question regarding current micro-architectures:

  How much benefit is there from adjusting LMUL < 1, or AVL < VLMAX?

For instance, on other hosts with 128-bit vectors, we also promise support for 64-bit registers, just so we can support guests which have 64-bit vector operations. In existing hosts (x86, ppc, s390x, loongarch) we accept that the host instruction will operate on all 128-bits; we simply ignore half of any result.

Thus the question becomes: can we minimize the number of vset* instructions by bounding minimal lmul to 1 (or whatever) and always leaving avl as the full register? If so, the only vset* changes are for SEW changes, or for load/store that are smaller than V*1REG64.


r~


