Re: [RFC PATCH v2 5/6] target/riscv: rvv: Optimize v[l|s]e8.v with limit

qemu-riscv

From:	Max Chou
Subject:	Re: [RFC PATCH v2 5/6] target/riscv: rvv: Optimize v[l|s]e8.v with limitations
Date:	Mon, 3 Jun 2024 23:50:45 +0800
User-agent:	Mozilla Thunderbird

Hi Richard,

Thank you for your feedback.

This version was created by referencing the gen_sve_ldr translation function, with similar assumptions: no mask (predication), no tail agnostic, and continuous load & store. You are right, the expansion is large in this version (more than the 20 TCG instructions suggested in the tcg-op doc). I will provide the next version with a helper function implementation like sve_ldN_r in the ARM target.
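To make the helper-based direction concrete, here is a minimal standalone sketch of the idea, not the actual QEMU code. It assumes the guest address has already been resolved to a host pointer once for the whole access, that there is no mask, and that tail elements are left untouched. The names VReg, VLENB, sketch_vle8 and sketch_vse8 are hypothetical and exist only for this illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical vector register length in bytes (illustration only). */
#define VLENB 16

/* Hypothetical stand-in for one vector register; not a QEMU type. */
typedef struct {
    uint8_t bytes[VLENB];
} VReg;

/*
 * Unit-stride byte load: copy vl contiguous bytes into the register.
 * Assumptions: host_addr is already a resolved host pointer, no mask
 * is applied, and bytes past vl (the tail) are left undisturbed.
 * With 8-bit elements, endianness does not affect the copy.
 */
static void sketch_vle8(VReg *vd, const uint8_t *host_addr, size_t vl)
{
    assert(vl <= VLENB);
    memcpy(vd->bytes, host_addr, vl);
}

/* Unit-stride byte store: the mirror image of the load above. */
static void sketch_vse8(uint8_t *host_addr, const VReg *vs, size_t vl)
{
    assert(vl <= VLENB);
    memcpy(host_addr, vs->bytes, vl);
}
```

The point of the sketch is only that, under these assumptions, the whole access collapses into one bulk copy inside a helper rather than a long inline sequence of TCG ops.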


Thank you,
Max

On 2024/6/3 1:45 AM, Richard Henderson wrote:

On 5/31/24 12:44, Max Chou wrote:

The vector unit-stride load/store instructions (e.g. vle8.v/vse8.v)
perform continuous load/store. We can replace the corresponding helper
functions by TCG ops to copy more data at a time with following
assumptions:

* Perform virtual address resolution once for entire vector at beginning
* Without mask
* Without tail agnostic
* Both host and target are little endian

Signed-off-by: Max Chou <max.chou@sifive.com>

Why are you generating all of this inline? This expansion is very large. I would expect you to get better performance with a helper function.


AGAIN, please see the Arm implementation.


r~
