[PATCH v3 07/88] tcg: Improve expansion of deposit of constant
From: Richard Henderson
Date: Wed, 1 Nov 2023 18:28:55 -0700

The extract2 expansion is too difficult for the optimizer to
simplify. If we have an immediate input, use and+or instead,
skipping the and if the field becomes all 1's.

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
tcg/tcg-op.c | 28 ++++++++++++++++++++++++++++
1 file changed, 28 insertions(+)
diff --git a/tcg/tcg-op.c b/tcg/tcg-op.c
index 828eb9ee46..e305260099 100644
--- a/tcg/tcg-op.c
+++ b/tcg/tcg-op.c
@@ -608,6 +608,7 @@ void tcg_gen_deposit_i32(TCGv_i32 ret, TCGv_i32 arg1, TCGv_i32 arg2,
{
uint32_t mask;
TCGv_i32 t1;
+ TCGTemp *ts;
tcg_debug_assert(ofs < 32);
tcg_debug_assert(len > 0);
@@ -623,6 +624,19 @@ void tcg_gen_deposit_i32(TCGv_i32 ret, TCGv_i32 arg1, TCGv_i32 arg2,
return;
}
+ /* Deposit of a constant into a value. */
+ ts = tcgv_i32_temp(arg2);
+ if (ts->kind == TEMP_CONST) {
+ uint32_t mask0 = deposit32(-1, ofs, len, 0);
+ uint32_t maski = deposit32(0, ofs, len, ts->val);
+
+ if (mask0 != ~maski) {
+ tcg_gen_andi_i32(ret, arg1, mask0);
+ }
+ tcg_gen_ori_i32(ret, ret, maski);
+ return;
+ }
+
t1 = tcg_temp_ebb_new_i32();
if (TCG_TARGET_HAS_extract2_i32) {
@@ -2229,6 +2243,7 @@ void tcg_gen_deposit_i64(TCGv_i64 ret, TCGv_i64 arg1, TCGv_i64 arg2,
{
uint64_t mask;
TCGv_i64 t1;
+ TCGTemp *ts;
tcg_debug_assert(ofs < 64);
tcg_debug_assert(len > 0);
@@ -2244,6 +2259,19 @@ void tcg_gen_deposit_i64(TCGv_i64 ret, TCGv_i64 arg1, TCGv_i64 arg2,
return;
}
+ /* Deposit of a constant into a value. */
+ ts = tcgv_i64_temp(arg2);
+ if (ts->kind == TEMP_CONST) {
+ uint64_t mask0 = deposit64(-1, ofs, len, 0);
+ uint64_t maski = deposit64(0, ofs, len, ts->val);
+
+ if (mask0 != ~maski) {
+ tcg_gen_andi_i64(ret, arg1, mask0);
+ }
+ tcg_gen_ori_i64(ret, ret, maski);
+ return;
+ }
+
if (TCG_TARGET_REG_BITS == 32) {
if (ofs >= 32) {
tcg_gen_deposit_i32(TCGV_HIGH(ret), TCGV_HIGH(arg1),
--
2.34.1
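
For readers following the mask arithmetic, here is a minimal sketch on plain
integers rather than TCG temporaries. It is not part of the patch:
deposit32_ref() is a local stand-in assumed to behave like QEMU's deposit32(),
and deposit_const_and_or() is a hypothetical name for the and+or sequence
described in the commit message.

```c
#include <stdint.h>

/*
 * Stand-in for QEMU's deposit32() (an assumption, written here so the
 * example is self-contained): replace the len-bit field at bit ofs of
 * value with the low bits of field.  Requires 0 < len <= 32 - ofs.
 */
static uint32_t deposit32_ref(uint32_t value, int ofs, int len, uint32_t field)
{
    uint32_t mask = (~0u >> (32 - len)) << ofs;
    return (value & ~mask) | ((field << ofs) & mask);
}

/* Hypothetical helper: deposit a known constant c using and + or. */
static uint32_t deposit_const_and_or(uint32_t arg1, uint32_t c, int ofs, int len)
{
    /* All ones except the target field, which is cleared. */
    uint32_t mask0 = deposit32_ref(UINT32_MAX, ofs, len, 0);
    /* The constant, truncated to len bits and shifted into place. */
    uint32_t maski = deposit32_ref(0, ofs, len, c);
    uint32_t ret = arg1;

    /*
     * If the deposited field is all 1's, maski is the exact complement
     * of mask0 and the or alone produces the result, so the and is skipped.
     */
    if (mask0 != ~maski) {
        ret &= mask0;
    }
    return ret | maski;
}
```

With arg1 = 0x12345678, ofs = 8 and len = 8, depositing 0xAB takes the and+or
path and gives 0x1234AB78, while depositing 0xFF takes the or-only path and
gives 0x1234FF78. Both match what deposit32_ref() returns for the same inputs.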