[Qemu-devel] [PATCH 05/10] cutils: Remove aarch64 buffer zero checking
From: Paolo Bonzini
Subject: [Qemu-devel] [PATCH 05/10] cutils: Remove aarch64 buffer zero checking
Date: Tue, 13 Sep 2016 18:10:00 +0200
From: Richard Henderson <address@hidden>
The revised integer version is 4 times faster than the NEON version
on an AppliedMicro Mustang. Even with hand scheduling and additional
unrolling, I cannot make any NEON version run as fast as the integer
version.
Signed-off-by: Richard Henderson <address@hidden>
Signed-off-by: Paolo Bonzini <address@hidden>
---
util/bufferiszero.c | 15 ---------------
1 file changed, 15 deletions(-)
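
Editorial note: the commit message compares an integer loop against a NEON
loop. As a rough illustration of what a word-at-a-time integer check looks
like, here is a minimal sketch. It is not QEMU's buffer_zero_int; the name
sketch_buffer_is_zero and the 32-byte block size are assumptions chosen for
the example.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/*
 * Hypothetical helper, not QEMU's buffer_zero_int.
 * OR four 64-bit words together and test the result once per
 * 32-byte block, then finish the remaining bytes one at a time.
 */
static bool sketch_buffer_is_zero(const void *buf, size_t len)
{
    const unsigned char *p = buf;
    size_t i = 0;

    /* Main loop: memcpy keeps the loads valid for unaligned input;
     * optimizing compilers typically lower it to plain loads. */
    for (; i + 32 <= len; i += 32) {
        uint64_t w[4];
        memcpy(w, p + i, sizeof(w));
        if ((w[0] | w[1] | w[2] | w[3]) != 0) {
            return false;
        }
    }

    /* Tail: fewer than 32 bytes remain. */
    for (; i < len; i++) {
        if (p[i] != 0) {
            return false;
        }
    }
    return true;
}
```

A loop of this shape needs no alignment or length precondition, so it can
serve every caller without a separate dispatch step.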
diff --git a/util/bufferiszero.c b/util/bufferiszero.c
index 0bcca70..107b0e9 100644
--- a/util/bufferiszero.c
+++ b/util/bufferiszero.c
@@ -191,21 +191,6 @@ static bool select_accel_fn(const void *buf, size_t len)
     return buffer_zero_int(buf, len);
 }
 
-#elif defined(__aarch64__)
-#include "arm_neon.h"
-
-#define DO_NONZERO(X) (vgetq_lane_u64((X), 0) | vgetq_lane_u64((X), 1))
-ACCEL_BUFFER_ZERO(buffer_zero_neon, 128, uint64x2_t, DO_NONZERO)
-
-static bool select_accel_fn(const void *buf, size_t len)
-{
-    uintptr_t ibuf = (uintptr_t)buf;
-    if (len % 128 == 0 && ibuf % sizeof(uint64x2_t) == 0) {
-        return buffer_zero_neon(buf, len);
-    }
-    return buffer_zero_int(buf, len);
-}
-
 #else
 #define select_accel_fn buffer_zero_int
 #endif
--
1.8.3.1
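
Editorial note: the removed select_accel_fn took the NEON path only when the
length was a multiple of 128 bytes and the pointer was aligned to the vector
size. The sketch below restates that precondition in portable C so it can be
tried on any host. The names sketch_vector_path_ok, SKETCH_BLOCK_BYTES and
SKETCH_VEC_BYTES are hypothetical, and 16 stands in for sizeof(uint64x2_t).

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical constants mirroring the removed check. */
#define SKETCH_BLOCK_BYTES 128  /* bytes consumed per vector loop iteration */
#define SKETCH_VEC_BYTES    16  /* stands in for sizeof(uint64x2_t) */

/* Returns true when a buffer meets the removed code's precondition:
 * length a multiple of the block size, pointer aligned to the vector size. */
static bool sketch_vector_path_ok(const void *buf, size_t len)
{
    uintptr_t ibuf = (uintptr_t)buf;

    return len % SKETCH_BLOCK_BYTES == 0 && ibuf % SKETCH_VEC_BYTES == 0;
}
```

Any buffer that failed this test fell back to buffer_zero_int, which after
this patch is the only path on aarch64.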