[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Qemu-devel] [PATCHv5 06/10] migration: search for zero instead of dup p
From: |
Peter Lieven |
Subject: |
[Qemu-devel] [PATCHv5 06/10] migration: search for zero instead of dup pages |
Date: |
Tue, 26 Mar 2013 10:58:35 +0100 |
virtually all dup pages are zero pages. remove
the special is_dup_page() function and use the
optimized buffer_find_nonzero_offset() function
instead.
here buffer_find_nonzero_offset() is used directly
to avoid the unnecssary additional checks in
buffer_is_zero().
raw performace gain checking 1 GByte zeroed memory
over is_dup_page() is approx. 10-12% with SSE2
and 8-10% with unsigned long arithmedtic.
Signed-off-by: Peter Lieven <address@hidden>
Reviewed-by: Orit Wasserman <address@hidden>
Reviewed-by: Eric Blake <address@hidden>
---
arch_init.c | 21 ++++++---------------
1 file changed, 6 insertions(+), 15 deletions(-)
diff --git a/arch_init.c b/arch_init.c
index 35974c2..dd5deff 100644
--- a/arch_init.c
+++ b/arch_init.c
@@ -146,19 +146,10 @@ int qemu_read_default_config_files(bool userconfig)
return 0;
}
-static int is_dup_page(uint8_t *page)
+static inline bool is_zero_page(uint8_t *p)
{
- VECTYPE *p = (VECTYPE *)page;
- VECTYPE val = SPLAT(page);
- int i;
-
- for (i = 0; i < TARGET_PAGE_SIZE / sizeof(VECTYPE); i++) {
- if (!ALL_EQ(val, p[i])) {
- return 0;
- }
- }
-
- return 1;
+ return buffer_find_nonzero_offset(p, TARGET_PAGE_SIZE) ==
+ TARGET_PAGE_SIZE;
}
/* struct contains XBZRLE cache and a static page
@@ -445,12 +436,12 @@ static int ram_save_block(QEMUFile *f, bool last_stage)
/* In doubt sent page as normal */
bytes_sent = -1;
- if (is_dup_page(p)) {
+ if (is_zero_page(p)) {
acct_info.dup_pages++;
bytes_sent = save_block_hdr(f, block, offset, cont,
RAM_SAVE_FLAG_COMPRESS);
- qemu_put_byte(f, *p);
- bytes_sent += 1;
+ qemu_put_byte(f, 0);
+ bytes_sent++;
} else if (migrate_use_xbzrle()) {
current_addr = block->offset + offset;
bytes_sent = save_xbzrle_page(f, p, current_addr, block,
--
1.7.9.5
- [Qemu-devel] [PATCHv5 00/10] buffer_is_zero / migration optimizations, Peter Lieven, 2013/03/26
- [Qemu-devel] [PATCHv5 04/10] buffer_is_zero: use vector optimizations if possible, Peter Lieven, 2013/03/26
- [Qemu-devel] [PATCHv5 05/10] bitops: unroll while loop in find_next_bit(), Peter Lieven, 2013/03/26
- [Qemu-devel] [PATCHv5 08/10] migration: do not sent zero pages in bulk stage, Peter Lieven, 2013/03/26
- [Qemu-devel] [PATCHv5 09/10] migration: do not search dirty pages in bulk stage, Peter Lieven, 2013/03/26
- [Qemu-devel] [PATCHv5 07/10] migration: add an indicator for bulk state of ram migration, Peter Lieven, 2013/03/26
- [Qemu-devel] [PATCHv5 06/10] migration: search for zero instead of dup pages,
Peter Lieven <=
- [Qemu-devel] [PATCHv5 10/10] migration: use XBZRLE only after bulk stage, Peter Lieven, 2013/03/26
- [Qemu-devel] [PATCHv5 01/10] move vector definitions to qemu-common.h, Peter Lieven, 2013/03/26
- [Qemu-devel] [PATCHv5 02/10] add a zero splat vector to qemu-common.h, Peter Lieven, 2013/03/26
[Qemu-devel] [PATCHv5 03/10] cutils: add a function to find non-zero content in a buffer, Peter Lieven, 2013/03/26