From: Daniel P. Berrange
Subject: Re: [Qemu-devel] [PATCH RFC] mem-prealloc: Reduce large guest start-up and migration time.
Date: Fri, 27 Jan 2017 13:26:50 +0000
User-agent: Mutt/1.7.1 (2016-10-04)

On Thu, Jan 05, 2017 at 12:54:02PM +0530, Jitendra Kolhe wrote:
> Using the "-mem-prealloc" option for a very large guest leads to very
> long guest start-up and migration times. This is because with the
> "-mem-prealloc" option qemu tries to map every guest page (create
> address translations) and make sure the pages are available during
> runtime. virsh/libvirt seems to use the "-mem-prealloc" option by
> default when the guest is configured to use huge pages. The patch maps
> guest pages in parallel by spawning multiple threads. Since the problem
> is more prominent for large guests, the patch limits the change to
> guests with at least 64GB of memory. The change is currently limited to
> the QEMU library functions on POSIX-compliant hosts only, as we are not
> sure whether the problem exists on win32. Below are some stats with the
> "-mem-prealloc" option for a guest configured to use huge pages.
> 
> ------------------------------------------------------------------------
> Idle Guest      | Start-up time | Migration time
> ------------------------------------------------------------------------
> Guest stats with 2M HugePage usage - single threaded (existing code)
> ------------------------------------------------------------------------
> 64 Core - 4TB   | 54m11.796s    | 75m43.843s
> 64 Core - 1TB   | 8m56.576s     | 14m29.049s
> 64 Core - 256GB | 2m11.245s     | 3m26.598s
> ------------------------------------------------------------------------
> Guest stats with 2M HugePage usage - map guest pages using 8 threads
> ------------------------------------------------------------------------
> 64 Core - 4TB   | 5m1.027s      | 34m10.565s
> 64 Core - 1TB   | 1m10.366s     | 8m28.188s
> 64 Core - 256GB | 0m19.040s     | 2m10.148s
> ------------------------------------------------------------------------
> Guest stats with 2M HugePage usage - map guest pages using 16 threads
> ------------------------------------------------------------------------
> 64 Core - 4TB   | 1m58.970s     | 31m43.400s
> 64 Core - 1TB   | 0m39.885s     | 7m55.289s
> 64 Core - 256GB | 0m11.960s     | 2m0.135s
> ------------------------------------------------------------------------
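
For reference, the parallel pre-touch approach described above boils down
to something like the sketch below. This is not the actual patch: the
thread count, the 1 GiB test size and the use of sysconf() for the page
size are illustrative assumptions (the real code would use the hugepage
size of the backing memory). Build with -lpthread.

#include <pthread.h>
#include <stdio.h>
#include <sys/mman.h>
#include <unistd.h>

#define NUM_THREADS 8   /* illustrative; 8 and 16 match the tables above */

struct touch_arg {
    char *start;
    size_t len;
    size_t pagesize;
};

/* Write one byte per page so the kernel faults the page in now. */
static void *touch_pages(void *opaque)
{
    struct touch_arg *arg = opaque;
    size_t off;

    for (off = 0; off < arg->len; off += arg->pagesize) {
        arg->start[off] = 0;
    }
    return NULL;
}

/* Split the area into per-thread chunks and touch them in parallel. */
static void prealloc_parallel(char *area, size_t size, size_t pagesize)
{
    pthread_t threads[NUM_THREADS];
    struct touch_arg args[NUM_THREADS];
    size_t chunk = (size / NUM_THREADS) & ~(pagesize - 1);
    int i;

    for (i = 0; i < NUM_THREADS; i++) {
        args[i].start = area + (size_t)i * chunk;
        args[i].len = (i == NUM_THREADS - 1) ? size - (size_t)i * chunk
                                             : chunk;
        args[i].pagesize = pagesize;
        pthread_create(&threads[i], NULL, touch_pages, &args[i]);
    }
    for (i = 0; i < NUM_THREADS; i++) {
        pthread_join(threads[i], NULL);
    }
}

int main(void)
{
    size_t size = 1UL << 30;                 /* 1 GiB for the example   */
    size_t pagesize = sysconf(_SC_PAGESIZE); /* base page size here; a  */
                                             /* hugetlbfs area would    */
                                             /* use the hugepage size   */
    char *area = mmap(NULL, size, PROT_READ | PROT_WRITE,
                      MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);

    if (area == MAP_FAILED) {
        perror("mmap");
        return 1;
    }
    prealloc_parallel(area, size, pagesize);
    munmap(area, size);
    return 0;
}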

For comparison, what is performance like if you replace the memset() in
the current code with a call to mlock()?

IIUC, huge pages are non-swappable once allocated, so it feels like
we ought to be able to just call mlock() to preallocate them with
no downside, rather than spawning many threads to memset() them.

Of course you'd still need the memset() trick if QEMU were given normal
(non-huge) pages in combination with -mem-prealloc, as you don't want to
lock normal pages into RAM permanently.
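
For reference, the mlock()-based preallocation suggested above might look
roughly like the sketch below. This is only an illustration: the
Linux-specific MAP_HUGETLB anonymous mapping, the 1 GiB size and the
assumption that RLIMIT_MEMLOCK (or CAP_IPC_LOCK) permits locking the whole
area are not part of the original mail.

#include <stdio.h>
#include <sys/mman.h>

int main(void)
{
    size_t size = 1UL << 30;    /* 1 GiB for the example */

    /* MAP_HUGETLB (Linux-specific) asks for pages from the kernel's
     * huge page pool; the pool must have been sized beforehand, e.g.
     * via vm.nr_hugepages. */
    void *area = mmap(NULL, size, PROT_READ | PROT_WRITE,
                      MAP_PRIVATE | MAP_ANONYMOUS | MAP_HUGETLB, -1, 0);

    if (area == MAP_FAILED) {
        perror("mmap");
        return 1;
    }

    /* mlock() forces every page in the range to be allocated and
     * faulted in up front; since huge pages are not swappable anyway,
     * this should have the same effect as the memset() pass without
     * spawning extra threads. */
    if (mlock(area, size) < 0) {
        perror("mlock");
        return 1;
    }

    munlock(area, size);
    munmap(area, size);
    return 0;
}

Whether this actually beats the threaded memset() for multi-terabyte
hugepage guests is exactly the comparison being asked for above.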

Regards,
Daniel
-- 
|: http://berrange.com      -o-    http://www.flickr.com/photos/dberrange/ :|
|: http://libvirt.org              -o-             http://virt-manager.org :|
|: http://entangle-photo.org       -o-    http://search.cpan.org/~danberr/ :|


