qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [RFC PATCH v2 5/6] qcow2: implement bdrv_preallocate


From: Peter Lieven
Subject: Re: [Qemu-devel] [RFC PATCH v2 5/6] qcow2: implement bdrv_preallocate
Date: Thu, 28 Nov 2013 11:03:04 +0100
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:24.0) Gecko/20100101 Thunderbird/24.1.1

On 28.11.2013 09:48, Hu Tao wrote:
On Wed, Nov 27, 2013 at 11:13:40AM +0100, Peter Lieven wrote:
Am 27.11.2013 11:07, schrieb Fam Zheng:
On 2013年11月27日 18:03, Peter Lieven wrote:
Am 27.11.2013 07:40, schrieb Fam Zheng:
On 2013年11月27日 14:01, Hu Tao wrote:
On Wed, Nov 27, 2013 at 11:01:23AM +0800, Fam Zheng wrote:
On 2013年11月27日 10:15, Hu Tao wrote:
Signed-off-by: Hu Tao <address@hidden>
---
    block/qcow2.c | 7 +++++++
    1 file changed, 7 insertions(+)

diff --git a/block/qcow2.c b/block/qcow2.c
index b054a01..a23fade 100644
--- a/block/qcow2.c
+++ b/block/qcow2.c
@@ -2180,6 +2180,12 @@ static int qcow2_amend_options(BlockDriverState *bs,
        return 0;
    }

+static int qcow2_preallocate(BlockDriverState *bs, int64_t offset,
+                             int64_t length)
+{
+    return bdrv_preallocate(bs->file, offset, length);
+}
+
What's the semantics of .bdrv_preallocate? I think you should map
[offset, offset + length) to clusters in image file, and then
forward to bs->file, rather than this direct wrapper.

E.g. bdrv_preallocate(qcow2_bs, 0, cluster_size) should call
bdrv_preallocate(qcow2_bs->file, offset_off_first_cluster,
cluster_size).
You mean data clusters here, right? Is there a single function to get
the offset of the first data cluster?

There is a function, qcow2_get_cluster_offset.
This should return no valid offset as long as the cluster is not allocated.

I think you actually have to "write" all clusters of a qcow2 one by one.
Eventually this write could be an fallocate call instead of a zero write.

Yes, I was wrong about qcow2_get_cluster_offset. The logic here is more like 
cluster allocation in qcow2_alloc_cluster_offset. Maybe we can reuse that.
What I don't like about the preallocation is that we would loose the 
information that a cluster contains no valid data and would read it e.g. during
conversion.
So the information is stored in table and you mean we shouldn't clear
table when do preallocation? I'm not sure how the information could be
useful on a newly-created image, but it seems ideal to keep informations
in table.
When you want to e.g. convert this qcow2 later the performance is lower than 
needed because
you read all those preallocated sectors altough you could now they are empty.

I think what we want is a preallocated image with all clusters sequentally 
mapped into the qcow2 file. Preallocate all the cluster extends, but still
have the information in the table that the cluster in fact has no valid data. 
So we would need a valid cluster offset while still haveing the
flag that the cluster is unallocated. I think this would require thoughtfully 
checking all the cluster functions if they can easily cope with this.

The quetion is Hu, what do you want to achieve? Do you want that the space on 
the filesystem is preallocated so you can't overcommit or
do you also want a sequential mapping of all the clusters into the file?
The goal is to avoid sparse file as it can cause performance problem. So
the first one. I'm not sure about the second but IIUC, one fallocate()
is enough for all clusters if they are sequentially mapped.
If you do not premap them they are allocated in the order they are written.
So if you are going to preallocate the whole file anyway, you should 
sequentally map all clusters into the file
AND still keep the information that they are in fact not yet written.

Peter



reply via email to

[Prev in Thread] Current Thread [Next in Thread]