[Qemu-block] [PULL 5/6] file-posix: Make bdrv_flush() failure permanent
From: Max Reitz
Subject: [Qemu-block] [PULL 5/6] file-posix: Make bdrv_flush() failure permanent without O_DIRECT
Date: Mon, 27 Mar 2017 17:52:33 +0200
From: Kevin Wolf <address@hidden>
Success for bdrv_flush() means that all previously written data is safe
on disk. For fdatasync(), the best semantics we can hope for on Linux
(without O_DIRECT) is that all data that was written since the last call
was successfully written back. Therefore, and because we can't redo all
writes after a flush failure, we have to give up after a single
fdatasync() failure. After this failure, we would never be able to make
the promise that a successful bdrv_flush() makes.
Signed-off-by: Kevin Wolf <address@hidden>
Message-id: address@hidden
Reviewed-by: Fam Zheng <address@hidden>
Reviewed-by: Eric Blake <address@hidden>
Reviewed-by: Stefan Hajnoczi <address@hidden>
Signed-off-by: Max Reitz <address@hidden>
---
block/file-posix.c | 22 ++++++++++++++++++++++
1 file changed, 22 insertions(+)
diff --git a/block/file-posix.c b/block/file-posix.c
index 53febd3767..beb7a4f728 100644
--- a/block/file-posix.c
+++ b/block/file-posix.c
@@ -144,6 +144,7 @@ typedef struct BDRVRawState {
     bool has_write_zeroes:1;
     bool discard_zeroes:1;
     bool use_linux_aio:1;
+    bool page_cache_inconsistent:1;
     bool has_fallocate;
     bool needs_alignment;
 } BDRVRawState;
@@ -824,10 +825,31 @@ static ssize_t handle_aiocb_ioctl(RawPosixAIOData *aiocb)
 static ssize_t handle_aiocb_flush(RawPosixAIOData *aiocb)
 {
+    BDRVRawState *s = aiocb->bs->opaque;
     int ret;
+    if (s->page_cache_inconsistent) {
+        return -EIO;
+    }
+
     ret = qemu_fdatasync(aiocb->aio_fildes);
     if (ret == -1) {
+        /* There is no clear definition of the semantics of a failing fsync(),
+         * so we may have to assume the worst. The sad truth is that this
+         * assumption is correct for Linux. Some pages are now probably marked
+         * clean in the page cache even though they are inconsistent with the
+         * on-disk contents. The next fdatasync() call would succeed, but no
+         * further writeback attempt will be made. We can't get back to a state
+         * in which we know what is on disk (we would have to rewrite
+         * everything that was touched since the last fdatasync() at least), so
+         * make bdrv_flush() fail permanently. Given that the behaviour isn't
+         * really defined, I have little hope that other OSes are doing better.
+         *
+         * Obviously, this doesn't affect O_DIRECT, which bypasses the page
+         * cache. */
+        if ((s->open_flags & O_DIRECT) == 0) {
+            s->page_cache_inconsistent = true;
+        }
         return -errno;
     }
     return 0;
--
2.12.1
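
For readers who want to see the sticky-failure idea outside of QEMU, the
following is a minimal standalone sketch, not the code from this patch.
SketchFile, sketch_flush(), the direct_io field (standing in for the
open_flags & O_DIRECT check) and the injectable sync_fn are all hypothetical
names introduced here so that a sync failure can be simulated without a
real disk error.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for BDRVRawState; not QEMU's real structure. */
typedef struct SketchFile {
    int fd;
    bool direct_io;               /* assumed: true if opened with O_DIRECT */
    bool page_cache_inconsistent; /* set once a buffered sync has failed */
    int (*sync_fn)(int fd);       /* injectable; a real caller would wrap fdatasync() */
} SketchFile;

/* Returns 0 on success or a negative errno value on failure. */
static int sketch_flush(SketchFile *f)
{
    assert(f != NULL && f->sync_fn != NULL);

    /* After one failed buffered sync, never report success again. */
    if (f->page_cache_inconsistent) {
        return -EIO;
    }

    if (f->sync_fn(f->fd) == -1) {
        int err = errno; /* save before anything can overwrite it */

        /* Only buffered I/O goes through the page cache, so only then is
         * the failure treated as permanent. */
        if (!f->direct_io) {
            f->page_cache_inconsistent = true;
        }
        return -err;
    }
    return 0;
}

/* Simulated sync: fails with EIO on the first call, succeeds afterwards.
 * This mimics a later sync that "succeeds" after a failed writeback. */
static int sketch_sync_calls;

static int sketch_sync_fail_first(int fd)
{
    (void)fd;
    if (sketch_sync_calls++ == 0) {
        errno = EIO;
        return -1;
    }
    return 0;
}
```

With direct_io set to false, the first sketch_flush() call returns -EIO and
every later call returns -EIO as well, without invoking the sync function
again. With direct_io set to true, the flag is never set, so a later call
may succeed.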