Re: [PATCH] block/file-posix: Fix problem with fallocate(PUNCH

qemu-block

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH] block/file-posix: Fix problem with fallocate(PUNCH_HOLE) on

From:	Thomas Huth
Subject:	Re: [PATCH] block/file-posix: Fix problem with fallocate(PUNCH_HOLE) on GPFS
Date:	Wed, 19 May 2021 12:21:20 +0200
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.10.0

On 19/04/2021 07.06, Thomas Huth wrote:

On 16/04/2021 22.34, Nir Soffer wrote:
On Fri, Apr 16, 2021 at 8:23 AM Thomas Huth <thuth@redhat.com> wrote:
A customer reported that running

  qemu-img convert -t none -O qcow2 -f qcow2 input.qcow2 output.qcow2

fails for them with the following error message when the images are
stored on a GPFS file system:

  qemu-img: error while writing sector 0: Invalid argument

After analyzing the strace output, it seems like the problem is in
handle_aiocb_write_zeroes(): The call to fallocate(FALLOC_FL_PUNCH_HOLE)
returns EINVAL, which can apparently happen if the file system has
a different idea of the granularity of the operation. It's arguably
a bug in GPFS, since the PUNCH_HOLE mode should not result in EINVAL
according to the man-page of fallocate(), but the file system is out
there in production and so we have to deal with it. In commit 294682cc3a
("block: workaround for unaligned byte range in fallocate()") we also
already applied the a work-around for the same problem to the earlier
fallocate(FALLOC_FL_ZERO_RANGE) call, so do it now similar with the
PUNCH_HOLE call.

Signed-off-by: Thomas Huth <thuth@redhat.com>
---
  block/file-posix.c | 7 +++++++
  1 file changed, 7 insertions(+)

diff --git a/block/file-posix.c b/block/file-posix.c
index 20e14f8e96..7a40428d52 100644
--- a/block/file-posix.c
+++ b/block/file-posix.c
@@ -1675,6 +1675,13 @@ static int handle_aiocb_write_zeroes(void *opaque)
              }
              s->has_fallocate = false;
          } else if (ret != -ENOTSUP) {
+            if (ret == -EINVAL) {
+                /*
+ * File systems like GPFS do not like unaligned byteranges,+ * treat it like unsupported (so caller falls back topwrite)
+                 */
+                return -ENOTSUP;
This skips the next fallback, using plain fallocate(0) if we write
after the end of the file. Is this intended?

We can treat the buggy EINVAL return value as "filesystem is buggy,
let's not try other options", or "let's try the next option". Since falling
back to actually writing zeroes is so much slower, I think it is better to
try the next option.
I just did the same work-around as in commit 294682cc3a7 ... so if we agreeto try the other options, too, we should change that spot, too...
However, what is not clear to me, how would you handle s->has_write_zeroesand s->has_discard in such a case? Set them to "false"? ... but it couldstill work for some blocks with different alignment ... but if we keep themset to "true", the code tries again and again to call these ioctls, maybewasting other precious cycles for this?
Maybe we should do a different approach instead: In case we hit a EINVALhere, print an error a la:
error_report_once("You are running on a buggy file system, please complainto the file system vendor");
and return -ENOTSUP ... then it's hopefully clear to the users why they aregetting a bad performance, and that they should complain to the file systemvendor instead to get their problem fixed.


Ping!

Any recommendations how to proceed here?

 Thomas

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [PATCH] block/file-posix: Fix problem with fallocate(PUNCH_HOLE) on GPFS, Thomas Huth <=
- Re: [PATCH] block/file-posix: Fix problem with fallocate(PUNCH_HOLE) on GPFS, Thomas Huth, 2021/05/19

Prev by Date: Re: [RFC PATCH] block/vpc: Support probing of fixed-size VHD images
Next by Date: Re: [PATCH] block/file-posix: Fix problem with fallocate(PUNCH_HOLE) on GPFS
Previous by thread: Re: [RFC PATCH] block/vpc: Support probing of fixed-size VHD images
Next by thread: Re: [PATCH] block/file-posix: Fix problem with fallocate(PUNCH_HOLE) on GPFS
Index(es):
- Date
- Thread