[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH 22/23] userfaultfd: avoid mmap_sem read recursio

From: Andrea Arcangeli
Subject: Re: [Qemu-devel] [PATCH 22/23] userfaultfd: avoid mmap_sem read recursion in mcopy_atomic
Date: Sat, 23 May 2015 03:04:15 +0200
User-agent: Mutt/1.5.23 (2014-03-12)

On Fri, May 22, 2015 at 02:18:30PM -0700, Andrew Morton wrote:
> There's a more serious failure with i386 allmodconfig:
> fs/userfaultfd.c:145:2: note: in expansion of macro 'BUILD_BUG_ON'
>   BUILD_BUG_ON(sizeof(struct uffd_msg) != 32);
> I'm surprised the feature is even reachable on i386 builds?

Unless we risk to run out of vma->vm_flags there's no particular
reason not to enable it on 32bit (even if we run out, making vm_flags
an unsigned long long is a few liner patch). Certainly it's less
useful on 32bit as there's a 3G limit but the max vmas per process are
still a small fraction of that. Especially if used for the volatile
pages on demand notification of page reclaim, it could end up useful
on arm32 (S6 is 64bit I think and latest snapdragon is too, so perhaps
it's too late anyway, but again it's not big deal).

Removing the BUILD_BUG_ON I think is not ok here because while I'm ok
to support 32bit archs, I don't want translation, the 64bit kernel
should talk with the 32bit app directly without a layer in between.

I tried to avoid using packet as without packed I could not get the
alignment wrong (and future union also couldn't get it wrong), and I
could avoid those reserved1/2/3, but it's more robust to use it in
combination with the BUILD_BUG_ON to detect right away problems like
this with 32bit builds that aligns things differently.

I'm actually surprised the buildbot that sends me email about all
archs didn't actually send me anything about it for 32bit x86?
Perhaps I'm overlooking something or x86 32bit (or any other 32bit
arch for that matter) isn't being checked?  This is actually a fairly
recent change, perhaps the buildbot was shutdown recently? That
buildbot was very useful to detect for problems like this.

>From 2f0a48670dc515932dec8b983871ec35caeba553 Mon Sep 17 00:00:00 2001
From: Andrea Arcangeli <address@hidden>
Date: Sat, 23 May 2015 02:26:32 +0200
Subject: [PATCH] userfaultfd: update the uffd_msg structure to be the same on

Avoiding to using packed allowed the code to be nicer and it avoided
the reserved1/2/3 but the structure must be the same for 32bit and
64bit archs so x86 applications built with the 32bit ABI can run on
the 64bit kernel without requiring translation of the data read
through the read syscall.

$ gcc -m64 p.c && ./a.out
$ gcc -m32 p.c && ./a.out

int main()
        printf("%lu\n", sizeof(struct uffd_msg));
        printf("%lu\n", (unsigned long) &((struct uffd_msg *) 0)->event);
        printf("%lu\n", (unsigned long) &((struct uffd_msg *) 
        printf("%lu\n", (unsigned long) &((struct uffd_msg *) 
        printf("%lu\n", (unsigned long) &((struct uffd_msg *) 
        printf("%lu\n", (unsigned long) &((struct uffd_msg *) 
        printf("%lu\n", (unsigned long) &((struct uffd_msg *) 

Reported-by: Andrew Morton <address@hidden>
Signed-off-by: Andrea Arcangeli <address@hidden>
 include/uapi/linux/userfaultfd.h | 8 ++++++--
 1 file changed, 6 insertions(+), 2 deletions(-)

diff --git a/include/uapi/linux/userfaultfd.h b/include/uapi/linux/userfaultfd.h
index c8a543f..00d28e2 100644
--- a/include/uapi/linux/userfaultfd.h
+++ b/include/uapi/linux/userfaultfd.h
@@ -59,9 +59,13 @@
 struct uffd_msg {
        __u8    event;
+       __u8    reserved1;
+       __u16   reserved2;
+       __u32   reserved3;
        union {
                struct {
-                       __u32   flags;
+                       __u64   flags;
                        __u64   address;
                } pagefault;
@@ -72,7 +76,7 @@ struct uffd_msg {
                        __u64   reserved3;
                } reserved;
        } arg;
+} __attribute__((packed));
  * Start at 0x12 and not at 0 to be more strict against bugs.

reply via email to

[Prev in Thread] Current Thread [Next in Thread]