[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [PATCH v2 1/4] KVM: Dynamic sized kvm memslots array
From: |
Juraj Marcin |
Subject: |
Re: [PATCH v2 1/4] KVM: Dynamic sized kvm memslots array |
Date: |
Fri, 6 Sep 2024 12:54:37 +0200 |
Hi Peter,
On Thu, Sep 5, 2024 at 6:00 PM Peter Xu <peterx@redhat.com> wrote:
>
> On Thu, Sep 05, 2024 at 05:32:46PM +0200, Juraj Marcin wrote:
> > Hi Peter,
>
> Hi, Juraj,
>
> [...]
>
> > > unsigned int kvm_get_max_memslots(void)
> > > {
> > > KVMState *s = KVM_STATE(current_accel());
> > > @@ -193,15 +247,20 @@ unsigned int kvm_get_free_memslots(void)
> > > /* Called with KVMMemoryListener.slots_lock held */
> > > static KVMSlot *kvm_get_free_slot(KVMMemoryListener *kml)
> > > {
> > > - KVMState *s = kvm_state;
> > > int i;
> > >
> > > - for (i = 0; i < s->nr_slots; i++) {
> > > +retry:
> > > + for (i = 0; i < kml->nr_slots_allocated; i++) {
> > > if (kml->slots[i].memory_size == 0) {
> > > return &kml->slots[i];
> > > }
> > > }
> > >
> > > + /* If no free slots, try to grow first by doubling */
> > > + if (kvm_slots_double(kml)) {
> > > + goto retry;
> >
> > At this point we know all previously allocated slots were used and
> > there should be a free slot just after the last used slot (at the
> > start of the region zeroed in the grow function). Wouldn't it be
> > faster to return it here right away, instead of iterating through
> > slots that should still be used again?
>
> Good question.
>
> One trivial concern is we'll then have assumption on how kvm_slots_double()
> behaves, e.g., it must not move anything around inside, and we need to know
> that it touches nr_slots_allocated so we need to cache it. The outcome
> looks like this:
>
> ===8<===
> diff --git a/accel/kvm/kvm-all.c b/accel/kvm/kvm-all.c
> index 020fd16ab8..7429fe87a8 100644
> --- a/accel/kvm/kvm-all.c
> +++ b/accel/kvm/kvm-all.c
> @@ -249,9 +249,9 @@ unsigned int kvm_get_free_memslots(void)
> /* Called with KVMMemoryListener.slots_lock held */
> static KVMSlot *kvm_get_free_slot(KVMMemoryListener *kml)
> {
> + unsigned int n;
> int i;
>
> -retry:
> for (i = 0; i < kml->nr_slots_allocated; i++) {
> if (kml->slots[i].memory_size == 0) {
> return &kml->slots[i];
> @@ -259,8 +259,13 @@ retry:
> }
>
> /* If no free slots, try to grow first by doubling */
> + n = kml->nr_slots_allocated;
> if (kvm_slots_double(kml)) {
> - goto retry;
> + /*
> + * If succeed, we must have n used slots, then followed by n free
> + * slots.
> + */
> + return &kml->slots[n];
> }
>
> return NULL;
> ===8<===
>
> It's still good to get rid of "goto", and faster indeed. Though I wished
> we don't need those assumptions, as cons.
>
> One thing to mention that I expect this is extremely slow path, where I
> don't expect to even be reached in major uses of QEMU, and when reached
> should be only once or limited few times per VM life cycle. The re-walks
> here shouldn't be a perf concern IMHO, because when it's a concern we'll
> hit it much more frequently elsewhere... many other hotter paths around.
>
> So far it looks slightly more readable to me to keep the old way, but I'm
> ok either way. What do you think?
I agree that it requires this assumption of not moving slots around,
but I think it's intuitive to assume it when it comes to
doubling/increasing the size of an array, realloc() and g_renew() also
don't shuffle existing elements.
In addition, there already is such an assumption. If slots were moved
around, pointers returned by `return &kml->slots[i];` wouldn't point
to the same slot structure after doubling.
However, I realized there's also another problem with this return
statement. g_renew() could have moved the whole array to a new
address, making all the previously returned pointers invalid. This
could be solved by either adding another layer of indirection, so the
function returns a pointer to a single slot structure that never moves
and the array contains pointers to these structures, or the slots need
to be always accessed through an up-to-date pointer to the array,
probably from another structure or through a getter function. With the
first approach, pointers in the array could shuffle, but with the
second one, the index of a slot must not change during the lifetime of
the slot, keeping the assumption correct.
>
> Thanks,
>
> --
> Peter Xu
>
--
Juraj Marcin