qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Measuring the impact of buffer copy for virtio-gpu guests


From: Alex Bennée
Subject: Re: Measuring the impact of buffer copy for virtio-gpu guests
Date: Wed, 17 Feb 2021 15:48:01 +0000
User-agent: mu4e 1.5.8; emacs 28.0.50

François Ozog <francois.ozog@linaro.org> writes:

> On Wed, 17 Feb 2021 at 15:13, Alex Bennée <alex.bennee@linaro.org> wrote:
>
>> Hi Gerd,
>>
>> I was in a discussion with the AGL folks today talking about approaches
>> to achieving zero-copy when running VirGL virtio guests. AIUI (which is
>> probably not very much) the reasons for copy can be due to a number of
>> reasons:
>>
>>   - the GPA not being mapped to a HPA that is accessible to the final HW
>>   - the guest allocation of a buffer not meeting stride/alignment
>> requirements
>>   - data needing to be transformed for consumption by the real hardware?
>>
>> any others? Is there an impedance between different buffer resource
>> allocators in the guest and the guest? Is that just a problem for
>> non-FLOSS blob drivers in the kernel?
>>
>> I'm curious if it's possible to measure the effect of these extra copies
>> and where do they occur?
>
> Making a good benchmark is going to be difficult. Copying has big impacts
> on:
> - L3 pressure (pure cost of evictions and loss of "sticky" cache lines
> benefits)
> - Memory request queue and prefetching
> - TLB pressure
> Conversely, as we are in VM environments the pressure that other VMs have
> on those resources, the jitter of the bounce copies will grow.
> (lesson learnt from high speed - > 100Gbps - user pace networking)
> All this to say that a unitest may be wrongly give impression that copy is
> not that costly.

No I'm not doubting that unneeded copying can be costly - I'm just
trying to get an understanding of the scope of the problem. How often do
buffers get copied rather than measuring the total effect which as you
say can be very load dependant.

>
>> Do all resources get copied from the guest buffer to
>> host or does this only occur when there is a mismatch in the buffer
>> requirements?
>>
>> Are there any functions where I could add trace points to measure this?
>> If this occurs in the kernel I wonder if I could use an eBPF probe to
>> count the number of bytes copied?
>>
>> Apologies for the wall of questions I'm still very new to the 3D side of
>> things ;-)
>>
>> --
>> Alex Bennée
>>


-- 
Alex Bennée



reply via email to

[Prev in Thread] Current Thread [Next in Thread]