
Re: [Qemu-devel] [PATCH] Introduce cache images for the QCOW2 format


From: Kaveh Razavi
Subject: Re: [Qemu-devel] [PATCH] Introduce cache images for the QCOW2 format
Date: Thu, 15 Aug 2013 14:25:08 +0200
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130330 Thunderbird/17.0.5

On 08/15/2013 10:32 AM, Stefan Hajnoczi wrote:
> I don't buy the argument about the page cache being evicted at any time:
>
> At the scale where caching is important, provisioning a measly 100 MB
> of RAM per guest should not be a challenge.
>
> cgroups can be used to isolate page cache between VMs if you want
> guaranteed caches.
>
> But it could be more interesting not to isolate, so that the page cache
> acts host-wide to reduce overall I/O instead of narrowly focussing
> on caching 100 MB for a specific image even if it is rarely accessed.
>
> The real downside I see is that the page cache is volatile, so you could
> see heavy I/O if multiple hosts reboot at the same time.


At the VM hosts, memory is mostly allocated to the VMs themselves. Without persisted caches, starting another VM from one of the possible backing images may or may not generate network traffic, depending on what happens to be in the page cache. Cache images persisted on the hosts' disks eliminate that traffic, at least at VM boot, regardless of the page cache state.

At the storage site, however, I think it makes sense to dedicate memory to popular backing images (via tmpfs rather than the page cache). The data blocks of the popular images used for booting will be accessed by every VM that starts from these "template" images.

> Streaming offers a rate limiting parameter, so you can tune it to the
> network conditions.
>
> Copying the full image doesn't just reduce load on the NFS server, it
> also means guests can continue to run if the NFS server becomes
> unreachable.  That's an important property for reliability.

I am not really sure that copying the entire image reduces the load on the NFS server, especially at scale. If copying the entire image at scale is desired or necessary, peer-to-peer approaches are documented to perform better, though they are mostly implemented at the host file-system layer (see e.g. VMTorrent). I agree with the reliability consideration when dealing with an unreliable (remote) file system.

> 1)
> It is persistent.  The backing file chain looks like this:
>
>    /nfs/template.qcow2 <- /local/cache.qcow2 <- /local/vm001.qcow2
>
> The cache is a regular qcow2 image file that is persistent.  The discard
> command is used to evict data from the file.  Copy-on-read accesses are
> used to populate the cache when the guest submits a read request.
>
> 2)
> You can set cache size or other parameters as a qemu-nbd option (this
> doesn't exist but could be implemented):
>
>    $ qemu-img create -f qcow2 -o backing_file=/nfs/template.qcow2 cache.qcow2
>    $ qemu-nbd --options cache-size=100MB,evict=lru cache.qcow2
>
> So it's the qemu-nbd process that performs the cache housekeeping work.
> The cache.qcow2 file itself just persists data and isn't aware of cache
> settings.

OK, this is better, since the user can also define a policy _and_ the cache can be shared by different VMs at creation time without races. With eviction policy 'none' combined with a cache size, only the first-accessed data blocks get cached, essentially providing the same functionality as this patch.
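To make the behaviour concrete, the copy-on-read population and the two eviction policies discussed above can be put together in a small simulation. This is a hypothetical sketch only: the class and parameter names are made up, a dict stands in for the qcow2 cache file, and QEMU's real implementation operates on clusters in the block layer.

```python
from collections import OrderedDict

class CacheImage:
    """Hypothetical simulation of the local cache.qcow2 layer: hits are
    served locally, misses are fetched from the backing file and
    populate the cache (copy-on-read), and cache_size is enforced by
    either the 'lru' policy or 'none' (stop caching once full)."""

    def __init__(self, backing, cache_size, policy="lru"):
        self.backing = backing            # block index -> data, e.g. the NFS template
        self.cache_size = cache_size      # maximum number of cached blocks
        self.policy = policy              # 'lru' or 'none'
        self.cache = OrderedDict()        # stands in for the local cache image

    def read(self, block):
        if block in self.cache:           # cache hit: served from local disk
            if self.policy == "lru":
                self.cache.move_to_end(block)
            return self.cache[block]
        data = self.backing[block]        # miss: the read goes to the backing file
        if len(self.cache) >= self.cache_size:
            if self.policy == "lru":
                self.cache.popitem(last=False)  # evict least recently used block
            else:
                return data               # 'none': only the first blocks are cached
        self.cache[block] = data          # copy-on-read populates the cache
        return data

    def discard(self, block):
        self.cache.pop(block, None)       # discard evicts data from the cache file
```

With policy 'none' the cache fills once and then stays fixed, which is what makes it equivalent to this patch's behaviour; 'lru' keeps adapting to the access pattern at the cost of housekeeping in the qemu-nbd process.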

Kaveh

