Re: [Gluster-devel] trusted.glusterfs.version xattr

gluster-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Gluster-devel] trusted.glusterfs.version xattr

From:	Gordan Bobic
Subject:	Re: [Gluster-devel] trusted.glusterfs.version xattr
Date:	Tue, 06 May 2008 23:59:20 +0100
User-agent:	Thunderbird 1.5.0.12 (X11/20080430)

Kevan Benson wrote:

Gordan Bobic wrote:
I suspect this isn't a problem that can be solved without having aproper journal of metadata per directory, so that upon connection, thewhole journal can be replayed.
You could sort of bodge it and use timestamps as the primary versionand the xattr version as secondary, bit that is no less dangerous - itonly takes one machine to be out of sync, and we are again looking atmassive scope for data loss.
You could bodge the bodge further to work around this by ensuring thatthe nodes are heartbeating current times to sync between them andwithout the sync no data exchange takes place. But that thencomplicates things because what do you do when a node connects and isout of sync, but in the future? Who wins on time sync? Who has thelatest authoritative copy?
I think the most sane way of addressing this is to have a fully loggeddirectory metadata journal. But then we are back to the journallingfor fast updates issue with a journal shadow volume, which isnon-trivial to implement.
Unless there is some kind of a major mitigating circumstance, it seemsthat between this and the race condition that Martin is talking abouton the other thread, GlusterFS in it's current is just too dangerousto use in most environments that I can think of. And unlike Gareth afew days ago, I'm not talking about performance issues - I'm talkingabout scope for data loss in very valid and very common use cases. :'(
Hmm, what about trusted.glusterfs.createtime (epoch time) as a majorversion number, and trusted.glusterfs.version as the minor versionnumber. Couple that with a glusterfs master time node (defaults to locknode) and you should have a fairly consistent cluster, right?


There are several problems with this:

1) The concept of the "lock node" is limiting. The locking should bedistributed.2) Using creation/modification time as the major number is problematicdue to time syncing. What happens when the master node goes offline? Ifthe nodes are in not in perfect time sync, you've still got the sameproblem.3) "fairly consistent" is _really_ not good enough when we are talkingabout a file system.

IMO, it would be better to come up with a design that solves the problemonce and for all. The order of priorities really has to be: consistency,reliability, performance.

If that isn't the case, you might as well be using a distributed hashtable and hope that you'll get most of the data back most of the time.


Gordan

[Prev in Thread]

Current Thread

[Next in Thread]

[Gluster-devel] trusted.glusterfs.version xattr, gordan, 2008/05/06
- Re: [Gluster-devel] trusted.glusterfs.version xattr, Amar S. Tumballi, 2008/05/06
  - Re: [Gluster-devel] trusted.glusterfs.version xattr, Gordan Bobic, 2008/05/06
    - Re: [Gluster-devel] trusted.glusterfs.version xattr, Kevan Benson, 2008/05/06
    - Re: [Gluster-devel] trusted.glusterfs.version xattr, Gordan Bobic <=
    - Re: [Gluster-devel] trusted.glusterfs.version xattr, Krishna Srinivas, 2008/05/07
    - Re: [Gluster-devel] trusted.glusterfs.version xattr, gordan, 2008/05/07
    - Re: [Gluster-devel] trusted.glusterfs.version xattr, Martin Fick, 2008/05/07
- Re: [Gluster-devel] trusted.glusterfs.version xattr, Gordan Bobic, 2008/05/07
  - Re: [Gluster-devel] trusted.glusterfs.version xattr, Martin Fick, 2008/05/07
    - Re: [Gluster-devel] trusted.glusterfs.version xattr, Gordan Bobic, 2008/05/07
    - Re: [Gluster-devel] trusted.glusterfs.version xattr, Martin Fick, 2008/05/07
    - Re: [Gluster-devel] trusted.glusterfs.version xattr, Derek Price, 2008/05/08
    - Re: [Gluster-devel] trusted.glusterfs.version xattr, Martin Fick, 2008/05/08
    - Re: [Gluster-devel] trusted.glusterfs.version xattr, Derek Price, 2008/05/08

Prev by Date: Re: [Gluster-devel] trusted.glusterfs.version xattr
Next by Date: [Gluster-devel] Client side afr, locking, race condition, simultanous writes, out of sync
Previous by thread: Re: [Gluster-devel] trusted.glusterfs.version xattr
Next by thread: Re: [Gluster-devel] trusted.glusterfs.version xattr
Index(es):
- Date
- Thread