Re: [Gluster-devel] Re; Load balancing ...

gluster-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Gluster-devel] Re; Load balancing ...

From:	gordan
Subject:	Re: [Gluster-devel] Re; Load balancing ...
Date:	Wed, 30 Apr 2008 13:56:12 +0100 (BST)
User-agent:	Alpine 1.10 (LRH 962 2008-03-14)



On Wed, 30 Apr 2008, Gareth Bult wrote:

It would certainly ber beneficial in the cases when the network speedis slow (e.g. WAN replication).
So long as it's server side AFR and not client-side ... ?


Sure.

I'm guessing there would need to be some server side logic to ensurethat local servers generated their own hashes and only exchanged thehashes over the network rather than the data ?


Indeed - same as rsync does.

Journal per se wouldn't work, because that implies fixed size and write-ahead 
logging.
What would be required here is more like the snapshot style undo logging.


A journal wouldn't work ?!
You mean it's effectiveness would be governed by it's size?

Among other things. A "journal" just isn't suitable for this sort ofthing.

1) Categorically establish whether each server is connected and up to date
for the file being checked, and only log if the server has disconnected.
This involves overhead.
Surely you would log anyway, as there could easily be latency between anactual "down" and one's ability to detect it .. in which case detectingwhether a server has disconnected it a moot point.

Not really. A connected client/server will have a live/working TCPconnection open. Read-locks don't matter as they can be served locally,but when a write occurs, the file gets locked. If a remote machine doesn'tack the lock, and/or it's TCP connection resets, then it's safe to assumethat it's not connected.

In terms of theoverhead of logging, I guess this would be a decision for the sysadminconcerned, whether the overhead of logging to a journal was worthwhile.vs. the potential issues involved in recovering from an outage?

That complicates things further, then. You'd essentially have asynchronouslogging/replication. At that point you pretty much have to log all writesall the time. That means potentially huge space and speed overheads.

From my point of view, if journaling halved my write performance (whichit wouldn't) I wouldn't even have to think about it.

Actually, saving an undo-log a-la snapshots, which is what would berequired, _WOULD_ halve your write performance on all surviving servers ifone server was out. If multiple servers were out, you could probably workaround some of this with merging/splitting the undo logs for variousmachines, so your write performance would generally be around 1/2 ofstandard, but wouldn't end up degrading to 1/n+1 where n is the number offailed servers for which the logging needs to be done.

The problem that arises then is that the fast(er) resyncs on small changes
come at the cost of massive slowdown in operation when you have multiple
downed servers. As the number of servers grows, this rapidly stops being a
workable solution.
Ok, I don't know about anyone else, but my setups all rely onconsistency rather than peaks and troughs. I'd far rather run a journalat half potential speed, and have everything run at that speed all thetime .. than occasionally have to stop the entire setup while the systemrecovers, or essentially wait for 5-10 minutes while the system re-syncsafter a node is reloaded.

There may be a way to address the issue of halting the rest of the clusterduring the sync, though. Read lock on a syncing file shouldn't stop otherread locks. Of course, it will block writes while the file syncs and thereading app finishes the operation.


Gordan

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Gluster-devel] Re; Load balancing ..., (continued)
- Re: [Gluster-devel] Re; Load balancing ..., Gareth Bult, 2008/04/30
  - Re: [Gluster-devel] Re; Load balancing ..., gordan, 2008/04/30
- Re: [Gluster-devel] Re; Load balancing ..., Gareth Bult, 2008/04/30
  - Re: [Gluster-devel] Re; Load balancing ..., gordan <=
- Re: [Gluster-devel] Re; Load balancing ..., Gareth Bult, 2008/04/30
  - Re: [Gluster-devel] Re; Load balancing ..., Gareth Bult, 2008/04/30
  - Re: [Gluster-devel] Re; Load balancing ..., Mickey Mazarick, 2008/04/30
    - Re: [Gluster-devel] Re; Load balancing ..., gordan, 2008/04/30
- Re: [Gluster-devel] Re; Load balancing ..., Gareth Bult, 2008/04/30

Prev by Date: Re: [Gluster-devel] Re; Load balancing ...
Next by Date: RE: [Gluster-devel] AFR: machine crash hangs other mountsortransportendpoint not connected
Previous by thread: Re: [Gluster-devel] Re; Load balancing ...
Next by thread: Re: [Gluster-devel] Re; Load balancing ...
Index(es):
- Date
- Thread