Re: [Gluster-devel] Performance tuning for MySQL


From: Gordan Bobic
Subject: Re: [Gluster-devel] Performance tuning for MySQL
Date: Wed, 11 Feb 2009 09:18:23 +0000
User-agent: Thunderbird 2.0.0.19 (X11/20090107)

David Sickmiller wrote:

I'm running 2.0rc1 with the 2.6.27 kernel. I have a 2-node cluster. GlusterFS runs on both nodes, and MySQL runs on the active node. If the active node fails or is put on standby, MySQL fires up on the other node. Unlike MySQL Replication with its slave lag, I know my data changes are durable in the event of a server failure. Most people use DRBD for this, but I'm hoping to enjoy GlusterFS's benefits of handling split-brain situations at the file level instead of the volume level, future scalability avenues, and general ease of use. Hopefully DRBD doesn't have unmatchable performance advantages I'm overlooking.

Note that DRBD resync is more efficient - it only resyncs dirty blocks, which, in the case of big databases, can be much faster. Gluster will copy the whole file.

I'm going to report my testing in order, because the changes were cumulative. I used server-side io-threads from the start. Before I started recording the speed, I discovered that running in single process mode was dramatically faster. At that time, I also configured read-subvolume to use the local server. At this point I started measuring:

    * Printing schema: 18s
    * Compressed export: 2m45s
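
For reference, the relevant volfile fragments look roughly like this (hostnames, paths, volume names and thread-count are stand-ins rather than my actual config, and worth checking against the 2.0 docs):

    # server side: posix brick wrapped in locks and io-threads
    volume posix
      type storage/posix
      option directory /data/export
    end-volume

    volume locks
      type features/locks
      subvolumes posix
    end-volume

    volume brick
      type performance/io-threads
      option thread-count 8
      subvolumes locks
    end-volume

    # client side: replicate across both nodes, preferring the local copy for reads
    volume replicate
      type cluster/replicate
      option read-subvolume local-brick
      subvolumes local-brick remote-brick
    end-volume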

For a benchmark, I moved MySQL's datafiles to the local ext3 disk (but kept writing the export to GlusterFS). It was 10-100X faster!

    * Printing schema: 0.2s
    * Compressed export: 28s

Did you flush the caches between the tries? What is your network connection between the nodes?
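
Something like this between runs (as root) is the usual way to get cold-cache numbers:

    sync
    echo 3 > /proc/sys/vm/drop_caches   # drop page cache, dentries and inodes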

There were no appreciable changes from installing fuse-2.7.4glfs11, using Booster, or running blockdev to increase readahead from 256 to 16384.
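
(The readahead setting is per device and counted in 512-byte sectors; I changed it along these lines, with /dev/sda standing in for the real device:)

    blockdev --getra /dev/sda           # shows the current value, e.g. 256
    blockdev --setra 16384 /dev/sda     # the larger value tried above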

Adding the io-cache client-side translator didn't affect printing the schema but cut the export time in half:

    * Compressed export: 1m10s
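
The io-cache fragment on the client side is along these lines (the cache-size value is only an example):

    volume iocache
      type performance/io-cache
      option cache-size 64MB
      subvolumes replicate
    end-volume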

Going off on a tangent, I shut down the remote node. This increased the performance by an order of magnitude:

    * Printing schema: 2s
    * Compressed export: 24s

What is the ping time between the servers? Have you measured the throughput between the servers with something like ftp on big files? Is it the writes or the reads that slow down? Try dumping to ext3 from gluster.
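
Something quick along these lines would answer both (server2 standing in for the remote node; some nc builds want "-l -p 5001"):

    ping -c 10 server2                                    # round-trip latency
    # on server2:  nc -l 5001 > /dev/null
    dd if=/dev/zero bs=1M count=1024 | nc server2 5001    # pushes ~1GiB, dd reports the rate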

I resumed testing with both servers running. Switching the I/O scheduler to deadline had no appreciable effect. Neither did adding client-side io-threads or server-side write-behind. Surprisingly, I found that changing read-subvolume to the remote server had only a minor penalty.
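
(For the record, the scheduler switch is per device, with sda standing in for the actual disk:)

    cat /sys/block/sda/queue/scheduler              # current choice shown in brackets
    echo deadline > /sys/block/sda/queue/scheduler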

Are you using single process client/server on each node, or separate client and server processes on both nodes?

Then I noticed that the remote server was listed first in the volfile, which means that it gets used for the lock server. Swapping the order in the volfile on one server seemed to cause split-brain errors -- does the order need to be the same on both servers?

Yes, the first server listed is the lock server. If you list them in a different order on each server, locking will break. The order listed is the locking server fail-over order.
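
In other words, the subvolumes line of the replicate volume has to read identically on both nodes, something like:

    volume replicate
      type cluster/replicate
      # the first subvolume listed acts as the lock server
      subvolumes node1-brick node2-brick
    end-volume

with node1-brick/node2-brick standing in for whatever your bricks are called.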

When I changed both servers' volfiles to use the active MySQL server as the lock server, there was a dramatic performance increase, to roughly the 2s/24s speed I saw with one server down. (I lost the exact stats.)

In summary, running in single process mode, client-side io-cache, and a local lock server were the changes that made a significant difference.

That makes sense, especially the local lock server. The time it takes to write a lock to page cache is going to be orders of magnitude faster than the ping time, even on gigabit ethernet.

Since I'm only going to have one server writing to the filesystem at a time, I could mount it read-only (or not at all) on the other server. Would that mean I could safely set data-lock-server-count=0 and entry-lock-server-count=0 because I can be confident that there won't be any conflicting writes? I don't want to take unnecessary risks, but it seems like unnecessary overhead for my use case.
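
Concretely, I'm thinking of something like this in the replicate volume (just a sketch; I'd double-check the option names against the release before relying on it):

    volume replicate
      type cluster/replicate
      option data-lock-server-count 0
      option entry-lock-server-count 0
      subvolumes node1-brick node2-brick
    end-volume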

Hmm... If the 1st server fails, the lock server role will fail over to the next one, and you then fire up MySQL there. I thought you said it was only the 2nd server that suffers the penalty. Since the 2nd server will take over locking from the 1st if the 1st fails, the performance should be the same after fail-over. You'll still have the active server being the lock server.

Gordan



