Re: [Gluster-devel] replication client or server side

gluster-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Gluster-devel] replication client or server side

From:	Kevan Benson
Subject:	Re: [Gluster-devel] replication client or server side
Date:	Tue, 16 Oct 2007 10:35:37 -0700
User-agent:	Thunderbird 2.0.0.6 (X11/20070728)

Vincent Régnard wrote:

Hi all,

I am presently re-thinking the way we are using glusterfs 1.3.5 here. We
are doing replication (*3 with 3 bricks) on client side to produce a
small HA cluster. We are planning to extend the brick number. Drawing
that again and looking at some examples on the wiki doing it another way
(Kritical's tutorial), we are wondering wether doing the replication
(AFR) on the server side (glusterfsd) would be more suitable than doing
it on the client side ? Have you any experience or remark on that ? Does
this have performance impact in your opinion ?

If replication is transfered to server side, we'll have to use
unification on client side to achive HA (and then obtain active
self-heal?). Is this latter configuration reasonable ?


Present configuration:

Client stack:    FUSE
        PERFORMANCE TRANSLATORS (write-b/io-cache/io-thread)
        AFR
        CLIENT TRANSPORT

Server stack:    SERVER TRANSPORT
        PERFORMANCE TRANSLATOR (io-thread)
        POSIX LOCKS FEATURE
        POSIX STORAGE


Planned configuration:

Client stack:    FUSE
        PERFORMANCE TRANSLATORS (write-b/io-cache/io-thread)
        UNIFY
        CLIENT TRANSPORT

Server stack:    SERVER TRANSPORT
        PERFORMANCE TRANSLATOR (io-thread)
        AFR
        POSIX LOCKS FEATURE
        POSIX STORAGE

Vincent

I find using AFR and Unify from the client yields a more robust configwith respect to high availability, but using unify on the clientcomplicates the configs and file storage (it necessitates splitting theshare between a main and AFR split per server). It may be possible tooverload the AFR definitions to get around this, I haven't tried thatyet. It's also possible that tweaking the timeout values for the clientand server to make the server timeout before the client might yield amore stable config.

Performance wise, moving AFR to the server side will allow you structurethe network for more performance, such as implementing a secondarynetwork to handle all the AFR traffic. As it is now (with you doingeverything on the client), your writes are constrained to 1/3 of thetotal available network bandwidth, since you have to write each file 3times. By moving the AFR to the server and implementing a secondnetwork to carry the AFR traffic, you could increase your theoreticalnetwork performance by 50% (if the AFR network is the same speed as theclient network connection, and you want data stored on 3 servers).

It seems like every other day I think of a new way to set up glusterfs.I have to say this is the most fun I've had with a software product insome time. ;)


--

-Kevan Benson
-A-1 Networks

[Prev in Thread]

Current Thread

[Next in Thread]

[Gluster-devel] replication client or server side, Vincent Régnard, 2007/10/16
- Re: [Gluster-devel] replication client or server side, Kevan Benson <=
  - Re: [Gluster-devel] replication client or server side, Krishna Srinivas, 2007/10/18

Prev by Date: [Gluster-devel] config parameter list
Next by Date: [Gluster-devel] Segfault
Previous by thread: [Gluster-devel] replication client or server side
Next by thread: Re: [Gluster-devel] replication client or server side
Index(es):
- Date
- Thread