[Slony1-general] Working out who is master after failover

Tue May 3 13:07:37 PDT 2005

I think you realize by now that this is a functionality that falls into 
the category of network monitoring.

When node 2 loses connectivity to node 1, it has no chance to detect if 
the client application lost connectivity too (short of asking it). The 
client and node 2 can have different network paths to node 1, so it 
might be just the 1<->2 connection that's broken.

When your "network monitor" detects that either the client or node 2 
can't talk to node 1 any more, "it" has to make sure that they will not 
do so all of the sudden (reconfigure switches or iptables), fire the 
slonik script telling node 2 to take over, and reconfigure something 
like pgpool to redirect the client to the new master. This needs very 
detailed knowledge of the involved hard- and software components in your 
network and usually kernel-level privileges. Network monitoring is a 
whole separate software, more complex than a replication system by 
itself. I don't think it is reasonable to attempt building that into 
Slony-I.

In any case, this still means that you can have transactions that the 
client successfully committed against node 1, but that not yet have been 
replicated to node 2. This is the very nature of asynchronous design and 
therefore can by no means be "fixed" in Slony-I.

Jan

On 5/3/2005 4:22 AM, Julian Scarfe wrote:

>>> However, after a failover, there's a problem.  Both nodes think
>>> they're the master node according to the criteria above, because the
>>> old master hasn't received the news of his demotion.  Is there a
>>> simple way of working out on the "abandoned" node that it has indeed
>>> been abandoned?
> 
>> Consider some scenarios:
>>
>> 1.  Node #1 is in Ottawa; other nodes are in Toronto.  Failover due to
>> persistent network failure.
>>
>> The network falls over, and we decide that the Ottawa data source must
>> be abandoned.  The database host and its database is undamaged, and
>> since we had no way to communicate with node #1 that it got stepped on,
>> it thinks it's running fine.
>>
>> Note that in this case, no data was ever corrupted in any way.
>>
>> Note also that since the network was dead, we had no way to tell node #1
>> that it is has been abandoned.
>>
>> Supposing there are some client machines in Ottawa, they might be able
>> to talk to node #1 even after it is abandoned, as they were on the
>> subnet there.  Could be trouble...
> 
> Yup, that's the scenario I'm worried about.
> 
> My application has a somewhat special view of what "corrupted" means.  We 
> have a constant stream of write-once-read-many perishable data.  If I lose 
> 10 minutes of data, the entire database might as well be corrupted, because 
> it's incomplete (which is in many ways worse than unavailable).
> 
> If I failover from node 1 to node 2 as soon as I've detected a failure of 
> connectivity of node 1, node 2 can takeover as master with no data loss.  I 
> want my client applications that feed the data in to *realise* that node 2 
> is now master and the data can be fed to it, not node 1.  I'd prefer that 
> realisation to be stateless -- i.e. they don't have to remember who is 
> currently master.
> 
> The nightmare is that node 1 suddenly comes back online, and the 
> applications start feeding data to node 1 instead of node 2 because node 1 
> is then looks like a master node.  I then have *two* incomplete databases!
> 
> One workaround would be for the applications feeding the data to check that 
> there is one and only one master by executing the query
> 
> select "_T1".getlocalnodeid('_T1') =
> (select set_origin from"_T1".sl_set where set_id = 1)
> 
> on *all* the hosts every time they want to write data.    If more that one 
> claims to be master, something's wrong.  But that's a little inefficient 
> when the first host they try is usually the master.
> 
> Julian 
> 
> 
> _______________________________________________
> Slony1-general mailing list
> Slony1-general at gborg.postgresql.org
> http://gborg.postgresql.org/mailman/listinfo/slony1-general

-- 
#======================================================================#
# It's easier to get forgiveness for being wrong than for being right. #
# Let's break this rule - forgive me.                                  #
#================================================== JanWieck at Yahoo.com #