Mon Oct 27 11:04:13 PDT 2008
- Previous message: [Slony1-general] Set 2 not found in runtime configuration
- Next message: [Slony1-general] MIXED VERSIONS OF POSTGRES
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
OK, so blowing away all of the records in sl_event *did* get
replication started again.
However, as I said in the original post, this is not something I want
to try in production. Can anyone provide any information as to how to
get Slony back on track when replication gets messed up as I described
below? Of particular use would be how to identify problem sl_event
records (if doing that is the best way).
Thanks,
Michael
On Oct 23, 2008, at 12:38 PM, Michael Lorenz wrote:
> Hi all,
>
> I ran into a problem with Slony yesterday, and I'm hoping someone
> can help me resolve it. I've checked the mailing lists, searched
> the web, etc., but I didn't find a definitive answer anywhere. The
> closest set of messages I've found on this list can be found at: http://www.mail-archive.com/pgadmin-support@postgresql.org/msg07704.html
>
> What I was doing was adding a new table to the schema and trying to
> set up replication for it. I did it twice; once for a test table
> (which worked), then a second time with a different table (which
> didn't). The script I used looked like this:
>
> CREATE SET ( ID = 2, ORIGIN = 1, COMMENT = 'set comment' );
> SET ADD TABLE ( SET ID = 2, ORIGIN = 1, ID = 39, FULLY QUALIFIED
> NAME = 'public.testtable', COMMENT = 'table comment' );
> SUBSCRIBE SET (ID = 2, PROVIDER = 1, RECEIVER = 2);
> SYNC ( ID = 1 );
> WAIT FOR EVENT (ORIGIN = 1, CONFIRMED = 2);
> MERGE SET ( ID = 1, ADD ID = 2, ORIGIN = 1 );
>
> Now, I'm not sure if this is relevant, but I tried to add the second
> table about 20-30 minutes after the first one. As far as I can
> tell, the MERGE SET from the first table had completed, since there
> was no set 2 in the sl_set table anymore.
>
> Replication has stopped since that error happened, and here's what
> keeps on showing up in the log now:
>
> 2008-10-23 13:26:47 GMT DEBUG2 localListenThread: Received event
> 2,1082313 SYNC
> 2008-10-23 13:26:50 GMT DEBUG2 remoteListenThread_1: LISTEN
> 2008-10-23 13:26:50 GMT DEBUG1 copy_set 2
> 2008-10-23 13:26:50 GMT ERROR remoteWorkerThread_1: set 2 not
> found in runtime configuration
> 2008-10-23 13:26:50 GMT WARN remoteWorkerThread_1: data copy for
> set 2 failed - sleep 60 seconds
>
> Short of doing a node subscribe/unsubscribe (which I don't want to
> try if this happens in production), is there some way to get this
> cleaned up and replication started again? If it means deleting some
> record(s) from sl_event, is there a good way to identify which is/
> are the offending records to remove? Or, since replication seems to
> be frozen, could I just blow away all records in sl_event on the
> slave, or...?
>
> Thanks,
> Michael
>
> _______________________________________________
> Slony1-general mailing list
> Slony1-general at lists.slony.info
> http://lists.slony.info/mailman/listinfo/slony1-general
>
Michael Lorenz
evryx technologies inc.
412 w. broadway, suite 201
glendale, california 91204
818.552.3568 x117 www.evryx.com
- Previous message: [Slony1-general] Set 2 not found in runtime configuration
- Next message: [Slony1-general] MIXED VERSIONS OF POSTGRES
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Slony1-general mailing list