Hi There,
I have a 3 node AlwaysOn Availability Group cluster. Each node has a vote for quorum and there is no witness.
Nodes 1 and 2 are in the same data center, while node 3 is in a remote data center connected by a high-speed layer 2 link. All nodes are on the same subnet. Node 1 is the primary for AG1 and node 2 is the primary for AG2, while node 3 is only a secondary for both AGs.
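To sanity-check my understanding of why a single node loss shouldn't take the cluster down, here's the node-majority quorum arithmetic as I understand it (a rough sketch of my own, not output from the cluster):

```python
# Rough sketch of node-majority quorum arithmetic for our 3-node,
# no-witness cluster. Purely illustrative; names are my own.

def has_quorum(total_votes: int, votes_online: int) -> bool:
    """Node-majority: the surviving partition needs more than half of all votes."""
    return votes_online > total_votes // 2

TOTAL = 3  # node1, node2 and node3 each carry one vote, no witness

# Node 3 crashes: nodes 1 and 2 still hold 2 of 3 votes.
print(has_quorum(TOTAL, 2))  # True -> the cluster should survive

# Only the loss of a second node should cost us quorum.
print(has_quorum(TOTAL, 1))  # False
```

So by my reckoning, with 2 of 3 votes still online, nodes 1 and 2 should have kept quorum and stayed up.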
The other night node 3 crashed (the hypervisor went down), and a few minutes later nodes 1 and 2 both failed at exactly the same time. That seems odd, as the whole point of the cluster is to avoid this scenario.
In the system event log I can see the below error on nodes 1 and 2, which I believe is the result of a failed health check.
Looking in the FailoverClustering diagnostic log on nodes 1 and 2, I see this error just a second later, after which the cluster service is terminated on both. I've searched the internet for references to "GumLockIsStuck" and come back with nothing, but I believe it relates to updating the cluster database with the cluster status on all nodes.
Because the above error referred to nothing being reported for 600 seconds, I went back 10 minutes in the same log and could see a whole bunch of errors like this one, stating that nodes 1 and 2 were unable to receive an ACK from node 3, which of course had crashed around that time.
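For what it's worth, here's my mental model of what those ACK errors describe: an update that holds a lock until every node in the current membership view acknowledges it, so a dead node that hasn't been removed from the view stalls the update until a watchdog gives up. This is entirely my own toy sketch, not anything from the actual clustering code:

```python
# Toy model of an all-nodes-must-ack global update (my guess at what the
# "GumLockIsStuck" error describes). A sender keeps retrying until every
# node in the membership view has acked; a dead node never acks, so the
# update stalls until a watchdog timeout fires. Entirely illustrative.

import time

WATCHDOG_TIMEOUT = 0.2  # stand-in for the 600-second limit in the log

def run_global_update(membership, acker, timeout=WATCHDOG_TIMEOUT):
    """Return True if every node acks before the timeout, else False."""
    deadline = time.monotonic() + timeout
    pending = set(membership)
    while pending:
        if time.monotonic() > deadline:
            return False          # stuck: the update never completed
        node = next(iter(pending))
        if acker(node):
            pending.discard(node) # got the ACK from this node
        # a dead node never acks, so it stays in `pending` forever
    return True

alive = {"node1", "node2"}

# Node 3 crashed but is still in the membership view -> update stalls.
ok = run_global_update({"node1", "node2", "node3"}, lambda n: n in alive)
print(ok)  # False: the stalled update looks just like our failure
```

If that model is anywhere near right, it would explain the 600-second gap between node 3 crashing and nodes 1 and 2 terminating, but not why the surviving majority killed itself rather than evicting node 3.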
So what appears to have happened is this: node 3 went down, nodes 1 and 2 could no longer update the status of the cluster, and because of that they stopped the cluster service on themselves. This doesn't seem right at all, so can anyone explain this behavior?
Cheers
C