Hi Team,
I am looking for disaster recovery process in case Primary datacenter is down and AG is moved to DR. I will have 2 node SQL FCI at primary and SQL standalone node (will run AG secondary replication). with shared storage. Quorum disk is shared storage disk for failover cluster in Primary DC. I am looking for a process that need to be followed and need to start by bringing down the node 1 and then node 2 and then how to bring the cluster up if both node 1 and node 2 is down in Primary DC. Node 3 will not be a owner of any instance in WSFC for SQL FCI. SQL node 3 has multiple standalone instances only
AG replication is Asynch between primary and DR nodes.
Node will not be owner and will not be have vote for cluster majority.
My understanding is as below, pls. share any inputs.
1. In Primary DC - Bring down the SQL node 1 - Result should be that SQL FCI should failover to node 2 and bring all the cluster resources online.
2. In Primary DC - Bring down the SQL node 2 - Result should be cluster down since both nodes are down
Query:
1. Since both nodes are down, quorum is down, Will Cluster still be accessible in DR from Node 3 from Cluster manager?
2. Since now both nodes are down in Primary, what should be the next step to bring the AG online in DR? login to each SQL instance and do failover (secondary AG will become primary)?
3. What should be the next step from here to resume the cluster on node 1 and node 2 in Primary DC?
4. Once cluster nodes are back, do we need to do failover again from DR to Primary for AG?
5. Any test plan that i can follow for testing AG with failover clusters? Any article/link
Any inputs will be appreciate. Thanks
Regards,