Rebuild/Recover for SQL cluster after cluster unable to bring up

June 2, 2016, 11:05 pm

≫ Next: Primary replica can't online when clustered instance and AlwaysON use together

≪ Previous: Always On High Availability Help

Hi All,

Current environment as below.

2 x physical nodes running windows 2008 cluster with san disks and quorum disk, running application and SQL/Oracle database.

My domain admin accidentally delete both cluster node computer object and cluster admin user from AD, when domain admin aware on this, domain admin unjoin the both cluster node server from domain and rejoin back to domain, recreate the new AD user. Now the problem is when try to perform failover of application and database it will failed and cluster group unable to bring up. It was giving many error message when online the resources group (permission errors, CNO error and etc). Currently i down one of the cluster node (node02) and manually up the application and database run in on node01.

My plan is to perform full backup of server OS, SQL database, remove node, disk and application from cluster and force clean up the cluster. After his, unjoin and rejoin both nodes back to domain, reconfigure the cluster.

I'm not so sure what to do with SQL and SQL database? Do you have any idea what to do with SQL cluster? What to do with existing cluster SQL database before proceed on cluster clean up?

Do i need to remove SQL from node when perform cluster clean up or before unjoin domain ? Rerun the SQL/Oracle setup after rejoin the domain, after this restore the database?Anyone can share out the steps how to do fix this issue?

Regards,

↧

Primary replica can't online when clustered instance and AlwaysON use together

June 3, 2016, 7:47 pm

≫ Next: Move Windows Cluster resources manually

≪ Previous: Rebuild/Recover for SQL cluster after cluster unable to bring up

Hi All,
We have met a strange big problem on SQL Server 2014(and SQL Server 2012) AlwaysON. The details are:

For example we have a three node cluster WinclsCluster:
Two are an active (name: node-active) and passive (name: passive-node) node for a clustered sql server instance named ClsSQL, and the third node (name: node-three) installed a default non-clustered sql instance. OS is Windows Server Enterprise 2008 R2 SP1. Two sql instances configure an AlwaysON group: AGtest.

Clssql is current on node_active. We manually failover the Clssql from node-active to node-passive, it is successful, so the ClsSQL and AGtest are both on node-passive.

But if I failover the ClsSQL from node-passive to node-active, the problem comes: ClsSQL failover succeed, but the AGtest remains on node-passive, and the strange is the AGtest still show online, but the database on ClsSQL is in a recover pending state, can't be accessed read or write. In this situation, if sql server is 2014 ,the node-three can access read-only.

In ClsSQL errolog, it has a log:
The state of the local availability replica in availability group 'AGtest' has changed from 'NOT_AVAILABLE' to 'RESOLVING_NORMAL'. The replica state changed because of a startup, a failover, a communication issue, or a cluster error. For more information, see the availability group dashboard, SQL Server error log, Windows Server Failover Cluster management console or Windows Server Failover Cluster log.
But the sql errolog doesn't tell why AlwaysON Group can't change from RESOLVING_NORMAL to PRIMARY_PENDING, and the cluster log doesn't have any meaningful log either.

To fix this problem, you must either execute the AlwaysON group failover tsql command at ClsSQL, or fail the AGtest:
-- // at clsssql
alter availability group test failover

-- // at cmd
Cluster.exe res agtest /fail

This problem can be re-produced easily, and this greatly reduce the HA.

Many thanks, and please forgive my poor English.

↧

Move Windows Cluster resources manually

June 6, 2016, 12:49 am

≫ Next: change starting number in a table

≪ Previous: Primary replica can't online when clustered instance and AlwaysON use together

Hi,

On windows server 2012 i have created 2 node cluster for sql server 2012. i have 04 disk 1-Quorum 2- MSDTC 3- Data 4-Log.

Cluster created successfully, SQL Server installation on both node done successfully.

When i restart node 1 all resources shifted to node 2. When start node 2 than all resources shift to node 1.

problem is that when i manually shift cluster resources than only quorum disk shifted to node 2 but other node does not shift. i have to manually select SQL server role to shift to node 2 same as with msdtc role.

pls guide what could be an issue that why on manually shifting cluster resource all resource are not being shifted.

thx

↧

change starting number in a table

June 6, 2016, 10:14 am

≫ Next: SQL 2016 Failover Cluster Issues

≪ Previous: Move Windows Cluster resources manually

How do you change the starting number in a table with auto increment?

↧

SQL 2016 Failover Cluster Issues

June 6, 2016, 4:50 pm

≫ Next: do i need to open port for listener(always on group listener) ip also.?

≪ Previous: change starting number in a table

I have a Hyper V guest Cluster consisting of two Server 2012 R2 VMs with SQL 2016 setup as a failover cluster. The cluster has four SQL instances on it. After installing the first instance, everything was working fine. But when I installed a second, instance, and then all the rest, none of the instances will drain/failover other than the very first instance I installed. The cluster validation comes both fine other than I don't have two network paths, but that is not an issue in this case. Below is the list of cluster errors I get when when I try to drain an instance over:

Both servers all completely patched and are brand new builds. When I drain the first instance that I setup on the cluster, I have no problems, it works just like it is supposed to. It is just the other three that fail. It does not seem to be related to a host since there are two instances on each server, but only one of the four will move and it easily moves back and forth.

Does anybody have any ideas what could be going on here?

Thank you,

Ryan

↧

do i need to open port for listener(always on group listener) ip also.?

June 6, 2016, 5:19 am

≫ Next: SQL 2016 Basic Availability Groups - worked until first restart

≪ Previous: SQL 2016 Failover Cluster Issues

I have implemented always on availability group architecture with 2 nodes,

My application server is in different (DMZ) zone and DB server(With always on) is in different zone,

i have to connect DB server from application server with listener name,( i have configured listener),

i need to open port 1433 for which all nodes? i have already open port 1433 for primary and secondary node,

do i need to have port open for listener also along with the nodes?

↧

SQL 2016 Basic Availability Groups - worked until first restart

June 7, 2016, 1:13 am

≫ Next: Control database on both replicas?

≪ Previous: do i need to open port for listener(always on group listener) ip also.?

Hi,

I'am testing SQL 2016 Basic Availability Groups

I created Test AG, everything shows OK but after restart, secondary server lost connection

In AG dashboard I see error:

The availability group is not ready for automatic failover. The primary replica and a secondary replica are configured for automatic failover, however, the secondary replica is not ready for an automatic failover. Possibly the secondary replica is unavailable, or its data synchronization state is currently not in the SYNCHRONIZED synchronization state.

Maybe I something miss

I used scripts:

---on test1 server

USE master
GO

CREATE AVAILABILITY GROUP TestAG
WITH
(
AUTOMATED_BACKUP_PREFERENCE = PRIMARY,
BASIC,
DB_FAILOVER = OFF,
DTC_SUPPORT = NONE
)
FOR DATABASE [testdb1]
REPLICA ON
'sql16-test1' WITH
(
ENDPOINT_URL = 'TCP://sql16-test1.mydomain.internal:5022',
FAILOVER_MODE = AUTOMATIC,
AVAILABILITY_MODE = SYNCHRONOUS_COMMIT,
SECONDARY_ROLE
(
ALLOW_CONNECTIONS = NO
)
),
'sql16-test2' WITH
(
ENDPOINT_URL = 'TCP://sql16-test2.mydomain.internal:5022',
FAILOVER_MODE = AUTOMATIC,
AVAILABILITY_MODE = SYNCHRONOUS_COMMIT,
SECONDARY_ROLE
(
ALLOW_CONNECTIONS = NO
)
)
GO

----- on Test2 server

ALTER AVAILABILITY GROUP [TestAG] JOIN
GO

↧

Control database on both replicas?

June 7, 2016, 3:35 am

≫ Next: WIndows 2012 R2 - SQL Always On cluster without shared storage

≪ Previous: SQL 2016 Basic Availability Groups - worked until first restart

Dear all,

I am running a 2014 Always On synchronous.

My goal is test the workload/pattern use/performance and other typical activites through a user database created by myself.

My concern is, where must I create this database? Both?

My idea would be just create this database in one node concrete as “not synchronized” out of Availability groups.

How can I achieve that?

I mean, is there some way for avoiding this control clause:

fn_hadr_backup_is_preferred_replica(your_database) = 1

“are you in the secondary”

then Sql Server Agent jobs and stored procedures are in the wrong server.

ONCE HERE, HOW CAN I GET THE INFO FOR THE ANOTHER SERVER, USING A LINK CONNECTION?

ELSE

“are you in the primary”.

then Sql Server Agent jobs and stored procedures attack the current one

If this plan is wrong, suppose I will have to create the database in both servers and move into Availability Group.

Thanks indeed for your inputs and hints related to,

↧

WIndows 2012 R2 - SQL Always On cluster without shared storage

June 7, 2016, 10:26 am

≫ Next: How to read SqlInstToNodeMap using ClusterRegQueryValue or similar api

≪ Previous: Control database on both replicas?

Hi Team,

Can anyone provide the guide to setup SQL AO cluster on windows 2012 r2 without shared storage.

Regards,

↧

How to read SqlInstToNodeMap using ClusterRegQueryValue or similar api

June 7, 2016, 5:48 am

≫ Next: Can not create Listner for High availability group of Always on in SQL 2012 on cluster environment

≪ Previous: WIndows 2012 R2 - SQL Always On cluster without shared storage

SqlInstToNodeMap represents map for cluster resource for availability group to sql server instance name[s]

For such a resource, I want to get the map using some C API like ClusterRegQueryValue

[The other bad approach is to login and query each and every sql server instance and check for the availability group name]

Thanks all,

Vasim.

↧

Can not create Listner for High availability group of Always on in SQL 2012 on cluster environment

May 27, 2015, 10:47 am

≫ Next: SQL move from failover cluster to standalone

≪ Previous: How to read SqlInstToNodeMap using ClusterRegQueryValue or similar api

I have getting issues when i am creating listener for always On

Error shown as below

Can not bring the Windows server fail over cluster (WSFC) resources online. (Error Code 5942). The WSFC service may not be running or may not be accessible in its currents states, or the WSFC resources may not be in a state that could accept the request. For information about this error code see "system error code" in windows development documentation

The attempt to create network name and IP address for the listener is failed. The WSFC service may not be running or may not be accessible in its currents states or the value provide for the network name and IP address may be incorrect. Check the state of the WSFC cluster and validate network name and IP address with network administrator. (Microsoft SQL Server error 41066)

Please suggest what to do to ressolve this issues.

Thank You

Nirav Bhatt

↧

SQL move from failover cluster to standalone

June 8, 2016, 10:39 pm

≫ Next: Transient Log Shipping Restore Error - Could not redo log record (172980:741440:22), for transaction ID (0:1889746485), on page (1:28106052)

≪ Previous: Can not create Listner for High availability group of Always on in SQL 2012 on cluster environment

Hi,

We have Two node Windows 2008 R2 failover cluster with SQL service.

Physical mode RDM disks for Quorum and Databases connect to virtual machines through Vmware 5.5 environment. So we cannot use snapshots for backup and plan to destroy failover cluster ant live one standalone SQL.

How can I destroy cluster and leave one of the node to working as standalone SQL server? Is it supported way?Thank you

↧

Transient Log Shipping Restore Error - Could not redo log record (172980:741440:22), for transaction ID (0:1889746485), on page (1:28106052)

June 9, 2016, 8:36 am

≫ Next: Building a SQL2014 HA cluster looking for recommendations

≪ Previous: SQL move from failover cluster to standalone

I'm running SQL Server 2008 r2 sp3 and am taking transaction log backups every 5 minutes from my primary server in a log shipping scenario. I have multiple standby servers that are log shipped to. On one of the servers, call it server A, I am randomly receiving the following error:

Could not redo log record (172980:741440:22), for transaction ID (0:1889746485), on page (1:28106052), database 'ABCDE' (database ID 5). Page: LSN = (172980:698240:6), type = 2. Log: OpCode = 2, context 3, PrevPageLSN: (172980:739640:5). Restore from a backup of the database, or repair the database.

The transaction log backup file isn't corrupted, because if I restore a backup of the primary database and run log shipping again, it will proceed without any issues. The log backup file itself is about 7.5MB compressed. My other standby servers are not having any issues. I log ship about a dozen different databases, and they each randomly experience this issue at random times.

I recently migrated from one primary SQL Server to another (different OS server, different SAN). Prior to the migration, I was having the above issue on a different SQL Server, call it server B. Since the migration, server B has been fine, but now server A is experiencing the issue.

↧

Building a SQL2014 HA cluster looking for recommendations

June 9, 2016, 8:21 am

≫ Next: DBCC LOGINFO on AlwaysON Availability Group Database

≪ Previous: Transient Log Shipping Restore Error - Could not redo log record (172980:741440:22), for transaction ID (0:1889746485), on page (1:28106052)

Here are my requirements. I have 2 data centers, and they are only a few miles apart with nearly no latency between centers. Both are on the same subnet. We need 2 SQL servers at each location. Plan is to have a primary at data center #1, with a replica of it here and the 2 at other data center #2 will be replicas of the primary from data center #1.

The requirement is we have a failover at the primary data center and if we lose the entire datacenter, we have a primary and a secondary at the second data center.

To clarify:
SQL1, data center 1 PRIMARY
SQL2, data center 1 has replica of SQL1
SQL3, data center 2 has replica of SQL1
SQL4, data center 2 has replica of SQL1

So do I set up a Sync copy locally at DC1 and a Sync copy at DC2 and then an ASync copy at SC2 on the 4th SQL server or just how to do I do this?

Thanks
Dale Landry

↧

DBCC LOGINFO on AlwaysON Availability Group Database

June 9, 2016, 12:35 pm

≫ Next: Monitoring Log Send Queue / Redo Queue Sizes

≪ Previous: Building a SQL2014 HA cluster looking for recommendations

In SQL 2014 - AAG - a database is configured for AAG in synchronous mode with three replicas.

after doing the log backup - when I check the DBCC loginfo - the status is still shows up as 2 for all VLFs.
I manually verified and found that the databases are completely in sync.

I also noticed that the log however is being reused - which makes me think the VLF are somehow being marked for "reuse"
but not showing up in dbcc loginfo.
Can someone please clarify this behaviour and if this is excepted behavior.

Hope it Helps!!

↧

Monitoring Log Send Queue / Redo Queue Sizes

June 9, 2016, 7:29 am

≫ Next: AlwaysOn Availability

≪ Previous: DBCC LOGINFO on AlwaysON Availability Group Database

We are attempting to monitor the size of our Log Send Queue and Redo Queue on our cluster. I know that we can monitor these via SSMS, but we'd like to come up with a way to have them monitored automatically and, if possible, send out an email alert when they get over a certain size.

I am a relative SQL newbie, and have found a way to use a SELECT statement to see the values, but I'm not sure if I can use an IF statement, or some other method to trigger an alert.

Any recommendations?

Thanks

Jon

↧

AlwaysOn Availability

June 10, 2016, 12:27 pm

≫ Next: SQLServer HA question

≪ Previous: Monitoring Log Send Queue / Redo Queue Sizes

Hi,

There are two questions:

Under what circumstances failover will happen in AlwaysOn?

Under what circumstances failover will not happen in AlwaysOn? (example: DB corrupted/diskfailure and so on?)

↧

SQLServer HA question

June 11, 2016, 9:29 pm

≫ Next: AlwaysOn Availability Groups feature Issue

≪ Previous: AlwaysOn Availability

Hello,

I have 2 instances of SQLServer 2012 Standard Edition installed and they are both installed on two seperate servers. Lets say Insatnce A and Instance B. Without adding them to a cluster and getting a virtual name, is it possible create an A-record in DNS and call it Instance C and use it is a reference to either InstanceA or InstanceB alternatively?

Thanks for the inputs.

↧

AlwaysOn Availability Groups feature Issue

June 12, 2016, 11:30 pm

≫ Next: Cluster Services for database

≪ Previous: SQLServer HA question

I am getting below error when i try to configure High Availability and i tried to solve this issue but i couldn't. Anyone have solution for this issue?

I am using SQL Server 2016 Enterprise version, Windows Server 2012 both are 64 bit.

The AlwaysOn Availability Groups feature requires the x86(non-Won) or x64 Enterprise Editon of SQL Server 2012 (or later version) running on Windows Server 2008 (or later version with WSFC hotfx KB 2494036 installed. This SQL Server editon

and/or Windows Server system do not meet one or more of these requirements. For more information about prerequisites, restrictions, and recommendations for AlwaysOn Availability Groups, see SQL Server Books Online.

Thanks

↧

Cluster Services for database

June 12, 2016, 10:44 pm

≫ Next: AlwaysOn - cluster lease timeouts and PREEMPTIVE_HADR_LEASE_MECHANISM

≪ Previous: AlwaysOn Availability Groups feature Issue

TechNet Support Team,

Please can you reconfirm if the SQL Server Std versions that supports Cluster services. OR cluster & replication services are part of Enterprise Edn.

↧