Quantcast
Channel: SQL Server High Availability and Disaster Recovery forum
Viewing all 4532 articles
Browse latest View live

SQL 2014: AlwaysOn Replica status shows unknown

$
0
0
We have installed with SQL server 2014 SP2 with 2 node AlwaysOn, SQLAAG1 and SQLAAG2. Replica (Secondary) node shows "unknown". As I am aware on 2012 we have this issue and MS has provided CU. Request for guidance. Thanks in advance.

SQL Server Always On:Failure on Secondary Server with Various errors but why Primary wan't accessible?

$
0
0

Hello Experts,

I ran into  issue on SQL 2012 SP4 Environment where secondary server went down and when i checked the logs found exact somany different errors which was listed below.
Primary server was always up and running but not sure what caused users to to stop the access on Primary database. 
Also SQL account on Primary which was used for connection tuned into orphaned user when i checked so had to remap on Primary. 

Though everything went   back to normal when secondary server came up but why users were not able to access primary server though it was up and running. 

Any idea what could have happened to Primary even though it was all time up and running? 
Did It tried to fail over on Secondary some reason?

*******************************************************************

Error Log as below:

Failed to update Replica status within the local Windows Server Failover Clustering (WSFC) due to exception 41034.

Failed to update Replica status within the local Windows Server Failover Clustering (WSFC) due to exception 41005.

Error: 41144, Severity: 16, State: 8.

The local availability replica of availability group ' ' is in a failed state.  The replica failed to read or update the persisted configuration data (SQL Server error: 41034).  To recover from this failure, either restart the local Windows Server Failover Clustering (WSFC) service or restart the local instance of SQL Server.

Remote harden of transaction 'user_transaction' (ID 0x000000005c947182 0000:28d9d77f) started at Feb  6 2018  5:10PM in database ' ' at LSN (15129:292464:157) failed.

Error: 41143, Severity: 16, State: 2.
Cannot process the operation.  The local replica of availability Group ' ' is in a failed state.  A previous operation to read or update persisted configuration data for the availability group has failed.  To recover from this failure, either restart the local Windows Server Failover Clustering (WSFC) service or restart the local instance of SQL Server.

Failed to obtain the Windows Server Failover Clustering (WSFC) resource handle for cluster resource with name or ID '567e9cfe-6a0a-4661-b270-b3c717ca0862' (Error code 1722).  The WSFC service may not be running or may not be accessible in its current state, or the specified cluster resource name or ID is invalid.  For information about this error code, see "System Error Codes" in the Windows Development documentation.

The state of the local availability replica in availability group ' ' has changed from 'PRIMARY_NORMAL' to 'RESOLVING_NORMAL'.  The state changed because the local instance of SQL Server is shutting down.  For more information, see the SQL Server error log, Windows Server Failover Clustering (WSFC) management console, or WSFC log.

The Cyclic Redundancy Check (CRC) value generated for the retrieved availability group configuration data does not match that stored with the data for the availability group with ID '%.*ls'. If this is a WSFC availability group, the availability group data in the WSFC store may have been modified outside SQL Server, or the data is corrupt. If the error persists, you may need to drop and recreate the availability group.

Failed to validate the Cyclic Redundancy Check (CRC) of the configuration of availability group '%.*ls'. The operation encountered SQL Server error %d, and the availability group has been taken offline to protect its configuration and the consistency of its joined databases. Check the SQL Server error log for more details. If configuration data corruption occurred, the availability group might need to be dropped and recreated.

AlwaysOn: The local replica of availability group is preparing to transition to the resolving role in response to a request from the Windows Server Failover Clustering (WSFC) cluster.

A connection timeout has occurred on a previously established connection to availability replica

Thank you. 



Thank you very much for your time and effort to answer this post. Please Mark As Answer if it is helpful. \\Aim To Inspire Rather to Teach Best -Ankit

Login issue for one Instance on failover node

$
0
0

Hello,

We have two fail-over node 01 and 02 and three instances (Ins1, Ins2 and Ins3) on these two nodes.

I can able to login to all three instances from node 02 using my windows authentication but i can't login to only Ins3 on node 01.

I am receiving error as below while trying to connect to Ins3 on node 01.  After that error i can't login to node 01 and my reference account get locked as well. Any help would be greatly appreciated.



Change large transaction log file initial size

$
0
0

Hello,

Can I change a database' initial transaction log file size. The file size is 42GB and I need it to be more like 1 MB as it is presently making my server's C partition run out of space. Can I change the initial size as it appears on the Files page in Database Properties?


Dr. Jan Vanderpool

Cannot Add DB on Availability Database Group

$
0
0

Hi Expert,

I use SQL Server 2016 Always On feature, i found at dm_hadr_availability_replica_states atcolumn recovery_health_desc = ONLINE_IN_PROGRESS, what does it mean? therefore I cannot add new Database into Availability Database Group (Primary) and get error:

Msg 41190, Level 16, State 8, Line 6 Availability group 'AOG1' failed to process add-database command. The local availability replica is not in a state that could process the command. Verify that the availability group is online and that the local availability replica is the primary replica, then retry the command

what should I do?

thanks so much for all the answer.

best regards,

Urbel.


SQL AlwaysON - Listener

$
0
0

Hi,

I have configured a SQL alwaysON availability group for set of DBs and a listener to connect to. 

Can I use this listener name in my application configuration? will this work seamlessly whenever I do failover and failback?

I am able to connect to SSMS using listener name , this tells me configuration seems to be fine. How can I make sure my application supports connecting to DB using listener name ? 

please help.

Windows Server 2012 R2 Cluster Mount Points and Cluster Shared Volumes SQL Server 2016 Error

$
0
0

We have an environment as follows:

2 node SQL Server Cluster running SQL Server 2016 instances.  The failover cluster is running on Windows Server 2012 R2.  All storage resides in a Compellant SAN.  We have just about run out of drive letters for new instances and want to move to mount points but I have run into an issue when trying to change the mount point volumes to Cluster Shared Volumes. 

I used Microsoft Knowledgebase document 280297 and 819546 as reference to setup my Mount Points. 

On each node of the cluster, I set up a mount point Root Disk and assigned it the same drive letter.  Created the mount point volumes and mapped them to the Root Disk.  Added the disks as disk resources for the cluster. However, I believe, I need to change the disks as a Cluster Shared Volume resource so each node in the cluster can have access to it during a failover.  

In FC Manager, I changed the mount point from Available Storage to Cluster Shared Volume.  When I click on the Mount Point in Windows Explorer I get a "Location is not available" error.  I then looked at the disk in Data Management and the"Mount in the following empty NTFS folder" to the Root Disk is gone. 

I attempted adding the disks as CSV's in two ways, first was adding both the root disk and the mount point disk/volume as CSV's.  Then I removed them both as CSV's and added only the mount point disk/volume.  Either way, I get the same error.

This is my first time setting up mount points in a cluster, I have looked at so many websites and have't found any resolution.  Any advice will help.

Change DR Cluster to Always ON mode (for Load balancing which is transparent to the Application/ETLs)

$
0
0

In our org. we have a SQL Server DR Cluster set-up, with shared SAN.
there are two nodes (Node1 - 10.10.200.114 & Node2 - 10.10.200.115)
And we have 7 DBs which is using by various applications (SSIS ETLs).
The DR Cluster Name is ClustDRDwh00034, which is using by all the applications.

This is an Active/Standby setup in which, Node1 should be active and if DR incidents happen the the SAN
should be detached from Node1 and attached to Node2 for enable the high availability.

This architecture we need to change, for improve the performance as the Active node load is very high.
The applications (SSIS ETLs) having very complex queries, which may run in parallel, creating resource contention issues.

We are thinking to change this DR Cluster to Always-ON Availability Mode, for mainly .

We are thinking below approaches, please let me know your suggestion/confirm whether this is possible or not...
1. make the two Nodes Active
2. Distribute the DBs into both the Nodes
3. Create two Availability groups, as below

 Node1
  AG1(DB1,DB2,DB3,DB4 - Primary)
  AG2(DB5,DB6,DB7 - Secondary)

 Node2
  AG1(DB1,DB2,DB3,DB4 - Primary)
  AG2(DB5,DB6,DB7 - Secondary)

4. Create a listener with the name same as existing DR Cluster name (ClustDRDwh00034), for AG1 & AG2.
5. All the applications/ETLs need to use the same instance name (ClustDRDwh00034) to connect for DB operation. This will
reduce the application changes, as we are using the same DR Cluster Name as the listener.


SQL Server AlwaysOn Availability Group - Add Azure Node

$
0
0

Hello,

We currently have SQL Server AlwaysOn Availability Group on premise.  It's only a 2-node WFSC.  All nodes are 2012 Windows
Server and SQL Server 2012 Enterprise.  I'm in the process of trying to extend the on-prem SQL Server Avaibility Group to
Azure by adding an Azure VM node to make it a 3-node WFSC.  When I got to the step of validating this node in the Windows
Failover Cluster, one of the failed steps was the Operating System being different.  This Azure node is Windows Server 2016
with SQL Server 2016 Developer.  My questions is: does anyone know if my Azure node/vm need to match the same
hardware specs as those two nodes on-prem?

Thanks

how to re-configure mirroring with new node

$
0
0

Hi,

I have single server on site and one server at DR Site. On-Site and DR site has mirroring. Now i am creating two nodes cluster at on-site. but i want to use existing DR server and database so that once i create new cluster so that i re-establish mirroring with new cluster.

pls guide is this possible to break current mirroring with existing server and re-establish mirroring with newly server without losing data at onsite and DR site.

Pls guide

thx


iffi

Suppressing message 978 in an Availability Group

$
0
0

We are running with Availability Groups in SQL2012 SP1 with a primary and a secondary replica that will allow access if you use application intent read only which we are not doing. We probably have some automatic polling by SCOM 2012 that throws the 978 message all the time causing a large SQL errorlog full of these messages and not much else.

is there a way to suppress these 978 messages?

Thanks

Chris 

basic availability group auto failover in sql 2016

$
0
0

Hi,

i've setup a test environment with 2 win 2012 r2 std and 2 sql 2016 std with cluster and basic availability group with 1 listener and 2 availability group that every availability group has different database in it.i redirect the client to connect to the listener and disable the network card in the primary availability group a failover is occur and both database are active and function in the slave server (now he become the primary) which is very good but according to this article https://blogs.technet.microsoft.com/msftpietervanhove/2017/03/14/top-5-questions-about-basic-availability-groups/

in section 6 it shouldn't work for the second availability group to automatically failover.

so my question is before i make this basic availability group in my production - should it failover automatically like it is works in my test environment or it shouldn't?

THX


SQL Cluster Resource Group 'Failed' but Cluster working fine

$
0
0
Our SQL cluster, for many months, has "‘Failed” status under Roles > Resource Group. But the cluster is working fine!

In the System event log, every hour, there is an Event ID 1069 error (Source: FailoverClustering): "Cluster resource 'SQL Server CEIP (MSSQL2016)' of type 'Generic Service' in clustered role 'RsrcGrp-MSSQL2016' failed." I have taken corrective steps given in MSKB: #883732, but it has made no difference.

On running Cluster Validation tests, we see only a Warning. It simply repeats what is in the event log.
BTW, this cluster is based on 2 VMs in VMware infrastructure.

How may I get rid of Failed state of Resource group.

Recommended way of running DBCC commands in a 24/7 highly available performance sensitive DB with Always on AG?

$
0
0

Hi All,

What would be the recommended way of running a DBCC command in a highly performance sensitive 24/7 Always on AG environment?

I believe physical_only will not help us (as it is a highly performance sensitive application)

Archtecture:

2 Nodes with synchronous replica in primary DC and 1 Node  with A-sync replica in secondary DC.

SQL Server 2016 (SP2)

Windows Server 2016


Regards, Ashif Shaikh


SQL Server Snapshot Replication error 22018

$
0
0

I am trying to get snapshot replication between two MSSQL servers to work but am receiving errors. Any help will be much appreciated!

My setup: Publisher/Distributor: SQL 2008 R2 Subscriber: SQL 2016

I am just trying it with a simple test table named Cars containing:

Car_ID   Car_Name1   Hummer2   Porsche3   Tesla4   Bently5   Lotus Elise6   Mini

In the Replication Monitor I get the following errors:

Error messages: The process could not bulk copy into table '"dbo"."Cars"'. (Source: MSSQL_REPL, Error number: MSSQL_REPL20037) Get help: http://help/MSSQL_REPL20037 metadata query failed Invalid character value for cast specification (Source: MSSQLServer, Error number: 22018) Get help: http://help/22018 Unspecified error


Always On Availability Group and SQL Replication Subscription DB

$
0
0

Hi, I have one DB (DBIMALON) which subscribes the data from another DB as part of SQL Server Replication.

DBA recommended some architecture change to convert the current DR Mode to Always-ON mode.

As part of this Architecture change this DB (DBIMALON) needs to be add into Always ON Availability Group.

Just want know is this possible to add DBIMALON into AAG, as this DB is a Replication Subscriber?

Free space on the disk is not recognized as unallocated spaces in the database

$
0
0

I have a database that was running out of space. I deleted millions of old historical records and dropped couple of highly fragmented indexes in one of the largest table and recreated them. I was able to open up about 75 Gig on the disk by doing so. But , when I look at the database property files , I only see 241 Meg as unallocated spaces and none of my jobs are able to run, because I receive insufficient disk space error message. What do I need to do to allocated the free disk space to the database ? Please help! 

Thanks, 

SQL 2017 Cluster on Windows 2019 Core

$
0
0

Hi Experts,

Am trying to setup a SQL 2017 Cluster on Windows 2019 Core. However am unable to find the right powershell scripts for it as it is the first time doing in for core.

Request you to please guide for the windows 2019 core SQL 2017 Cluster setup.

Thanks

SQL SERVER – AlwaysOn Automatic Seeding – Database Stuck in Restoring State

$
0
0

Hi,

After adding a database to the AG, node 1 says SYNCHRONIZED, but on node 2 the database is still in a restoring state.

The AG Dashboard says:

- Data synchronization state of availability database is not healthy

- Secondary database is not joined

Note: There are a lot of SQL logins for this database that are present on node 1, but not on node 2.

Any help greatly appreciated. Thanks!

Last_Backup_File is Null in msdb.dbo.log_shipping_monitor_primary

$
0
0
Hi There,

I have log shipping setup for a SQL Server 2005 x64 SP2 Ent. Ed. database using a monitor server. I restored the database on the secondary server myself and synched it with primary before setting up log shipping. I don't have any issues with Log shipping, the tx log is backed up, copied over to the secondary and restored.

However, last_backup_file, last_backup_date and last_backup_date_utc columns in log_shipping_monitor_primary table are all Null. I checked the log_shipping_monitor_secondary table and it shows the correct and updated information for last_copied_* and last_restored_* columns.

I tried running sp_refresh_log_shipping_monitor with @agent_type=0 on primary with @mode as both 1 & 2 but it did not help in any way. I deleted the log shipping and set it up again from scratch but the same issue remains.

I have 8 other databases using that monitor server but they do not have this issue. The only difference being that the rest of databases are on 32 bit Std Edition.

Does anyone has any idea on how to resolve this? Basically, I need the last_Backup_file to ftp the logs to our backup center.

Thanks!
Viewing all 4532 articles
Browse latest View live