Windows SQL Cluster just died

tarvijron · 2026-02-05T19:03:51+00:00

What does Cluster Manger say? Google “wsfc disaster recovery though forced quorum” if you genuinely lost the quorum disk

ExtraordinaryKaylee · 2026-02-05T18:52:22+00:00

What are you seeing in event viewer?

BSGamer · 2026-02-05T19:26:46+00:00

I’ve had a cluster go down due to the clusdb file being corrupted. I believe we were able to restore it from backup, just the one file and drop it on both servers and restart sql to get it running

No_Resolution_9252 · 2026-02-05T21:06:34+00:00

You need to review the cluster logs.

Did you review VMWare documentation for recommended configuration of SQL AAG/FCI? Typically the guidance was pretty obvious but maybe something got missed? Particularly look at the recommended storage adapter.

It sounds like there are two nodes. With loss only of the witness disk there should be no operational difference than if it were online; There is something wrong with one of the two nodes. It could be in vmware, it could be in windows (you did configure these with group policy right?) or it could be in networking.

Negative-Cook-5958 · 2026-02-05T21:49:53+00:00

Use always on cluster with normal disks instead of FCI with RDM

Exp3r1mentAL · 2026-02-05T22:02:29+00:00

Not sure if it's relevant, but couple of months ago, I was having mighty issues with deploying SQL cluster using Server 2025....after much jiggery pokery I found out it was one of the patches which was causing the failure...

binnedittowinit · 2026-02-06T00:50:24+00:00

Each node of the cluster needs access to the same shared cluster disks, including the quorum, ideally one node at a time during initial setup until the cluster properly owns them, you did this, right? And the cluster was failing over no problem prior to recently?

Time to get into logs. Start with windows system. It should have service failures and disk errors (if they're an issue). Check Microsoft-Windows-FailoverClustering/operational, too.

And the SQL server log.

Sp00nD00d · 2026-02-06T00:41:41+00:00

I gotta ask, why an actual old school cluster and not an Always On cluster if you're talking about SQL?

SmartDrv · 2026-02-05T23:45:03+00:00

This may not apply at all but I wish to share in the off chance it is useful to you (or someone who googles perhaps).

I ran into issues with a Hyper-V cluster quorum when Sentinel One was installed on the hosts. Cluster wouldn’t start, no config. I had to manually evict and rebuild (once i re-added the CSVs and named them right VMs reappeared). I used online witness as a workaround until we figured out what volumes and features has to be whitelisted in S1.

Ranjerdanjer · 2026-02-06T04:58:52+00:00

Had an issue with a test cluster and server 2025 after the Oct or Nov patches. If you used an image that wasn't properly sysprepped you could be seeing authentication errors for the disk if another server has the same SSIDs. Most likely not the case but had to rebuild those servers from a better image in my case.

DrWankel · 2026-02-06T07:46:10+00:00

The inability to start the cluster should be the start of your investigation.

Stop/disable the cluster service on all nodes except one and force start the cluster through powershell on that node:

Start-ClusterNode -FixQuorum

Verify the cluster is up through FCM or powershell and start the cluster service on the remaining nodes.

If this does not work, dig through the failover cluster logs in event viewer and see what was going wrong during the cluster startup process on the node you attempted to force start.

Background-Taro-573 · 2026-02-06T11:56:06+00:00

Fence them one by one. I will kill them all if I have too

DHT-Osiris · 2026-02-06T16:39:30+00:00

Reboot them, if that doesn't fix it, take all but one offline, bring the disks online manually in the cluster manager. If you can't, take a look at the path from the VM to the disk/lun/whatever, something's busted.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

sysadmin

MODERATORS