Getting to Patch Automation
Our App Supp team recently took over Windows Patching. SQL cluster patching proved a dilemma because we can’t failover the active node during business hours. We couldn’t put the SQL clusters on the regular schedule for other servers because we had to patch the passive then failover to the active and patch the active.
Simplified manual process for Maintenance Window 1:
Place SQL clusters into SCOM maintenance mode
Identify active & passive nodes in clusters
Patch the passive node
Reboot passive servers
(Maintenance Window 2)
Place SQL clusters into SCOM maintenance mode
Failover the active node (DBAs screamed over this one!!)
Patch the active node
Reboot active node servers
Verify SQL cluster service is online
The manual process took the whole maintenance window and we’d have to stop and continue at a different window. Since this was all manual, this meant our team was putting in 12+ hrs on these days. We had servers that hadn’t been updated since 2013. The word “manual” is deceptive beause it encompasses this --> remote into each individual SQL cluster server, check the status on each one, failover, check for updates, check when updates are found, download/install updates, check when install is done, reboot the server, wait for server to come back, verify updates, check for more updates (because we were very behind!), repeat the process until no more updates found then wait for the server to reboot one last time to verify updates and services online - this took a ridiculous amount of hours for just one server multiplied by how many servers we had to do.
When we reviewed complete automation using SCOM and SCCM, our team determined that it would take long to develop the solution (something we’d work towards) but we needed a half-way solution to cut our manual process NOW.
We used a PowerShell script to handle the failover (remote execution is not allowed so we still had to log on to each server to run it). We were still manually updating the servers though, which required keeping a manual eye on downloads, installs, reboots, download again, install again, reboot again (repeat until we get caught up on updates).
While we tailor our fully automated process, we will pilot this half-way method:
Semi-Automated SCCM SQL Cluster Patching Process
Create two groups for SQL Cluster...one dubbed Active Node Group and one dubbed Passive Node Group
SCCM patches and restarts the passive node group
We manually failover cluster and verify failover success
SCCM patches Active node Group and reboots










