POOL IT Disaster Recovery Plan

Backup System Overview:

Backup is performed by several systems:

  • BackupExec – Monday through Friday tape backup of all shares, websites, and databases on all servers. Complete backups are done of POOLWEB1 and POOLWEB2.
  • Acronis TrueImage – A weekly full backup with daily incremental backups is done for POOLSBS, POOLSQL1, and POOLRMX. These are backed up to ______.
  • Volume Shadow Copy – Semi-backup solution that stores copies of changed or deleted files on file shares and in databases. Enabled on all servers with file shares.

Backup tapes are stored in the fireproof safe in the server room. One of the weekly backup tapes, as well as a hard drive with all of the _____ server images, is stored in the safe deposit box at the ______. NAME, NAME, and NAME have access to the safe deposit box. The key to the safe deposit box is in the main key box in the copier room.

Type 1 Failure (File loss and/or corruption)

In the event of a lost, deleted, or corrupt file or database, use one of the following methods for recovery (the appropriate method will vary depending on the age of the file):

  • Attempt recovery of the file using Volume Shadow Copy (enabled on all servers with shares or databases)
  • Restore the file from the ______ backup (POOLSBS, POOLSQL1, POOLRMX)
  • Restore the file from the BackupExec tape backup system
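
The recovery options above can be sketched as a lookup of which methods apply to a given server. This is only an illustration: the function name and method labels are hypothetical, the listed order is assumed to be the order in which to try them, and the image backup option is assumed to apply only to the three servers named next to it.

```python
# Servers the plan lists next to the image backup restore option.
IMAGE_BACKED_SERVERS = {"POOLSBS", "POOLSQL1", "POOLRMX"}


def type1_recovery_methods(server):
    """Return the recovery methods to try for a lost or corrupt file, in order.

    The method labels are hypothetical names for the plan's three options.
    """
    methods = ["volume_shadow_copy"]
    if server.upper() in IMAGE_BACKED_SERVERS:
        # Assumption: only these servers have an image backup to restore from.
        methods.append("image_backup")
    methods.append("backupexec_tape")
    return methods
```

For example, type1_recovery_methods("POOLWEB1") skips the image backup step, because that server is covered only by Volume Shadow Copy and tape.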

Type 2 Failure (Major Server File corruption)

If a critical software failure occurs on one of the servers and the system is unresponsive, use one of the following options to restore the system:

  • If the system will boot, attempt to restore the data using Volume Shadow Copy or BackupExec
  • If the system will not boot reliably, restore it from the previous night's ______ backup (use the ______ Boot CD); backups are located at ______. Restore any updated data files, if needed, from the tape backup (BackupExec)
  • If the system is not protected by ______ backup (POOLBACK, POOLWEB1 and POOLWEB2), re-install the operating system and restore data files from tape backup
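
The three options above amount to a short decision procedure, which could be sketched as follows. The function name, parameter names, and step wording are hypothetical, and the sketch assumes that every server not named as unprotected has an image backup available.

```python
# Servers the plan names as NOT protected by the image backup.
NOT_IMAGE_PROTECTED = {"POOLBACK", "POOLWEB1", "POOLWEB2"}


def type2_restore_steps(server, boots_reliably):
    """Return the ordered restore steps for a Type 2 failure on one server."""
    if boots_reliably:
        # System still boots: recover the data in place.
        return ["restore data using Volume Shadow Copy or BackupExec"]
    if server.upper() in NOT_IMAGE_PROTECTED:
        # No image backup exists, so rebuild and then restore from tape.
        return [
            "re-install operating system",
            "restore data files from tape backup",
        ]
    # Assumption: all remaining servers have an image backup.
    return [
        "restore previous night's image backup using the boot CD",
        "restore any updated data files from tape backup if needed",
    ]
```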

Type 3 Failure (Major Server Hardware Failure)

In the event of a major server hardware failure (hard drive, motherboard, etc.), contact ______ ASAP for a replacement part. The length of time needed to get the replacement part will determine the course of action:

  • Same Day – If the replacement part can be delivered the same day, no relocation of the data needs to occur. Replace the defective part and test the system
  • 1-3 Days – On a critical system (POOLSBS, POOLSQL1, POOLRMX), use the _____ Universal Restore license to either restore the system to a replacement server (if ______ provides one) or set up Microsoft Virtual PC on a server and restore to a VM
  • Non-Critical Systems – On systems that are non-critical or redundant, wait for replacement hardware to become available and replace it
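
The choice between these courses of action can be sketched as a function of the server and the expected wait for the part. The function name and return labels are hypothetical. The plan only covers a wait of up to 3 days for critical systems, so treating any longer wait the same way is an assumption made here.

```python
# Critical systems as named in the plan.
CRITICAL_SYSTEMS = {"POOLSBS", "POOLSQL1", "POOLRMX"}


def type3_action(server, days_until_part):
    """Pick a course of action for a Type 3 (hardware) failure.

    days_until_part: whole days until the replacement part arrives (0 = same day).
    """
    if days_until_part < 0:
        raise ValueError("days_until_part must be zero or greater")
    if days_until_part == 0:
        # Same day: no data relocation needed.
        return "replace_part_and_test"
    if server.upper() in CRITICAL_SYSTEMS:
        # Assumption: waits longer than 3 days are handled like the 1-3 day case.
        return "restore_to_replacement_server_or_vm"
    # Non-critical or redundant systems simply wait for the hardware.
    return "wait_for_replacement_hardware"
```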

Type 4 Failure (Multiple Server Failure)

If a major catastrophe occurs in which multiple servers are damaged, contact ______ ASAP to order replacement servers and/or network equipment. The extent of the damage to the servers and/or the building will determine whether temporary relocation is needed. Offsite relocation of the servers would occur at ______ until suitable repairs or relocation of the POOL offices is complete.
