
Online Backup - Server Bare Metal Backup & Restore
SecuriData Online Backup - Server Bare Metal Backup & Restore
A Bare Metal Restore is the restore of an entire machine (usually to replacement hardware and to a new Operating System installation).
SecuriData Online Backup - Server Bare Metal Back & Restore (BMR)
To be able to perform BMR, you must protect the target computer with a single backup set that contains:
· System State and Services Database
· Operating System Volume
NOTE: Some applications can be installed and can store data across multiple volumes. You must include all of those volumes in this online backup set if you want to be able to restore those applications.
Depending on the applications running on the target computer, you may need to stop them for the initial backup, in order to facilitate a smooth BMR. To protect those services after the initial backup and when they are running (i.e. in “hot” mode), you must create separate backups with the corresponding special backup sets (e.g. "MS SQL Server Database Backups").
Details
The reason you must perform at least one online backup of the target computer with the service applications stopped is to ensure that any locked files used by those applications are backed up at least once. Otherwise, those services will not start after you perform the BMR.
In order to restore from the special backup sets (e.g. MS SQL Server), SecuriData’s Online Backup Service needs to connect to a running service. If the service cannot start, you must repair that service installation with the corresponding Installation Package.
In general, the steps for BMR of a Windows computer running MS SQL Server are:
1. Assemble the replacement hardware for the target BMR computer.
2. Re-install the same Windows Operating System (you do not need to apply any Service Packs).
3. If the Hardware is different from the original computer, you must install the corresponding hardware drivers.
4. Restore from the target computer’s backup set (including Operating System Volume, System State and Services Database).
5. Restart the MS SQL Server (if applicable).
6. Restore the latest version of MS SQL Server from the separate special backup set (if applicable).
Bare Metal Restore of Windows 2003
Summary
Bare Metal Restore is a restore process when the whole machine is being restored. Usually this type of restore is performed after a complete hardware failure or hardware upgrade.
You must have backed up the machine’s System State and Services Database.
Microsoft recommends using the same hardware for the destination machine.
· You can restore Windows 2000/2003 machines to different hardware.
· Restoring a Windows NT4 machine to different hardware is not possible (unless some initial preparations are made). However, if the differences are minor (e.g. different SCSI controller, NIC) the restore may be successful.
Switching between different computer types (ACPI to standard computer type and vise versa) is not allowed. For more information you can check Microsoft’s knowledge base article number 237556.
1. Determine your computer type:
Before starting the restore you can check your computer type. Here is how you can determine your computer type.
Click Start, point to Settings, click Control Panel, and then double click System.
Click the Hardware tab, and then click Device Manager to view what is listed under the computer branch.
Here are the descriptions of the computer types and the associated hal.dll:
· ACPI Multiprocessor PC – halmacpi.dll
· ACPI Uniprocessor PC – halaacpi.dll
· Advanced Configuration and Power Interface (ACPI) PC – halacpi.dll
· MPS Multiprocessor PC – halmps.dll
· MPS Uniprocessor PC – halapic.dll
· Standard PC – hal.dll
· Compaq SystemPro Multiprocessor or 100% Compatible – halsp.dll
There are other types available but that would require Hardware vendor specific installation (involving customized hal.dll).
2. Restoring from one computer type to another:
In some cases it is necessary to perform bare metal restore to a machine of different type. In this case after successful restore, after the reboot you will not be able to start your machine. Regardless what backup software you are using the result would be the same – your computer would not start properly or at all.
SecuriData has a solution for this problem:
· In the last screen of the restore wizard, DS-Client will ask you if you want it to save some files in order to preserve them. If you answered yes, they will be saved with an extra extension called asigra-date-time. The files are – hal.dll, ntoskrnl.exe, ntkrnlpa.exe, kernel32.dll, ntdll.dll, win32k.sys, winsrv.dll, setup.log (in \WINNT\repair directory), and boot.ini.
· After the restore is finished, you are prompted to reboot the target computer. Do so.
· If your computer boots normally, no intervention is required. If your computer does not boot, or crashes, you have to follow the steps outlined below:
· Reboot the computer from the Windows 2000 installation CD, and start a repair process using the recovery console.
· Once you have the command prompt, change to the system32 directory (by typing cd system32), and type dir *.*.asigra*
· You should see the 7 saved files. Copy each one of them to their original name. For example if you have hal.dll.asigra-022403-0800, you should type copy hal.dll.asigra-022403-0800 hal.dll. Confirm the overwrite and continue with the rest of the files.
· Type cd dllcache. Type dir *.*.asigra*. You may see 2 more files. Copy them to their original names.
· Change to the root of C:\ drive by typing cd \
· Type dir *.*.asigra*. You will see one file. Copy it to the original name (boot.ini).
· Change to C:\WINNT\repair by typing cd \winnt\repair. If your %SystemRoot% is located on a different drive or has a different name, change to the proper location.
· Type dir *.*.asigra*. You will see one file. Copy it to the original name (setup.log). You will need this file in case you need to perform repair.
· After successfully copying all the files type exit, which would reboot the machine. Boot normally, and your machine should be up and running.
There would be a rebuild of the plug and play database, and you may be prompted to provide drivers for the new hardware found. Follow the instructions on the screen. When prompted to reboot, select NO and finish installing all drivers. Set the TCP/IP details for your network adapter(s).
Now you have to fix your computer type. If you did not check the computer type of the destination machine (the one that you are restoring to), you can find out the type by checking the properties of the hal.dll. Open your Windows explorer and go to %SystemRoot%\system32 (%SystemRoot% is usually located in C:\WINNT). Locate the file hal.dll with the extra asigra extension. It could be something like hal.dll.asigra-022403-0800. Right click on it and select properties. Click on the "Version" tab, and from "Item name" select "Internal name". In the "Value" field you would see the original file name. In step 1 above there is a description for each computer type and its corresponding hal.dll.
Now you have to change your computer type to the right one (if it is not already so). Click Start, point to Settings, click Control Panel, and then double click System. Click the Hardware tab, and then click Device Manager to view what is listed under the computer branch. Right click on the computer type, select Properties, and select "Driver" tab. Click on "Update driver" button. Click on "Display a list of the known drivers for this device so that I can choose a specific driver" radio button and click "Next". Select the right model for your computer (the one that you discovered by checking the original hal.dll properties) and click "Next". Finish and when prompted to reboot do so.
3. Restoring Active Directory in a multiple server tree.
Active directory is part of the System State and can be restored only as a System State restore. System State would restore the registry as well. You can perform 2 types of AD restore – normal and authoritative. The AD objects restored through normal restore, would be overwritten by the AD replication. The restored objects would have their original "update sequence number". These USNs are used by the replication process to detect and propagate AD changes among the servers in the tree. Because of this the restored data would look old to the AD replication and the data would never get replicated. In fact the replication process would overwrite it. When is the normal (nonauthoritative) restore useful? It is good for single server trees, or when you want to restore the registry, but not to overwrite the Active Directory.
However in case you need to restore an older version of the Active Directory (but not older than 60 days), you have to perform an authoritative restore. Below is a description of each restore type:
Authoritative restore - Select "Authoritative restore" in the last screen of the Restore Wizard. This will perform authoritative restore of both AD and FRS.
The following list of requirements must be met before you perform an authoritative FRS restore:
a) The FRS service must be disabled on all downstream partners (direct and transitive).
b) Events 13553 and 13516 have been logged in the FRS event log.
c) The computer that is configured for the authoritative restore is configured to be authoritative for all the data.
d) All other partners in the replica set must be reinitialized with a non-authoritative restore.
To reinitialize the partners with a non-authoritative restore, stop the FRS service, configure the BurFlags registry key, and then restart the FRS service. To do so:
· Click Start, and then click Run.
· In the Open box, type cmd and then press ENTER.
· In the Command box, type net stop ntfrs.
· Click Start, and then click Run.
· In the Open box, type regedt32 and then press ENTER.
· Locate the following subkey in the registry: HKEY_LOCAL_MACHINE\System\CurrentControlSet\Services\NtFrs\Parameters\Backup/Restore\Process at Startup
· In the right pane, double click BurFlags.
· In the Edit DWORD Value dialog box, type D2 and then click OK.
· Quit Registry Editor.
· Perform this on every partner.
After rebooting the machine you just restored, start the FRS service on all the partners by typing "net start ntfrs" in the command box. They will synchronize from the restored machine.
Normal restore - default - "Authoritative restore" is NOT selected. This will perform non-authoritative restore for both Active Directory and NTFRS.
You can still perform Non-authoritative restore with DS-Client and then manually run ntdsutil. However this would not work if you are performing bare metal restore where the destination for restore doesn't have AD. If this is the case, let DS-Client perform authoritative restore. DS-Client is running several activities that you can not perform when running manually ntdsutil.
Note: After successful restore you should reboot the machine. Once the machine is restarted the plug and play database is reinitialized and most probably you will be prompted for a reboot. Please do not reboot immediately, wait and make sure the Active directory and FRS are synchronized.
Please note that replication may take some time before the restored data would be propagated. If you are testing authoritative restore – please note that objects added to the AD after the backup, would not be removed from AD after the authoritative restore. For more information see Microsoft’s knowledge base article 216243.
4. Restoring Active Directory on a server that is not the PDC when the PDC is missing.
There are some scenarios when you have to restore AD on a domain controller when the PDC is missing. In this case there are several steps that should be performed in order to get the AD in 100% running condition.
If the destination machine is a domain controller already, you have to boot into Directory Services Restore Mode. If the destination machine is a stand-alone Windows 2000/2003 computer you can start the restore immediately.
Select "Authoritative restore" in the last screen of the Restore Wizard. This will perform authoritative restore of both AD and FRS.
After successful restore of the System State reboot the computer when prompted. Please note that you may have to perform the steps outlined above in case you are restoring to different computer type.
There are a few articles from Microsoft describing the whole process:
216498 - HOW TO: Remove Data in Active Directory After an Unsuccessful Domain Controller Demotion
248410 - Error Message: The Account-Identifier Allocator Failed to Initialize Properly
234790 - HOW TO: Find Servers That Hold Flexible Single Master Operations Roles
223787 - Flexible Single Master Operation Transfer and Seizure Process
Outlined below is the whole process described step by step.
4.1. Login to the restored machine (after the normal reboot) and start a command prompt. Start ntdsutil:
Type "domain management"
Type "connections"
Type "connect to server ServerName" (where ServerName is your current computer name)
Type "quit"
Type "select operation target"
Type "list roles for connected server"
You should get information about what roles where associated with the server.
Type "list sites" and note the number for the site you are dealing with
Type "select site #" (where # is your site number. e.g. 0, 1, ...)
Type "list domains in site" and note the domain number
Type "select domain #" (where # is your domain number. e.g. 0, 1, ...)
Type "list servers in site" and note the number for the missing PDC
Type "select server #" (where # is your server number. e.g. 0, 1, ...)
Type "list naming contexts" and note the number for the CN=Configuration
Type "select naming context #" (where # is your naming context number. e.g. 0, 1, ...)
Type "quit"
Type "quit"
Type "roles" and you will be at the "fsmo maintenance:" prompt.
Now you must "seize" all 5 rows. ntdsutil will attempt to "safe transfer" the roles, but the transfer will fail since the PDC is missing. Just ignore the errors, because seizure will succeed:
Type "seize domain naming master"
Click "Yes" at the pop up message
Type "seize infrastructure master"
Click "Yes" at the pop up message
Type "seize PDC"
Click "Yes" at the pop up message
Type "seize RID master"
Click "Yes" at message prompting you to synchronize and "Yes' to confirm the operation
Type "seize schema master"
Type "quit"
Type "quit" to exit ntdsutil
4.2. Now you have to delete all references to the missing server. This could be accomplished using "ADSI edit". This utility is free and is located on the Windows 2000 Server installation CD. It is in \support\tools. Just install the tools.
Start ADSI Edit. If you launch ADSI edit immediately after logon, you may receive an error that the domain does not exist. Wait a minute or so and try again.
In "Domain NC..." folder, expand until you reach "OU=Domain Controllers". Locate the missing server and delete it. Redo the delete if you get an error or partial delete.
In "Configuration Container...", expand until you reach CN=Sites, CN=SiteName - CN=Servers. Locate the missing server and delete it.
The last step is to set the Global catalog. In "Active Directory Sites and Services" expand the site all the way down until you reached NTDS settings for the server that you restored. Right click on "NTDS Settings" select "Properties", check "Global Catalog", click "Apply", and close this dialog box.
4.3. Now you are ready to reboot. Reboot the server.
4.4. Examine all the logs for any errors. You should not have any errors. If this is not the case you may have incompatible hardware or device drivers. Usually the error would give you an indication what the problem is and you must be able to solve them.
4.5. Your server is now ready.
Note: If there are no other domain controllers to synchronize with, you will have problems. For an explanation of how to fix this, refer to the next paragraph 5 (Restoring Active Directory from a multi server to a single server environment, or when the rest of the domain controllers are missing).
5. Restoring Active Directory from a multi server to a single server environment, or when the rest of the domain controllers are missing:
You will have SAM errors if other domain controllers are missing. If you do not plan to ever start the other controllers, then you should remove them from Active Directory. Use ADSI Edit as explained in step 4 above.
From "Active Directory Sites and Services" delete "Connection" from NTDS Settings.
After Service Pack 3, the FSMO role holder would not assume its FSMO role unless it had successfully completed at least one sync.
Microsoft changed this in order to avoid multiple FSMO owners in a domain, and to prevent the Domain Controller from issuing duplicate RIDs.
As described in the previous steps if you seize the RID Master role it will not become authoritative until it synchronizes with another DC.
During disaster recovery where you restore only one domain controller from backup, or when you remove domain controllers from AD, you may get "The account-identifier allocator failed to initialize properly..." errors in the System log. The error would be from source SAM and Event ID 16650. Dcdiag may report problems with RID Manager or corrupted data.
There are 2 solutions:
a) You can restore another domain controller, or
b) You can delete the replication links
In case you want to keep only one domain controller, you have to delete the replication links, otherwise the domain controller is not fully operational.
Here are the instructions for deleting the replication links:
1. You need to use the repadmin tool. It is part of the "Windows 2003 Support tools". The installation of the tools is located in \SUPPORT\TOOLS on the Windows 2003 CD.
2. Open a command prompt and type "repadmin /showreps /v"
3. You need to find out the "guid-based-dns-name of replica partner", which is displayed after "Address:" of the deleted partner.
4. Here is an extract from repadmin:
------------------------------------
C:\>repadmin /showreps /v
Default-First-Site-Name\SPEEDYNT4
DSA Options : IS_GC
objectGuid : a089478f-cffb-409f-b74b-addb83a6e93e
invocationID: a089478f-cffb-409f-b74b-addb83a6e93e
==== INBOUND NEIGHBORS ======================================
CN=Schema,CN=Configuration,DC=spdnt4,DC=local
Default-First-Site-Name\TR1
DEL:b91cd853-6af0-4c3a-9586-b5dcc8f489f7 (deleted DSA) via RPC
objectGuid: a48650dc-8ed0-4363-8c25-b41553f9248d
Address: a48650dc-8ed0-4363-8c25-b41553f9248d._msdcs.spdnt4.local
ntdsDsa invocationId: 8994f8c1-f71e-454c-ad3a-17b119621381
WRITEABLE SYNC_ON_STARTUP DO_SCHEDULED_SYNCS
USNs: 3518/OU, 3518/PU
------------------------------------
5. In the example above, the FQDN is spdnt4.local and the "guid-based-dns-name of replica partner" is "a48650dc-8ed0-4363-8c25-b41553f9248d._msdcs.spdnt4.local"
6. The restored server is called SPEEDYNT4 and the deleted partner - TR1.
7. Now you can delete the replicas. The syntax is:
repadmin /delete <naming context> <restored server name> <guid-based-dns-name of replica partner> /localonly
8. <naming context> would be for the Schema, Configuration and Domain.
10. In our example to delete the links for the schema we would type:
repadmin /delete CN=Schema,CN=Configuration,DC=spdnt4,DC=local speedynt4.spdnt4.local a48650dc-8ed0-4363-8c25-b41553f9248d._msdcs.spdnt4.local /localonly
11. To delete the links for Configuration we would type:
repadmin /delete CN=Configuration,DC=spdnt4,DC=local speedynt4.spdnt4.local a48650dc-8ed0-4363-8c25-b41553f9248d._msdcs.spdnt4.local /localonly
12. To delete the links for Domain we would type:
repadmin /delete DC=spdnt4,DC=local speedynt4.spdnt4.local a48650dc-8ed0-4363-8c25-b41553f9248d._msdcs.spdnt4.local /localonly
13. You must reboot the server after deleting all the replication links.
14. Check the System Log after the reboot to make sure there are no errors from source SAM.
6. Common mistakes:
- Slow clients logins
- Misconfigured DNS, DHCP
- If the missing server is your PDC, and also your main DNS and DHCP server, your client computers may experience slow logins or errors when trying to login to the domain. If you are using fixed IP addresses, make sure the clients DNS settings are correct. Check your DHCP server for proper settings.
