This procedure describes how to use a Restore Roll to upgrade or reconfigure your existing Rocks cluster.
If your Rocks frontend is running Rocks version 5.0 and you wish to upgrade it to version 5.1, you first need to install the rocks-command, rocks-devel and rocks-devel-env packages: For i386, execute:
For x86_64, execute:
Keep in mind, after you add these packages, your frontend will no longer be able to install compute nodes. |
Let's create a Restore Roll for your frontend. This roll will contain site-specific info that will be used to quickly reconfigure your frontend (see the section below for details).
# cd /export/site-roll/rocks/src/roll/restore # make roll |
The above command will output a roll ISO image that has the name of the form: hostname-restore-date-0.arch.disk1.iso. For example, on the i386-based frontend with the FQDN of rocks-45.sdsc.edu, the roll will be named like:
rocks-45.sdsc.edu-restore-2006.07.24-0.i386.disk1.iso |
Burn your restore roll ISO image to a CD.
Reinstall the frontend by putting the Rocks Boot CD in the CD tray (generally, this is the Kernel/Boot Roll) and rebooting the frontend.
At the boot: prompt, type:
build |
At this point, the installation follows the same steps as a normal frontend installation (See the section: Install Frontend) -- with two exceptions:
On the first user-input screen (the screen that asks for 'local' and 'network' rolls), be sure to supply the Restore Roll that you just created.
You will be forced to manually partition your frontend's root disk.
You must reformat your / partition, your /var partition and your /boot partition (if it exists). Also, be sure to assign the mountpoint of /export to the partition that contains the users' home areas. Do NOT erase or format this partition, or you will lose the user home directories. Generally, this is the largest partition on the first disk. |
After your frontend completes its installation, the last step is to force a re-installation of all of your compute nodes. The following will force a PXE (network install) reboot of all your compute nodes.
# ssh-agent $SHELL # ssh-add # rocks run host compute '/boot/kickstart/cluster-kickstart-pxe' |
By default, the Restore Roll contains two sets of files: system files and user files, and some user scripts. The system files are listed in the 'FILES' directive in the file: /export/site-roll/rocks/src/roll/restore/src/system-files/version.mk.
FILES = /etc/passwd /etc/shadow /etc/gshadow /etc/group \ /etc/exports /etc/auto.home /etc/motd |
The user files are listed in the 'FILES' directive in the file: /export/site-roll/rocks/src/roll/restore/version.mk.
FILES += /etc/X11/xorg.conf |
If you have other files you'd like saved and restored, then append them to the 'FILES' directive in the file /export/site-roll/rocks/src/roll/restore/version.mk, then rebuild the restore roll.
If you'd like to add your own post sections, you can add the name of the script to the 'SCRIPTS' directive of the /export/site-roll/rocks/src/roll/restore/version.mk file.
SCRIPTS += /share/apps/myscript.sh /share/apps/myscript2.py |
This will add the shell script /share/apps/myscript.sh, and the python script /share/apps/myscript2.py in the post section of the restore-user-files.xml file.
If you'd like to run the script in "nochroot" mode, add
For example
|
All the files under /export/rocks/install/site-profiles are saved and restored. So, any user modifications that are added via the XML node method will be preserved.
The networking info for all node interfaces (e.g., the frontend, compute nodes, NAS appliances, etc.) are saved and restored. This is accomplished via the 'rocks dump' command.