Recovering from Disk Movement Problems
This section describes how to recover from unexpected problems after
moving disks in the Solaris Volume Manager environment.
Disk Movement and Device ID Overview
Solaris Volume Manager uses device IDs, which are associated with a specific
disk, to track all disks used in a Solaris Volume Manager configuration. When disks
are moved to a different controller or when the SCSI target numbers change, Solaris Volume Manager
usually correctly identifies the movement and updates all related Solaris Volume Manager
records accordingly, and no system administrator intervention is required.
In isolated cases, Solaris Volume Manager cannot completely update the records and
reports an error on boot.
Resolving Unnamed Devices Error Message
If you add new hardware or move hardware (for example, move a string
of disks from one controller to another controller), Solaris Volume Manager will check
the device IDs associated with the disks that moved, and update the c*t*d* names in internal Solaris Volume Manager records accordingly.
If the records cannot be updated, the boot processes spawned by /etc/rc2.d/S95svm.sync (linked to /etc/init.d/svm.sync) will report an error to the console at boot time:
Unable to resolve unnamed devices for volume management.
Please refer to the Solaris Volume Manager documentation,
Troubleshooting section, at http://docs.sun.com or from
your local copy." |
No data loss has occured, and none will occur as a direct result of
this problem. This error message indicates that the Solaris Volume Manager name records
have been only partially updated, so output from the metastat
command will likely show some of the c*t*d* names previously used,
and some of the c*t*d* names reflecting the state after the move.
If you need to update your Solaris Volume Manager configuration while this condition
exists, you must use the c*t*d* names reported by the metastat command when you issue any meta* commands.
If this error condition occurs, you can do one of the following to resolve
the condition:
Restore all disks to their original locations. Next, do a
reconfiguration reboot, or run (as a single command):
/usr/sbin/devfsadm && /usr/sbin/metadevadm -r |
After
these commands complete, the error condition will be resolved and you can
continue.
Contact your support representative for guidance
Note - This error condition is quite unlikely to occur. If it does occur,
it is most likely to affect Fiber Channel-attached storage.
Boot Problems
Because Solaris Volume Manager enables you to mirror the root (/), swap, and /usr directories,
special problems can arise when you boot the system, either through hardware
failures or operator error. The tasks in this section provide solutions to
such potential problems.
The following table describes these problems and points you to the appropriate
solution.
Table 24-1 Common Solaris Volume Manager Boot Problems
Background Information for Boot Problems
If Solaris Volume Manager takes a volume offline due to errors, unmount
all file systems on the disk where the failure occurred. Because each disk
slice is independent, multiple file systems can be mounted on a single disk.
If the software has encountered a failure, other slices on the same disk will
likely experience failures soon. File systems mounted directly on disk slices
do not have the protection of Solaris Volume Manager error handling, and leaving such
file systems mounted can leave you vulnerable to crashing the system and losing
data.
Minimize the amount of time you run with submirrors disabled
or offline. During resynchronization and online backup intervals, the full
protection of mirroring is gone.
How to Recover From Improper /etc/vfstab Entries
If you have made an incorrect entry in the /etc/vfstab
file, for example, when mirroring root (/), the system
will appear at first to be booting properly then fail. To remedy this situation,
you need to edit the /etc/vfstab file while in single-user
mode.
The high-level steps to recover from improper /etc/vfstab file entries are as follows:
Booting the system to single-user mode
Running the fsck command on the mirror
volume
Remounting file system read-write
Optional: running the metaroot command
for a root (/) mirror
Verifying that the /etc/vfstab file correctly
references the volume for the file system entry
Rebooting
Example--Recovering the root (/) Mirror
In the following example, root (/) is mirrored
with a two-way mirror, d0. The root (/)
entry in the /etc/vfstab file has somehow reverted back
to the original slice of the file system, but the information in the /etc/system file still shows booting to be from the mirror d0. The most likely reason is that the metaroot
command was not used to maintain the /etc/system and /etc/vfstab files, or an old copy of the/etc/vfstab
file was copied back.
The incorrect /etc/vfstab file would look something
like the following:
#device device mount FS fsck mount mount
#to mount to fsck point type pass at boot options
#
/dev/dsk/c0t3d0s0 /dev/rdsk/c0t3d0s0 / ufs 1 no -
/dev/dsk/c0t3d0s1 - - swap - no -
/dev/dsk/c0t3d0s6 /dev/rdsk/c0t3d0s6 /usr ufs 2 no -
#
/proc - /proc proc - no -
swap - /tmp tmpfs - yes -
|
Because of the errors, you automatically go into single-user mode when
the system is booted:
ok boot
...
configuring network interfaces: hme0.
Hostname: lexicon
mount: /dev/dsk/c0t3d0s0 is not this fstype.
setmnt: Cannot open /etc/mnttab for writing
INIT: Cannot create /var/adm/utmp or /var/adm/utmpx
INIT: failed write of utmpx entry:" "
INIT: failed write of utmpx entry:" "
INIT: SINGLE USER MODE
Type Ctrl-d to proceed with normal startup,
(or give root password for system maintenance): <root-password>
|
At this point, root (/) and /usr
are mounted read-only. Follow these steps:
Run the fsck command on the root (/)
mirror.
Note - Be careful to use the correct volume for root.
# fsck /dev/md/rdsk/d0
** /dev/md/rdsk/d0
** Currently Mounted on /
** Phase 1 - Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
** Phase 5 - Check Cyl groups
2274 files, 11815 used, 10302 free (158 frags, 1268 blocks,
0.7% fragmentation)
|
Remount root (/) read/write so you can edit the /etc/vfstab file. # mount -o rw,remount /dev/md/dsk/d0 /
mount: warning: cannot lock temp file </etc/.mnt.lock>
|
Run the metaroot command.
This command edits the /etc/system and /etc/vfstab files to specify that the root (/)
file system is now on volume d0.
Verify that the /etc/vfstab file contains the correct
volume entries. The root (/) entry in the /etc/vfstab file should appear as follows so that the entry for the file system
correctly references the RAID 1 volume:
#device device mount FS fsck mount mount
#to mount to fsck point type pass at boot options
#
/dev/md/dsk/d0 /dev/md/rdsk/d0 / ufs 1 no -
/dev/dsk/c0t3d0s1 - - swap - no -
/dev/dsk/c0t3d0s6 /dev/rdsk/c0t3d0s6 /usr ufs 2 no -
#
/proc - /proc proc - no -
swap - /tmp tmpfs - yes -
|
Reboot the system. The system returns to normal operation.
|