Home
Dell OpenManage Server Administrator Managed Node for Fluid Cache for DAS Messages Reference Guide
Contents
1. 63 3 Storage Management Message Reference 65 Alert Monitoring andLogging 65 Alert Message Format with Substitution Variables 66 Alert Message Change History 69 Alert Descriptions and Corrective Actions 70 4 System Event Log Messages for IPMI Systems 247 Temperature SensorEvents 247 4 Contents Voltage SensorEvents 249 Fan SensorEvents 4 251 Processor Status Events 253 Power SupplyEvents 255 Memory ECCEvents 260 BMC Watchdog Events 261 Memory Events 0 2 4 262 Hardware Log SensorEvents 264 Drive Events aaau aaa aa aaa 265 Intrusion Events aaa aa aaa 267 BIOS Generated SystemEvents 268 Operating System Generated System Events 278 Cable Interconnect Events 279 Battery Events ahaaha aaa aaa 280 Power And Performance Events 281 Entity Presence Events 284 Miscellaneous 0 4 285 ee Te ee ee es ee oe 289 Contents 6 Contents Introduction Dell OpenManage Server Administrator generates event messages stored primarily in the operating system or Server Administrator event logs and sometimes in Simple Network Management Protocol SNMP traps This document describes the event messages that are created by Server Administrator
2. 2106 SMART FPT Warning Cause A disk on the Clear Alert 903 exceeded Non critical specified controller has Number received a SMART alert None predictive failure Related indicating that the disk Alert is likely to fail in the Nimber near future None Action Replace the LRA disk that has received N mb i the SMART alert If the 2070 physical disk is a member of a non redundant virtual disk then back up the data before replacing the disk CAUTION Removing a physical disk that is included ina non redundant virtual disk causes the virtual disk to fail and may cause data loss 98 l Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2107 SMART Critical Cause A disk has Clear Alert 904 configuration Failure Error received a SMART alert Number change predictive failure after None a configuration change Related The disk is likely to fail Alert in the near future Number Action Replace the None disk that has received LRA the SMART alert If the Number physical disk is a 2071 member of a non redundant virtual disk then back up the data before replacing the disk CAUTION Removing a physical disk that is included ina non redundant virtual disk causes the virtual disk to fail and may cause data loss Storage Management Message Reference 99 T
3. 52 Server Management Messages A processor sensor in the specified system could not obtain a reading The sensor location chassis location previous state and processor sensor status information is provided Table 2 13 Processor Sensor Messages continued Event Description Severity Cause ID 1602 Processor sensor Information A processor sensor in the returned to a normal specified system transitioned value back to a normal state RO eee ae sensor location en hOGA kLOnUTA Chas Siss ocation previous state an i processor sensor status Chassis Location are provided lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt 1603 Processor sensor Warning A processor sensor in the detected a warning value Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt specified system is in a throttled state The sensor location chassis location previous state and processor sensor status information is provided Server Management Messages 53 Table 2 13 Processor Sensor Messages continued Event Description Severity Cause ID 1604 Processor sensor Error A processor sensor in the detected a failure specified system is disabled value has a configuration error or en Location ae a sre eee oe ee ee eee he sensor location chassis loca
4. Action Replace the source disk and restore from backup Software RAID e Perform a backup with the Verify option If the file backup fails try to restore the failed file from a previous backup When the backup with the Verify option is complete without any errors delete the Virtual Disk e Recreate a new Virtual Disk with new drives e Restore the data from backup None Related Alert Number 2195 2346 LRA Number 2071 209 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2348 The rebuild Critical Cause You are Clear Alert 904 failed due to Failure Error attempting to rebuild None errors on the data on a disk that is Related target physical defective Alert disk Action Replace the Number target disk If a rebuild 2195 2346 does not automatically LRA start after replacing the Number disk initiate the 2071 Rebuild task You may need to assign the new disk as a hot spare to initiate the rebuild 2349 Abad disk Critical Cause A write Clear Alert 904 block could Failure Error operation could not None not be complete because the Related reassigned disk contains bad disk Alert during a write blocks that could not be N mb i operation reassigned Data loss 2346 may have occurred and data redundancy may LRA also be lost Number 2
5. FluidCache None Storage Management Message Reference 241 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2913 The following Information No action required Clear Alert 901 failed cache None device has Related completed Alert None recovery wwn LRA 1 path 2 Number FluidCache None 2914 A valid Information No action required Clear Alert 1601 permanent None license is Related installed Alert None FluidCache LRA Number None 2915 No valid Error A valid license must be Clear Alert 1604 license is installed None installed Related FluidCache Alert None LRA Number None 2916 Runningonan Information A permanent license Clear Alert 1601 evaluation should be purchased None license Days Related remaining 1 Alert None days LRA FluidCache Number None 242 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2917 Running onan_ Error A permanent license Clear Alert 1604 expired must be installed None evaluation Related license No Alert None configuration LRA changes will be Number allowed None Expired days FluidCache 2918 Running onan_ Error A permanent license Clear Alert 1604 expired must be installed None evaluation Related license Aler
6. Informational informational purposes Action None Clear Alert 1201 Status Alert 2088 is a clear alert for alerts 2061 and 2136 Related Alert Number None LRA Number None 2089 Physical disk initialization completed OK Normal Cause This alert is for Informational informational purposes Action None Storage Management Message Reference Clear Alert 901 Status Alert 2089 is a clear alert for alert 2062 Related Alert Number None LRA Number None Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2090 Virtual disk OK Normal Cause This alert is for Clear Alert 1201 reconfiguration Informational informational purposes Status completed Alert 2090 is a clear alert for alert 2063 Related Alert Number None LRA Number None Action None 2091 Virtual disk OK Normal Cause This alert is for Clear Alert 1201 rebuild Informational informational purposes Status completed Alert 2091 is a clear alert for alert 2064 Related Alert Number None LRA Number None Action None Storage Management Message Reference 89 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2092 Physical disk OK Normal Cause This alert is for Clear Alert 901 re
7. Related Alert None LRA Number None 138 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2178 The controller Warning Cause The controller Clear Alert 1153 battery Learn Non critical battery must be fully None cycle has charged before the Related timed out Learn cycle can begin Alert None The battery may be unable to maintain a LRA full charge causing the Number Learn cycle to timeout 2100 Additionally the battery must be able to maintain cached data for a specified period of time in the event of a power loss For example some batteries maintain cached data for 24 hours If the battery is unable to maintain cached data for the required period of time then the Learn cycle timeout occurs Action Replace the battery pack as the battery is unable to maintain a full charge 2179 The controller OK Normal Cause This alert is for ClearAlert 1151 battery Learn Informational informational purposes None cycle has been Action None Related postponed Alert None LRA Number None Storage Management Message Reference 139 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2180 The controller OK Normal Cause This alert is for ClearAlert 1151 battery Learn Informa
8. Sends an SNMP trap if the operating system s SNMP service is installed and enabled NOTE Dell OpenManage Server Administrator Storage Management does not log alerts regarding the data I O path These alerts are logged by the respective RAID drivers in the system alert log See the Dell OpenManage Server Administrator Storage Management Online Help for updated information Storage Management Message Reference 65 Alert Message Format with Substitution Variables When you view an alert in the Server Administrator alert log the alert identifies the specific components such as the controller name or the virtual disk name to which the alert applies In an actual operating environment a storage system can have many combinations of controllers and disks as well as user defined names for virtual disks and other components Each environment is unique in its storage configuration and user defined names To receive an accurate alert message that the Storage Management service must be able to insert the environment specific names of storage components into an alert message This environment specific information is inserted after the alert message text as shown for alert 2127 in Table 3 1 For other alerts the alert message text is constructed from information passed directly from the controller or another storage component to the alert log In these cases the variable information is represented with a percent symbol in the Storage Managem
9. physical disk results in lost data In the case of an enclosure more than one enclosure component has failed For example the enclosure may have suffered the loss of all fans or all power supplies Action Identify and replace the failed components To identify the failed component select the Storage object and click the Health subtab Storage Management Message Reference 111 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2123 The controller status contd displayed on the Health subtab indicates whether a controller has a Failed or Degraded component Click the controller that displays a Warning or Failed status This action displays the controller Health subtab which displays the status of the individual controller components Continue clicking the components with a Warning or Health status until you identify the failed component See the online help for more information See the enclosure documentation for information on replacing enclosure components and for other diagnostic information 112 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2124 Redundancy OK Normal Cause Data Clear Alert 1304 normal Informational redundancy has been
10. was asserted em Redun Gain Information This event is generated when memory redundancy is regained redundancy regained Emory te ancy 1s regaine em ECC Warning Warning This event is generated when correctable ECC errors have increased from a normal rate transition to non critical from OK em ECC Warning Critical This event is generated when correctable ECC errors reach a transition to critical om critical rate from less severe em CRC Err Critical This event is generated when CRC errors enter a transition to non recoverable state non recoverable em Fatal SB CRC Critical This event is generated while uncorrect bi RCE Gas storing CRC errors to memory asserted Mem Fatal NB CRC Critical This event is generated while dee PreeueDie NEC wes removing CRC errors from Joserted memory Mem Overtemp Critical This event is generated when critical over temperatur system memory reaches critical WAS ASSeETEA temperature 270 System Event Log Messages for IPMI Systems Table 4 12 BIOS Generated System Events continued Event Message Severity Cause USB Over current Critical This event is generated when erae ition to the USB exceeds a predefined Aoncrecover bT current level Hdwr version err hardware Critical This event is generated when incompatibility there is a mismatch between BMC iDRAC Firmware and the BMC and iDRAC firmware CP
11. Alert Number 2122 2322 LRA Number 2090 2314 The Critical Cause Storage Clear Alert 104 initialization Failure Error Management is unable None sequence of to monitor or manage Related SAS SAS devices Alert None ERE Action Reboot the LRA system If problem N mb t system startup persists make sure you 205 f SAS 5 have supported versions management of the drivers and and fot firmware Also you may monitoring 1s need to reinstall Storage not possible Management or Server Administrator because of some missing installation components Storage Management Message Reference 191 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2315 Diagnostic OK Normal Cause The 1 Clear Alert 751 message 1 Informational indicates a substitution None variable The text for Related this substitution Alert None variable is generated by the utility that ran the LRA diagnostics and is Number displayed with the alert None in the alert log This text can vary depending on the situation This alert is for informational purposes Action None 2316 Diagnostic Critical Cause A diagnostics Clear Alert 754 message 1 Failure Error test failed The 1 None indicates a substitution Related variable The text for Alert None this substitution variable is generated by LRA the utility that ran the Number diagnostics and is
12. ID 1755 SD card device sensor Error An SD card device detected a non recoverable sensor in the specified value system detected an Sensor location lt Location error from which it TA heed os cannot recover The sensor location chassis Chassis location lt Name of location previous state chassis gt and SD card device Previous state was type information is lt State gt provided The SD card Sb cara device type ei state is provided if an E asa aeeees SD card is present in the SD card device SD card state lt State of SD card gt 62 Server Management Messages Chassis Management Controller Messages The Alerts sent by Dell M1000e Chassis Management Controller CMC are organized by severity That is the event ID of the CMC trap indicates the severity informational warning critical or non recoverable of the alert Each CMC alert includes the originating system name location and event message text The alert message text matches the corresponding Chassis Event Log message text that is logged by the sending CMC for that event Table 2 17 Chassis Management Controller Messages EventID Description Severity Cause 2000 CMC generateda Informational A user initiated test trap test trap was issued through the CMC GUI or RACADM CLI 2002 CMC reported a Informational CMC informational return to normal event as described in the or informational drsCAMessage variable event binding supplied with the alert
13. If sensor type is not discrete Temperature sensor value in degrees Celsius lt Reading gt If sensor type is discrete Discret stat lt State gt temperatur sensor on the backplane board system board or drive carrier in the specified system could not obtain a reading The sensor location chassis location previous state and a nominal temperature sensor value information is provided Server Management Messages 23 Table 2 2 Temperature Sensor Messages continued Event Description Severity Cause ID 1052 Temperature sensor returned Information A temperature to a normal value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Temperature sensor value in degrees Celsius lt Reading gt If sensor type is discrete Discrete temperature stat lt State gt sensor on the backplane board system board or drive carrier in the specified system returned to a valid range after crossing a failure threshold The sensor location chassis location previous state and temperature sensor value are provided 1053 Temperature sensor detected Warning a warning value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Temperature sensor value in degrees
14. continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2380 Foreign OK Normal Cause This alert is Clear Alert 751 configuration Informational provided for None has been informational purposes Related partially Action None Alert None imported Some LRA configuration Number failed to None import 222 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2381 Controller OK Normal Cause This alert is Clear Alert 751 preserved Informational provided for None cache is informational purposes Related recovered Action None Alert None LRA Number None 2382 Anun Warning Cause A physical disk Clear Alert 903 supported Non critical of media type SSD is None configuration attached to a controller Related was detected that does not support Alert None The controller SSD disks LRA does not Action Replace the Number support unsupported physical None physical disks disk with a physical disk of type SSD of media type HDD lt Physical DiskID gt lt controller ID gt lt connector ID gt 2383 The OK Normal Cause The number of ClearAlert 1201 Information Informational physical disks you 2195 level set for the specified for the hot Related hot spare spare protection policy Alert None protection is violat
15. incorrect lt Processor Entity gt Information This event is generated when the configuration error earlier processor configuration was deasserted error was corrected lt Processor Entity gt Warming This event is generated when the throttled was asserted processor slows down to prevent overheating lt Processor Entity gt Information This event is generated when the throttled was deasserted earlier processor throttled event was corrected CPU lt number gt has an Critical The specified CPU generated an internal error IERR internal error CPU lt number gt has a thermal Critical The CPU generates this event trip over temperature before it shuts down because of event excessive heat caused by lack of cooling or heat synchronization CPU lt number gt configuration Warming The specified CPU is not is unsupported support for this system CPU lt number gt is present Information The specified CPU is present 254 System Event Log Messages for IPMI Systems Table 4 4 Processor Status Events continued Event Message Severity Cause CPU lt number gt terminator is Information This event is generated if the present terminator is present on a processor slot CPU lt number gt terminator is Warning This event is generated if the absent terminator is missing on an empty processor slot CPU lt number gt is throttled Warning This event is generated when the processor slows down to prevent ov
16. memory device messages 6 Memory ECC Events 260 memory ECC messages 260 Memory Events 262 memory modules messages 262 memory prefailure sensor 9 messages AC power cord 49 265 battery 280 battery sensor 57 BIOS generated system 268 BMC watchdog 261 cable interconnect 279 chassis intrusion 35 cooling device 26 current sensor 32 drives 265 entity presence 281 fan enclosure 47 fan sensor 251 hardware log sensor 264 intrusion 267 memory device 46 memory ECC 260 memory modules 262 pluggable device 55 268 power supply 42 255 processor sensor 52 processor status 253 r2 generated system 277 redundancy unit 38 Server Administrator General 19 storage management 71 temperature sensor 22 247 voltage sensor 29 249 Multi bit ECC error 179 P Physical disk 1 146 Physical disk online 126 pluggable device sensor 10 Power And Performance Events 281 Power Supply Events 255 power supply messages 42 255 power supply sensor 9 Processor sensor 268 processor sensor 9 Processor Status Events 253 processor status messages 253 r2 generated system messages 277 Redundancy degraded 109 Redundancy lost 111 Redundancy normal 113 Redundancy sensor 260 redundancy unit messages 38 redundancy unit sensor 9 S SAS expander error 1 215 SAS port report 1 199 200 SAS SMP communications error 1 214 SCSI sense data 92 SCSI sense sector reassign 11
17. the PCI device option ROM for a NIC does not support link tuning or the Flex addressing feature This event is generated when the PCI device option ROM for a NIC does not support link tuning or the Flex addressing feature This event is generated when BIOS fails to program virtual MAC address on the given NIC device This event is generated when BIOS could not obtain virtual MAC address or Link Tuning data from iDRAC This event is generated when an unknown hardware failure is detected This event is generated when a description gt fatal error occurs during system boot See Table 4 13 for more information 276 System Event Log Messages for IPMI Systems POST Code Table Table 4 13 lists the POST Code errors that are generated when a fatal error occurs during system boot Table 4 13 POST Code Errors Fatal Error Description Cause Code 80 No memory detected This error code implies that no memory is installed 81 Memory detected but is This error code indicates memory not configurable configuration error that could be a result of bad memory mismatched memory or bad socket 82 Memory configured but This error code indicates memory not usable sub system failure 83 System BIOS shadow failure This error code indicates system BIOS shadow failure 84 CMOS failure This error code indicates that CMOS RAM is not working 85 DMA controller failure This error code indicate
18. 1 Status Power supply Information This event is generated when the sensor for PS 1 failure power supply has recovered from was deasserted an earlier failure event PS 1 Status Power supply Warning This event is generated when the sensor for PS 1 power supply is about to fail predictive failure was asserted PS 1 Status Power supply Information This event is generated when the sensor for PS 1 power supply has recovered from predictive failure was an earlier predictive failure event deasserted PS 1 Status Power supply Critical This event is generated when AC sensor for PS 1 input power is removed from the power lost was asserted supply PS 1 Status Power supply Information This event is generated when the sensor for PS 1 input power supply is plugged in lost was deasserted PS 1 Status Power supply Warnin This event is generated when an sensor for PS 1 Critical invalid power supply configuration error was configuration is detected asserted PS 1 Status Power supply Information This event is generated when the sensor for PS 1 power supply has recovered from configuration error was an earlier invalid configuration deasserted Power supply lt number gt is Information This event is generated when the present power supply is plugged in Power supply lt number gt is Critical This event is generated when the absent power supply is removed Power suppl
19. Alert None LRA Number None 2413 Controller Informational Cause This alert is Clear Alert 1201 CacheCade is provided for None created informational purposes Related Action None Alert None LRA Number None Storage Management Message Reference 231 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2414 Controller Informational Cause This alert is Clear Alert 1201 CacheCade is provided for None deleted informational purposes Related Action None Alert None LRA Number None 2415 Controller Informational Cause The battery Clear Alert 1151 battery is learn cycle has started None discharging Rate N ne Related i Alert None LRA Number None 2416 Disk medium Warning Cause A part of the Clear Alert 903 error detected Non critical physical disk is None damaged Related Action None Alert None LRA Number None 232 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2417 There is an unrecoverable medium error detected on virtual disk Cause Unrecoverable medium error found on one or more member physical disks of a virtual disk Critical Failure Error Action Perform a backup of the virtual disk with the Verify option selected If the Backup o
20. Cause This alert is for Clear Alert 1201 check Informational informational purposes Number consistency Aton Nowe 2085 started Related Alert Number None LRA Number None 2059 Virtual disk OK Normal Cause This alert is for Clear Alert 1201 format started Informational informational purposes Number Action None 2086 Related Alert Number None LRA Number None Storage Management Message Reference 77 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2060 Copy of data started from physical disk 2 to physical disk 1 OK Normal Informational Cause This alert is for informational purposes Action None Clear Alert 1201 Number None Related Alert Number 2075 LRA Number None 2061 Virtual disk initialization started OK Normal Informational Cause This alert is for informational purposes Action None Clear Alert 1201 Number 2088 Related Alert Number None LRA Number None 2062 Physical disk initialization started 733 OK Normal Informational Cause This alert is for informational purposes Action None Storage Management Message Reference Clear Alert 901 Number 2089 Related Alert Number None LRA Number None Table 3 4 Storage Management Messages continued Event Descrip
21. Celsius lt Reading gt If sensor type is discrete Discrete temperature stat lt State gt 24 Server Management Messages A temperature sensor on the backplane board system board CPU or drive carrier in the specified system exceeded its warning threshold The sensor location chassis location previous state and temperature sensor value are provided Table 2 2 Temperature Sensor Messages continued Event Description Severity Cause ID 1054 Temperature sensor detected Error A temperature a failure value sensor on the Sensor location lt Location in ae ko chassis system oard or drive carrier in the Chassis location lt Name of specified system chassis gt exceeded its failure Previous state was lt State gt threshold If sensor type is not discrete The ae location chassis Temperature sensor value in location previous degrees Celsius lt Reading gt state If sensor type is discrete and temperature sensor value Discrete temperature stat p are provided lt State gt 1055 Temperature sensor detected Error A temperature a non recoverable valu Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Temperature sensor value in degrees Celsius lt Reading gt If sensor type is discrete Discret stat lt State gt temperatur sensor on the backplane boa
22. Check the N mbet health of the enclosure 2091 and its components Replace any hardware that is in a Failed state See the hardware documentation for more information 2303 The enclosure OK Normal Cause This alert is for Clear Alert 851 cannot support Informational both SAS and SATA physical disks Physical disks may be disabled Storage Management Message Reference informational purposes Action None None Related Alert None LRA Number None 185 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2304 Anattemptto OK Normal Cause This alert is for ClearAlert 751 hot plug an Informational informational purposes None EMM has been Action None Related detected This Alert type of hot Number plug is not 2211 supported LRA Number None 2305 The physical Warning Cause The physical Clear Alert 903 disk is too Non critical disk is too small to None small to be rebuild the data Related used for 2 Action Remove the Alert rebuild physical disk and insert Number a new physical disk that 2326 is the same size or larger 1 RA than the disk that is Naber being rebuilt The new 2070 physical disk must also use the same technology for example SAS or SATA as the disk being rebuilt If the rebuild does not start automatically after you have inserted a suitable physical disk then run t
23. ECC single bit error rate is exceeded The system board fail safe Critical This event is generated when voltage is outside of the system board voltages are range not at normal levels The system board fail safe Information This event is generated when voltage is within range earlier Fail Safe system voltages return to a normal level System Event Log Messages for IPMI Systems 275 Table 4 12 BIOS Generated System Events continued Event Message Severity Cause A hardware incompatibility Critical detected between BMC iDRAC firmware and CPU A hardware incompatibility Information was corrected between BMC iDRAC firmware and CPU Device option ROM on Critical embedded NIC failed to support Link Tuning or FlexAddress Device option ROM on Critical mezzanine card lt number gt failed to support Link Tuning or FlexAddress Failed to program virtual Critical MAC address on a component at bus lt bus gt device lt device gt function lt function gt Failed to get Link Tuning Critical or FlexAddress data from iDRAC An unknown system hardware Critical failure detected POST fatal error lt error Critical This event is generated when there is a mismatch between the BMC and iDRAC firmware and the processor in use or vice versa This event is generated when an earlier mismatch between the BMC and iDRAC firmware and the processor is corrected This event is generated when
24. ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2387 A virtual disk bad block medium error is detected Critical Failure Error Storage Management Message Reference Cause Virtual disk bad blocks are due to presence of unrecoverable bad blocks on one or more member physical disks Action 1 Perform a backup of the virtual disk with the Verify option selected One of the following can occur e Backup operation fails In this case restore the file from a previous backup After restoring the file run Patrol Read and check for bad blocks If more bad blocks exist proceed to step 2 Backup operation completes without error This indicates that there are no bad blocks on your virtual disk Backup operation displays bad blocks This indicates that the bad blocks are located in a non data area Proceed to step 2 Clear Alert 1204 None Related Alert None LRA Number 2081 225 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2387 2 To clear these bad contd blocks execute the Clear Virtual Disk Bad Blocks task 3 Run Patrol Read to ensure no new bad blocks are found 2388 The Controller OK Normal Cause The Controller Clear Alert 751 Encryption Informational Encryption Key is None Key is destroyed Related destro
25. Information User requested a host system system control action control action to reboot power Action requested was off or power cycle the system lt Action gt Alternatively the user had indicated protective measures to be initiated in the event of a thermal shutdown 1008 Systems Management Information Systems Management Data Manager Started Data Manager services were started 1009 Systems Management Information Systems Management Data Manager Stopped Data Manager services were stopped 1011 RCI table is corrupt Error This message is generated when the BIOS Remote Configuration Interface RCI table is corrupted or cannot be read by the systems management software 1012 IPMI Status Information This message is generated Interface lt the IPMI interface being used gt lt additional information if available and applicable gt to indicate the Intelligent Platform Management Interface IPMI status of the system Additional information when available includes Baseboard Management Controller BMC not present BMC not responding System Event Log SEL not present and SEL Data Record SDR not present Server Management Messages 21 Table 2 1 Server Administrator General Messages continued Event Description Severity Cause ID 1013 System Peak Power Information The system peak power sensor detected new peak detected a new peak value in value power consumption The new Peak value in peak
26. Number restored to a virtual disk Alert 2124 or an enclosure that is a clear previously suffered a alert for loss of redundancy alerts 2122 This alert is for and 2123 informational purposes Related Action None Alert Number None LRA Number None 2125 Controller Warning Cause Virtual disk Clear Alert 1203 cache Non critical controller was Number preserved for missing or offline virtual disk Storage Management Message Reference disconnected during I O operation Action Import foreign disks if any Check if the enclosure containing the virtual disk is disconnected from the controller 2186 2240 Related Alert Number None LRA Number None 113 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2126 SCSI sense Warning sector reassign Non critical 114 Cause A sector of the physical disk is corrupted and data cannot be maintained on this portion of the disk This alert is for informational purposes CAUTION Any data residing on the corrupt portion of the disk may be lost and you may need to restore your data from backup Action If the physical disk is part of a non redundant virtual disk then back up the data and replace the physical disk CAUTION Removing a physical disk that is included ina non redundant virtual disk causes the virtual
27. Related Fee dictiyg one controller property Alert None Balun and run the command RA changed again REA None 2223 Abort Check OK Normal Cause This alert is for Clear Alert 751 Consistencyon Informational informational purposes None Error Action Change at least Related Copyback and one controller property Alert None Loadbalance and run the command RA changed again Number None 2224 Copybackand OK Normal Cause This alert is for Clear Alert 751 Loadbalance Informational informational purposes None changed Action Change at least Related one controller property Alert None and run the command LRA Agel Number None 2225 Abort Check OK Normal Cause This alert is for Clear Alert 751 Consistencyon Informational informational purposes None Error and Load Action Change at least Related balance one controller property Alert None changed and run the command RA agan Number None Storage Management Message Reference 155 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2226 Load balance OK Normal Cause This alert is for ClearAlert 751 changed Informational informational purposes None Action Change at least Related one controller property Alert None and run the command LRA agai Number None 2227 Abort Check OK Normal Cause This alert is for ClearAlert 751 Consistencyon Infor
28. SNMP ID Alert Trap Information Numbers 2399 The Physical OK Normal Cause The physical Clear Alert 901 Disk Power Informational disk power status is None status changed changed from one state Related from 1 to 2 to another A physical Alert None eT 8P Number statuses spun down oe None transition and spun up Action None 2400 Physical disk Warning Cause The physical Clear Alert 901 configuration Non critical disk configuration data None data updated is updated because it Related as it was stale was outdated Alert None Action None LRA Number None 2401 Configuration Failure Error Cause The virtual disk Clear Alert 754 command configurationcommand None could not be did not succeed Related committed to Action Check for the Alert None disk recent configuration LRA Configuration that has not taken Number pane oe effect Re apply the None applied configuration 2402 Changingthe Failure Error Cause When changing Clear Alert 904 Physical Disk the Physical Disk Power None Power status status fails Related from 1 to 2 Action Replace the Alert None failed physical disk LRA Number None Storage Management Message Reference 229 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2403 Virtual Disk is OK Normal Cause The operating ClearAlert 1201 available Informational system detects the None newly
29. Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt SD card device typ lt Typ of SD card device gt SD card state lt State of SD card gt sensor in the specified system failed The sensor location chassis location previous state and SD card device type information is provided The SD card state is provided if an SD card is present in the SD card device Server Management Messages _ 59 Table 2 16 SD Card Device Messages Event Description Severity Cause ID 1751 SD card device sensor Information An SD card device value unknown sensor in the specified Sensor location lt Location nr oe h in ohassis gt obtain a reading T e sensor location chassis Chassis location lt Name of location previous state chassis gt and SD card device Previous state was type information is lt State gt provided The SD card Sp card device type tye state is provided if an Ge Oprea rA aa aes SD card is present in the SD card device SD card state lt State of SD card gt 1752 SD card device returned to Information An SD card device normal sensor in the specified Sensor location lt Location system detected that in chassis gt an SD card transitioned back to a Chassis location lt Name of normal state The chassis gt sensor location chassis Previous state was location previous state lt State gt and SD card device SDcard device
30. Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt Information A fan sensor reading on the specified system returned to a valid range after crossing a warning threshold The sensor location chassis location previous state and fan sensor value information is provided 1103 Fan sensor detected a warning value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt A fan sensor reading in the specified system exceeded a warning threshold The sensor location chassis location previous state and fan sensor value information is provided 1104 Fan sensor detected a failure value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt A fan sensor in the specified system detected the failure of one or more fans The sensor location chassis location previous state and fan sensor value information is provided Server Management Messages _ 27 Table 2 3 Cooling Device Messages continued Event Description Severity Cause ID 1105 Fan sensor detected a Error A fan sensor non recoverable valu detected an error from which it cannot recover The sensor locatio
31. State gt Chassis intrusion state lt Intrusion state gt 1251 Chassis intrusion Error A chassis intrusion sensor sensor value unknown in the specified system Sensor location a ee g lt Location in chassis gt rea mg ne Sensor location chassis location Chassis location lt Name previous state and of chassis gt chassis intrusion state Previous state was are provided lt State gt Chassis intrusion state lt Intrusion state gt 1252 Chassis intrusion Information A chassis intrusion sensor returned to normal in the specified system sensor te ese Rens pes aE regen was lt Location in chassis gt OPSRES WARS tnesystem was operating but has Chassis locator lt Name since been replaced of chassis gt The sensor location Previous state was chassis location previous lt State gt state and chassis Bb an f intrusion state Chassis intrusion state ii ti ided information is provided lt Intrusion state gt P 36 Server Management Messages Table 2 6 Chassis Intrusion Messages continued Event Description Severity Cause ID 1253 Chassis intrusion in Warning A chassis intrusion sensor progress in the specified system Sensor Iocation detected that oe LOCAL ton Gn chasse cover 1s currently being opened and the system is Chassis location lt Name operating The sensor of chassis gt location chassis location Previous state was previous state and chassis lt State gt intrusion state ar i information is
32. The controller name is not always displayed Battery Message Format Battery X Controller A For example 2174 The controller battery has been removed Battery 0 Controller 1 SCSI Physical Message Format Physical Disk X Y Controller A Connector B Disk For example 2049 Physical disk removed Physical Disk 0 14 Controller 1 Connector 0 SAS Physical Disk Message Format Physical Disk X Y Z Controller A Connector B For example 2049 Physical disk removed Physical Disk 0 0 14 Controller 1 Connector 0 Virtual Disk Message Format Virtual Disk X Name Controller A Name Message Format Virtual Disk X Controller A For example 2057 Virtual disk degraded Virtual Disk 11 Virtual Disk 11 Controller 1 PERC 5 E Adapter NOTE The virtual disk and controller names are not always displayed Enclosure Message Format Enclosure X Y Controller A Connector B For example 2112 Enclosure shutdown Enclosure 0 2 Controller 1 Connector 0 SCSI Power Supply Message Format Power Supply X Controller A Connector B Target ID C where C is the SCSI ID number of the enclosure management module EMM managing the power supply For example 2122 Redundancy degraded Power Supply 1 Controller 1 Connector 0 Target ID 6 Storage Management Message Reference 67 Table 3 2 Message Format with Variables for Each Storage Object continued Storage Object Message Variables SAS Power Supply Messa
33. This alert is for Clear Alert 751 import of Informational informational purposes None U nsupported Action None Related Virtual Disk Alert None type RAID 1 LRA Number None Storage Management Message Reference 219 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2372 Attempted OK Normal Cause This alert is Clear Alert 751 import of Informational provided for None Virtual Disk informational purposes Related exceeding the Action None Alert None limit supported on LRA the controller ee one 2373 Attempted OK Normal Cause This alert is Clear Alert 751 import of Informational provided for None unsupported informational purposes Related Virtual Disk User is attempting to Alert None type RAID 1 import a foreign virtual LRA disk with unsupported Number RAID level on the es a controller QRS Action None 2374 Attempted OK Normal Cause This alert is Clear Alert 751 import of Informational provided for None Virtual Disk informational purposes Related with missing and is displayed when Alert None span you attempt to import a foreign virtual disk with se bek a missing span ae me Action None 2375 Attempted OK Normal Cause User is Clear Alert 751 import of Informational attempting to importa None Virtual Disk foreign virtual disk with Related with missing a missing physical disk Alert None physical disk
34. assis location previous Chassis location lt Name of state power supply chassis gt type additional power Previous state was lt State gt supply status and Por r Suppl otype ene ee configuration error 44 power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt Server Management Messages type information are provided Table 2 8 Power Supply Messages continued Event Description Severity Cause ID 1355 Power supply sensor detected Error A power supply sensor a non recoverable valu Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt in the specified system detected an error from which it cannot recover The sensor location chassis location previous state power supply type additional power supply status and configuration error type information is provided Server Management Messages 45 Memory Device Messages The memory device messages listed in Table 2 9 provide status and warning information for memory modules present in a particular system Memory devices determine health status by monitori
35. created virtual Related disk Alert None Action None LRA NOTE This alert also Number appears when a None CacheCade is created but is not available for the operating system as itis a CacheCade and not a Virtual Disk 2404 Virtual Disk is OK Normal Cause The operating ClearAlert 1201 not available Informational system does not detect None the newly created Related virtual disk Alert None Action Wait for some LRA time Number None 2405 Command Informational Cause The spundown Clear Alert 901 timeout on physical disks take more None physical disk time than the timeout Related period and the Alert None configuration LRA commands are timed Number out None Action None 230 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2407 Controller Informational Cause The Local Key ClearAlert 751 Encryption Management LKM None mode is encryption mode is Related enabled in enabled Alert None LKM Action None LRA Number None 2411 Controller Informational Cause Using Manage Clear Alert 751 LKM Encryption Key None Encryption key operations encryption Related is changed key is changed Alert None Action None LRA Number None 2412 Controller Informational Cause This alert is Clear Alert 1201 CacheCade is provided for None resized informational purposes Related Action None
36. disk fails Because the virtual disk is redundant uses mirrored or parity information and only one physical disk has failed the virtual disk can be rebuilt Action 1 Replace the failed drive Rebuild of the virtual disk starts automatically NOTE If you put the drive in a different slot you need to assign it as a hot spare for the rebuild to start automatically If you are using an Expandable RAID Controller PERC PERC 4 SC 4 DC 4e DC 4 Di CERC ATA100 4ch PERC 5 E PERC 5 i or a Serial Attached SCSI SAS S R controller rebuild the virtual disk by first configuring a hot spare for the disk and then initiating a write operation to the disk The write operation initiates a rebuild of the disk Storage Management Message Reference Clear Alert 1203 Number None Related Alert Number 2048 2049 2050 2076 2079 2081 2123 2129 2346 LRA Number 2080 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2057 Cause 2 A physical disk contd in the disk group has been removed Action 2 If a physical disk was removed from the disk group either replace the disk or restore the original disk You can identify which disk has been removed by locating the disk that has a red X for its status Perform a rescan after replacing the disk 2058 Virtual disk OK Normal
37. hot See the physical disk enclosure documentation for more diagnostic information Storage Management Message Reference Clear Alert 1054 Number None Related Alert Number None LRA Number 2091 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2103 Temperature Critical dropped below Failure Error the minimum failure threshold Cause The physical disk enclosure is too cool Action Check if the thermostat setting is too low and if the room temperature is too cool Clear Alert 1054 Number None Related Alert Number 2112 LRA Number 2091 2104 Controller bat OK Normal tery is recondi Informational tioning Cause This alert is for informational purposes Action None Clear Alert 1151 Number 2105 Related Alert Number None LRA Number None 2105 Controller battery recondition is completed OK Normal Informational Storage Management Message Reference Cause This alert is for informational purposes Action None Clear Alert 1151 Status Alert 2105 is a clear alert for alert 2104 Related Alert Number None LRA Number None 97 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers
38. in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Current sensor value in Amps lt Reading gt OR Current sensor value in Watts lt Reading gt If sensor type is discrete Discrete current state lt State gt Server Management Messages in the specified system failed The sensor location chassis location previous state and current sensor value are provided Table 2 5 Current Sensor Messages continued Event Description Severity Cause ID 1201 Current sensor value unknown Error A current sensor Sensor location lt Location in in the specified system could not chassis gt obtain a reading Chassis location lt Name of The sensor chassis gt location chassis Previous state was lt State gt location previous i state and a If sensor type is not discrete nominal current Current sensor value in Amps sensor value lt Reading gt OR information is Current sensor value in provided Watts lt Reading gt If sensor type is discrete Discrete current state lt State gt 1202 Current sensor returned to Information A current sensor a normal value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Current sensor value lt Reading gt OR in Amps Current sensor value in Watts lt Re
39. is for informational purposes Action None 2331 A bad disk OK Normal Cause The diskhasa Clear Alert 901 block has been Informational bad block Data has None reassigned been readdressed to Related another disk block and Alert None no data loss has occurred LRA Number Action Monitor the Noii disk for other alerts or indications of poor health For example you may receive alert 2306 Replace the disk if you suspect there is a problem 200 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Cause and Action Related SNMP ID Alert Trap Information Numbers 2332 A controller OK Normal Cause This alert is for Clear Alert 751 hot plug has informational purposes None been detected Action None Related Alert None LRA Number None 2334 Controller Cause The 1 Clear Alert 751 event log 1 indicates a substitution None variable The text for Related this substitution Alert None variable is generated by the controller and is LRA displayed with the alert Number in the alert log This None text is from events in the controller event log that were generated while Storage Management was not running This text can vary depending on the situation This alert is for informational purposes Action None Storage Management Message Reference 201 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and A
40. is generated when the BMC watchdog detects that the system has crashed timer expired because no response was received from Host and the action is set to power cycle The OS watchdog timer Critical reset the system This event is generated when the BMC watchdog detects that the system has crashed timer expired because no response was received from Host and the action is set to reboot System Event Log Messages for IPMI Systems 261 262 Table 4 7 BMC Watchdog Events continued Event Message Severity Cause The OS watchdog timer Critical This event is generated when the powered cycle th BMC watchdog detects that the system system has crashed timer expired because no response was received from Host and the action is set to power cycle The OS watchdog timer Critical This event is generated when the powered off the BMC watchdog detects that the system system has crashed timer expired because no response was received from Host and the action is set to power off The OS watchdog timer Critical This event is generated when the expired BMC watchdog timer expires and no action is set Memory Events The memory modules can be configu red in different ways in particular systems These messages monitor the status warning and configuration information about Table 4 8 Memory Events the memory modules in the system Event Message Severity Cause Memory RAID
41. or cold Verify that the fans in the server or enclosure are working If the physical disk is in an enclosure you should check the thermostat settings and examine whether the enclosure is located near a heat source Storage Management Message Reference 101 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2109 Make sure the enclosure contd has enough ventilation and that the room temperature is not too hot See the physical disk enclosure documentation for more diagnostic information Action 2 If you cannot identify why the disk has reached an unacceptable temperature then replace the disk If the physical disk is a member of a non redundant virtual disk then back up the data before replacing the disk CAUTION Removing a physical disk that is included ina non redundant virtual disk causes the virtual disk to fail and may cause data loss 102 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2110 SMART Warning Cause A disk is Clear Alert 903 warning Non critical degraded and has Number degraded received a SMART alert None predictive failure The Related disk is likely to fail in Alert the near future Number Action Replace the None disk
42. provided Chassis intrusion state P lt Intrusion state gt 1254 Chassis intrusion Critical A chassis intrusion sensor detected in the specified system saison location detected that e lt Location in chassis gt SOVET WAS OPENCE Wile the system was operating Chassis locatton lt Name The sensor location of chassis gt chassis location previous Previous state was state and chassis lt State gt intrusion state ee information is provided Chassis intrusion state P lt Intrusion state gt 1255 Chassis intrusion Error A chassis intrusion sensor sensor detected a non recoverable valu Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Chassis intrusion state lt Intrusion state gt in the specified system detected an error from which it cannot recover The sensor location chassis location previous state and chassis intrusion state information is provided Server Management Messages _ 37 Redundancy Unit Messages Redundancy means that a system chassis has more than one of certain critical components Fans and power supplies for example are so important for preventing damage or disruption of a computer system that a chassis may have extra fans or power supplies installed Redundancy allows a second or nth fan to keep the chassis components at a safe temperature when the primary fan has failed Redundancy is normal when
43. redundancy Check Consistency may be lost task If you receive this s i ewe Number alert again check the 2080 health of the physical disks included in the virtual disk Review the alert messages for significant alerts related to the physical disks If you suspect that a physical disk has a problem replace it and restore from backup 2343 The Check Warning Cause The Check Clear Alert 1203 Consistency Non critical Consistency can no None logging of longer report errors iN Related pervs the parity data Alert None parity data is n disabled n See the LRA hardware Noaber documentation for 2080 more information 2344 The virtual Warning Cause A user has Clear Alert 1203 disk Non critical cancelled the virtual None initialization disk initialization Related terminated Action Restart the Alert None initialization LRA Number 2080 206 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2345 The virtual Critical Cause The controller Clear Alert 1204 disk Failure Error cannot communicate None initialization with attached devices A Related failed disk may be removed or Alert None contain errors Cables may also be loose or LRA defective Number f 2081 Action Verify the health of attached devices Review the Alert Log for significant events Make sure the cables are a
44. resumed Alert None LRA Number None 2194 The virtual OK Normal Cause This alert is for ClearAlert 1201 disk Read Informational informational purposes None policy has Action None Related changed Alert None LRA Number None Storage Management Message Reference 145 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2195 Dedicated hot OK Normal Cause This alert is for Clear Alert 1201 spare assigned Informational informational purposes Number Physical disk ketin None 2196 l Related Alert None LRA Number None 2196 Dedicated hot OK Normal Cause This alert is for Clear Alert 1201 spare Informational informational purposes Status unassigned Acton None None Physical Related disk 1 Alert None LRA Number None 2197 Physical disk OK Normal Cause This alert is Clear Alert 903 Copyback Informational provided for Number stopped for informational purposes None rebuild Action None Related Alert Number 2060 LRA Number None 146 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2198 The physical OK Normal Cause This alert is for Clear Alert 903 disk is too Informational informational purposes Number small to be Action None None u
45. s Guide available at support dell com for more information on checking the cables Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2283 Aredundant Waming Cause The controller Clear Alert 903 path is broken Non critical has two connectors that are connected to the same enclosure The communication path on one connector has lost connection with the enclosure The communication path on the other connector is reporting this loss Action Make sure the cables are attached securely and both enclosure management modules EMMs are healthy See the Cables Attached Correctly section for more information on checking the cables 2284 Related Alert None LRA Number 2070 2284 A redundant path has been restored OK Normal Informational Storage Management Message Reference Cause This alert is provided for informational purposes Action None Clear Alert 901 Alert 2284 is a clear alert for alert 2283 Related Alert None LRA Number 2071 177 Table 3 4 Storage Management Messages continued 178 Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2285 Adiskmedia OK Normal Cause This alert is for ClearAlert 901 error was Informational informational purposes None corrected
46. sensor in a failure value the specified system Sensor location lt Location S Pn ehassiss thresho ne sensor l location chassis gaaon location lt Name of location previous chassis gt state and voltage Previous state was lt State gt sensor value If sensor type is not information 13 provided discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt 1155 Voltage sensor detected a Error A voltage sensor in non recoverable valu Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt the specified system detected an error from which it cannot recover The sensor location chassis location previous state and voltage sensor value information is provided Server Management Messages 31 Current Sensor Messages The current sensors listed in Table 2 5 measure the amount of current in amperes that is traversing critical components Current sensor messages provide status and warning information for current sensors in a particular chassis Table 2 5 Current Sensor Messages Event Description Severity Cause ID 1200 Current sensor has failed Error A current sensor 32 Sensor location lt Location
47. state Make sure the cables are attached securely See the online help for more information on checking the cables Restart the Clear task 2271 The Patrol OK Normal Cause The Patrol Read Clear Alert 901 Read Informational task has encounteredan None encountered a error such as a bad disk Related media error block that cannot be Alert None remapped This alert is for informational LRA purposes Number None Action None 170 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2272 Patrol Read Critical Cause The Patrol Read Clear Alert 904 found an Failure Error task has encountered an None uncorrectable error that cannot be Related media error corrected There maybe Alert None a bad disk block that cannot be remapped LRA Number Action Back up your 207 Storage Management Message Reference data If you are able to back up the data successfully then fully initialize the disk and then restore from back up 171 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2273 Ablock onthe Critical Cause The controller ClearAlert 904 physical disk Failure Error encountered an None has been unrecoverable medium Related punctured by error when attempting Alert 2095 the c
48. that has received LRA the SMART alert If the Number physical disk is a 2070 member of a non redundant virtual disk then back up the data before replacing the disk CAUTION Removing a physical disk that is included ina non redundant virtual disk causes the virtual disk to fail and may cause data loss 2111 Failure Warning Cause A disk has Clear Alert 903 prediction Non critical received a SMART alert Number threshold predictive failure due None exceeded due to test conditions Related to test Action None Alert Number None LRA Number 2070 Storage Management Message Reference 103 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2112 Enclosure was Critical Cause The physical Clear Alert 854 shut down Failure Error disk enclosure is either Number hotter or cooler than None the maximum or Related minimum allowable Alert temperature range Number Action Check for None factors that may cause RA overheating or excessive Number cooling For example 2091 verify that the enclosure fan is working You should also check the thermostat settings and examine whether the enclosure is located near a heat source Make sure the enclosure has enough ventilation and that the room temperature is not too hot or too cold See the enclosure documentation for more diagnostic information S
49. the event for example Power supply input AC is off Power supply POK power OK signal is not normal Power supply is turned off Chassis intrusion lt Intrusion state state gt 14 Introduction Specifies whether the chassis intrusion state is Open or Closed For example Chassis intrusion state Open Table 1 2 Event Description Reference continued Description Line Item Explanation Chassis location lt Name of chassis gt Specifies name of the chassis that generated the message for example Chassis location Main System Chassis Configuration error type lt type of configuration error gt Specifies the type of configuration error that occurred for example Configuration error type Revision mismatch Current sensor value in Amps lt Reading gt Specifies the current sensor value in amps for example Current sensor value in Amps 7 853 Date and time of action lt Date and time gt Specifies the date and time the action was performed for example Date and time of action Sat Jun 12 16 20 33 2004 Device location lt Location in Specifies the location of the device in the specified chassis for example chassis Device location Memory Card A Discrete current Specifies the state of the current sensor for example Sra E S ROTER Discrete current state Good Discrete Specifies the state of the temperature sensor temperatu
50. 0 Because of this difference in technology the hot spare cannot rebuild data if one of the physical disks in the virtual disk fails Action Add a SATA disk that is large enough to be used as the hot spare and assign the new disk as a hot spare 2210 Battery Warning Cause Battery is in Clear Alert 1153 requires Non critical warn only mode and None reconditioning requires reconditioning Related Initiate the Action Initiate the Alert None eae learn battery learn cycle LRA Number None Storage Management Message Reference 151 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2211 The physical Warning Cause The physical Clear Alert 903 disk is not Non critical disk may not have a None supported supported version of the Related firmware or the disk Alert None may not be supported by Dell LRA KNO e Number Action If the disk is 2070 supported by Dell update the firmware to a supported version If the disk is not supported by Dell replace the disk with one that is supported 2212 The controller OK Normal Cause This alert is for ClearAlert 1151 battery Informational informational purposes None temperature 1s Action None Related above normal Alert None LRA Number None 2213 Recharge Warning Cause The battery has ClearAlert 1153 count Non critical been recharged more None maxi
51. 071 Action Replace the disk 2350 There wasan Critical Cause The rebuild or Clear Alert 904 unrecoverable Failure Error recovery operation None disk media encountered an Related error during unrecoverable disk Alert the rebuild or media error Numib r Pa Action Replace the 2095 2273 operation disk LRA Number 2071 210 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2351 A physical disk OK Normal Cause This alert is for Clear Alert 901 ismarked as Informational informational purposes Number missing 2352 Related Alert None LRA Number None Action None 2352 A physical disk OK Normal Cause This alert is for Clear Alert 901 that was Informational informational purposes Status marked as Achin Non Alert 2352 missing has is a clear been replaced alert for alert 2351 Related Alert None LRA Number None 2353 The enclosure OK Normal Cause This alert is for Clear Alert 1051 temperature Informational informational purposes Status has returned to Action None Alert 2353 normal is a clear alert for alerts 2100 and 2101 Related Alert None LRA Number None Storage Management Message Reference 211 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Info
52. 158 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2238 The controller OK Normal Cause The user has Clear Alert 751 debug log file Informational attempted to export the None has been controller debug log Related exported This alert is for Alert None informational purposes i LRA Action None Number None 2239 A foreign OK Normal Cause The user has Clear Alert 751 configuration Informational attempted to clear a None has been foreign configuration Related cleared This alert is for Alert None informational purposes LRA Action None N mber None 2240 A foreign OK Normal Cause The user has Clear Alert 751 configuration Informational attempted to importa None has been foreign configuration Related imported This alert is for Alert None informational purposes l i LRA Action None Number None 2241 The Patrol OK Normal Cause The controller Clear Alert 751 Read mode has Informational has changed the patrol None changed read mode This alert is Related for informational Al rt N n purposes LRA Action None Number None Storage Management Message Reference 159 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2242 The Patrol OK Normal Cause The controller
53. 2003 CMC reported a Warning CMC warning event as warning described in the drsCAMessage variable supplied with the alert 2004 CMC reported a_ Critical CMC critical event as critical event described in the drsCAMessage variable binding supplied with the alert 2005 CMC reported a _ Non Recoverable CMC non recoverable non recoverable event event as described in the drsCAMessage variable binding supplied with the alert Server Management Messages _ 63 64 Server Management Messages Storage Management Message Reference The Dell OpenManage Server Administrator Storage Management s alert or event management features let you monitor the health of storage resources such as controllers enclosures physical disks and virtual disks Alert Monitoring and Logging The Storage Management Service performs alert monitoring and logging By default the Storage Management service starts when the managed system starts up If you stop the Storage Management Service then alert monitoring and logging stops Alert monitoring does the following K Updates the status of the storage object that generated the alert Propagates the storage object s status to all the related higher objects in the storage hierarchy For example the status of a lower level object is propagated up to the status displayed on the Health tab for the top level Storage object Logs an alert in the alert log and the operating system application log
54. 2061 displayed with the alert in the alert log This text can vary depending on the situation Action See the documentation for the utility that ran the diagnostics for more information 192 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2318 Problems with Warning Cause The battery or Clear Alert 1153 the battery or Non critical the battery chargeris None the battery not functioning Related charger have properly Alert been detected Action Replace the Number The battery battery pack 2188 health is poor LRA Number 2100 2319 Single bit Warning Cause The DIMM is ClearAlert 753 ECC error Non critical beginning to None The DIMM is malfunction Related degrading Action Replace the Alert DIMM to avoid data Number loss or data corruption 2320 The DIMM isa part of IRA the controller battery Number Storage Management Message Reference pack See your hardware 2060 documentation for information on replacing the DIMM or contact technical support 193 Table 3 4 Storage Management Messages continued 194 Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2320 Single bit Critical Cause The DIMM is Clear Alert 754 ECC error Failure Error malfunctioning None The DIMM is Data
55. 4 sensor AC power cord 9 chassis intrusion 9 current 9 fan 9 fan enclosure 9 hardware log 9 memory prefailure 9 power supply 9 processor 9 52 redundancy unit 9 Index 291 temperature 9 voltage 9 Service tag changed 125 Single bit ECC error limit 142 Single bit ECC error 180 SMART thermal shutdown 165 Smart warning degraded 103 Smart warning temperature 101 System Event Log Messages 247 T temperature sensor 9 Temperature Sensor Events 247 temperature sensor messages 22 247 U understanding event description 14 V viewing event information 13 event messages 10 events in Red Hat Enterprise Linux 12 events in SUSE Linux Enterprise Server 12 292 Index viewing events in Windows operating systems 12 Virtual disk initialization 118 Virtual disk renamed 127 voltage sensor 9 Voltage Sensor Events 249 voltage sensor messages 29 249
56. A immediate experiencing a problem Number reboot is The 1 indicates a 2051 strongly substitution variable recommended The text for this to avoid substitution variable is further displayed with the alert problems in the alert log and can If the reboot vary depending on the does not situation restore pee Action Reboot the cormianicatio system If the problem n then coritaci is not resolved contact technical sup technical support See port for more your system information documentation for information about contacting technical support by using telephone fax and Internet services 2269 The physical OK Normal Cause This alert is for Clear Alert 901 disk Clear Informational informational purposes None operation has Action None Related completed Alert None LRA Number None Storage Management Message Reference 169 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2270 The physical Critical Cause A Clear task was Clear Alert 904 disk Clear Failure Error being performed ona None operation physical disk but the Related failed task was interrupted Alert None and did not complete successfully The LRA controller may have lost Number communication with 2071 the disk The disk may have been removed or the cables may be loose or defective Action Verify that the disk is present and not in a Failed
57. Action None Related during Alert None recovery LRA Number None 2286 A Learn cycle OK Normal Cause This alert is for ClearAlert 1151 start is pending Informational informational purposes None while the Action None Related battery Alert None charges LRA Number None 2287 Protection OK Normal Cause A new Clear Alert 101 policy has been Informational protection policy has None changed been created existing Related protection policy has AJert 2384 been modified J LRA Action None Number None 2288 The patrolread OK Normal Cause This alert is for Clear Alert 751 has resumed Informational informational purposes Status Action None None Related Alert None LRA Number None Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2289 Multi bit ECC Critical Cause An error Clear Alert 754 error on Failure Error controller DIMM Storage Management Message Reference involving multiple bits has been encountered during a read or write operation The error correction algorithm recalculates parity data during read and write operations If an error involves only a single bit it may be possible for the error correction algorithm to correct the error and maintain parity data An error involving multiple bits however usually indicates data loss In some ca
58. Alert None OpenManage Server LRA Administrator Storage Number Management online 2060 f help for more information Storage Management Message Reference 123 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2149 Bad block Warning Cause A portion ofa Clear Alert 753 extended sense Non critical physical disk is None error damaged Related Action See the Dell Alert None OpenManage Server LRA Administrator Storage Number Management online 2060 help for more information 2150 Bad block Warning Cause A portion ofa Clear Alert 753 extended Non critical physical disk is None medium error damaged Related Action See the Dell Alert None OpenManage Server LRA Administrator Storage Number Management online 2060 help for more information 2151 Enclosureasset OK Normal Cause A user has Clear Alert 851 tag changed Informational changed the enclosure None asset tag This alert is for Related informational purposes Alert None Action None LRA Number None 2152 Enclosureasset OK Normal Cause A user has Clear Alert 851 name changed Informational changed the enclosure None asset name This alert is Related for informational Alert None purposes LRA Action None N mbe r None 124 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Descript
59. Alert 851 alarm disabled Informational disabled the enclosure Number alarm None Action None Related Alert Number None LRA Number None 2140 Dead disk OK Normal Cause Disk space that Clear Alert 1201 segments Informational was formerly dead or Number restored inaccessible to a None redundant virtual disk Related has been restored This Alert alert is for informational Number purposes None Action None LRA Number None 2141 Physical disk OK Normal Cause Portions of the Clear Alert 901 dead segments Informational physical disk were Number removed formerly inaccessible None The disk space from Related these dead segments Alert has been recovered and Number is now usable Any data None residing on these dead segments has been lost LRA This alert is for Number None Storage Management Message Reference informational purposes Action None 121 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2142 Controller OK Normal Cause A user has Clear Alert 751 rebuild rate Informational changed the controller Number has changed rebuild rate This alert is None for informational Related purposes Alert Action None Number None LRA Number None 2143 Controller OK Normal Cause A user has Clear Alert 751 alarm enabled Informational enabled the controller Number alarm This alert is for None i
60. Cause The battery may Clear Alert 1153 battery Non critical be recharging the room Number temperature is temperature may be too 2172 above normal hot or the fan in the Related system may be degraded Alert None or failed f LRA Action If this alert was Number generated due to a 2100 battery recharge the situation is corrected when the recharge is complete You should also check if the room temperature is normal and that the system components are functioning properly Storage Management Message Reference 135 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2172 The controller OK Normal Cause This alert is for Clear Alert 1151 battery Informational informational purposes Status temperature is Kati n None Alert 2172 normal is a clear alert for alert 2171 Related Alert None LRA Number None 2173 Unsupported Warning Cause An unsupported ClearAlert 853 configuration Non critical configuration was None detected The detected Related SCSI rates of Action Replace one of Alert None the enclosure the EMMs with the LRA management matching SCSI rate Nuib e modules EMM umber EMMs are i 2090 not the same EMM0 1 EMM1 2 136 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Inform
61. Clear Alert 751 Read operation Informational has started the Patrol Number has started Read operation This 2243 alert is for informational Related RUEP Oss Alert None Action None LRA Number None 2243 The Patrol OK Normal Cause The controller Clear Alert 751 Readoperation Informational has stopped the Patrol Status has stopped Read operation This Alert 2243 alert is for informational is a clear purposes alert for alert 2242 Related Alert None LRA Number None Action None 2244 Avirtualdisk OK Normal Cause This alert is for ClearAlert 1201 blink has been Informational informational purposes None initiated Action None Related Alert None LRA Number None 160 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2245 Avirtual disk OK Normal Cause This alert is for ClearAlert 1201 blink has Informational informational purposes None ceased Action None Related Alert None LRA Number None 2246 The controller Warning Cause The Clear Alert 1153 battery is Non critical temperature of the the None degraded battery is high This Related maybe due to the Alert None battery being charged LRA Action As the charge Number weakens the charger 2100 should automatically recharge the battery If the battery has reached its recharge limit re
62. Dell OpenManage Server Administrator Version 7 1 2 Messages Reference Guide Notes and Cautions K NOTE A NOTE indicates important information that helps you make better use of your computer VAN CAUTION A CAUTION indicates potential damage to hardware or loss of data if instructions are not followed Information in this document is subject to change without notice 2013 Dell Inc All rights reserved Reproduction of these materials in any manner whatsoever without the written permission of Dell Inc is strictly forbidden Trademarks used in this text Dell the DELL logo and OpenManage are trademarks of Dell Inc Microsoft Windows and Windows Server are either trademarks or registered trademarks of Microsoft Corporation in the United States and or other countries Red Hat Enterprise Linux and Enterprise Linux are registered trademarks of Red Hat Inc in the United States and or other countries SUSE is a trademark of Novell Inc in the United States and other countries Citrix Xen and XenServer are either registered trademarks or trademarks of Citrix Systems Inc in the United States and or other countries VMware is registered trademarks or trademarks of VMWare Inc in the United States or other countries Other trademarks and trade names may be used in this document to refer to either the entities claiming the marks and names or their products Dell Inc disclaims any proprietary interest in tra
63. ID Alert Trap Information Numbers 2340 The BGI com Critical Cause The BGI task Clear Alert 1204 pleted with Failure Error encountered errors that None uncorrectable cannot be corrected Related errors The virtual disk Alert None contains physical disks that have unusable disk LRA space or disk errors that Number cannot be corrected 2081 Action Replace the physical disk that contains the disk errors Review other alert messages to identify the physical disk that has errors If the virtual disk is redundant you can replace the physical disk and continue using the virtual disk If the virtual disk is non redundant you may need to recreate the virtual disk after replacing the physical disk After replacing the physical disk run Check Consistency to check the data 2341 The Check OK Normal Cause This alert is for Clear Alert 1201 Consistency Informational informational purposes None made Action None Related corrections and Alert None completed LRA Number None Storage Management Message Reference 205 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2342 The Check Warning Cause The dataona ClearAlert 1203 Consistency Non critical source disk and the None found redundant data on a Related inconsistent target disk is Alert parity data inconsistent Number Pan Action Restart the 2341 2343
64. Information after OS graceful shutdown restart event Comment string accompanying an operating system shutdown restart System Event OS stop Critical event runtime critical stop The operating system encountered a critical error and was stopped abnormally OEM Event data record Information after OS bugcheck event Operating system bugcheck code and paremeters A critical stop occurred Critical during OS load The operating system encountered a critical error and was stopped abnormally while loading 278 System Event Log Messages for IPMI Systems Table 4 14 Operating System Generated Events continued A runtime critical stop Critical The operating system occurred encountered a critical error and was stopped abnormally An OS graceful stop Information The operating system was occurred stopped An OS graceful shut down Information The operating system was occurred shutdown normally Cable Interconnect Events The cable interconnect messages in Table 4 15 are used for detecting errors in the hardware cabling Table 4 15 Cable Interconnect Events Description Severity Cause Cable sensor lt Name Critical This event is generated when Location gt the cable is not connected or Cone iooration Error was is incorrectly connected asserted Cable sensor lt Name Information This event is generated when Location gt the earlier cable conn
65. Linux and SUSE Linux Enterprise Server message log var log messages The text in boldface type indicates the message text 12 Introduction K NOTE These messages are typically displayed as one long line In the following example the message is displayed using line breaks to help you see the message text more clearly Feb 6 14 20 51 server01 Server Administrator Instrumentation Service EventID 1000 Server Administrator starting Feb 6 14 20 51 server01 Server Administrator Instrumentation Service EventID 1001 Server Administrator startup complete Feb 6 14 21 21 server01 Server Administrator Instrumentation Service EventID 1254 Chassis intrusion detected Sensor location Main chassis intrusion Chassis location Main System Chassis Previous state was OK Normal Chassis intrusion state Open Feb 6 14 21 51 server01 Server Administrator Instrumentation Service EventID 1252 Chassis intrusion returned to normal Sensor location Main chassis intrusion Chassis location Main System Chassis Previous state was Critical Failed Chassis intrusion state Closed Viewing Events in VMware ESX ESXi 1 Log in to the system running VMware ESX ESXi with VMware vSphere Client 2 Click View Administration System Logs 3 Select Server Log var log messages entry from the drop down list Viewing the Event Information The event log for each operating system contains some or all of the following inf
66. Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2293 The EMM has Critical Cause The failure may ClearAlert 854 failed Failure Error be caused by aloss of None power to the EMM Related The EMM self test may Alert None also have identified a failure There could also LRA be a firmware problem Number or a multi bit error 2091 Action Replace the EMM See the hardware documentation for information on replacing the EMM 2294 Adevicehas OK Normal Cause This alert is for Clear Alert 851 been inserted Informational informational purposes None Action None Related Alert None LRA Number None 2295 Adevicehas Critical Cause A device has Clear Alert 854 been removed Failure Error been removed andthe None system is no longer Related functioning in optimal Alert None condition LRA Action Replace the Number device 2091 Storage Management Message Reference 181 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2296 AnEMM has OK Normal Cause This alert is for ClearAlert 951 been inserted Informational informational purposes None Action None Related Alert None LRA Number None 2297 AnEMMhas Critical Cause An EMM has Clear Alert 954 been removed Failure Error be
67. None LRA Number 2060 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2188 The controller OK Normal Cause The controller ClearAlert 1151 write policy Informational battery is unable to None has been maintain cac hed data Related changed to for the required period Alert None Write of time For example if Through the required period of LRA time is 24 hours the Number battery is unable to None maintain cached data for 24 hours It is normal to receive this alert during the battery Learn cycle as the Learn cycle discharges the battery before recharging it When discharged the battery cannot maintain cached data Action Check the health of the battery If the battery is weak replace the battery pack 2189 The controller OK Normal Cause This alert is for Clear Alert 1151 write policy Informational informational purposes None has been Action None Related changed to Alert None Write Back LRA Number None Storage Management Message Reference 143 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2190 The controller OK Normal Cause This alert is for Clear Alert 751 has detected a Informational informational purposes None hot add of an Action None Related enclosure Alert None LRA Numb
68. Number offline The user may 2158 have manually put the Related physical disk offline Alert Action Perform a Number rescan You can also 2099 2196 select the offline disk TRA and perform a Make Number Online operation 2070 2051 Physical disk Warning Cause A physical disk Clear Alert 903 degraded Non critical has reported an error None condition and may be degraded The physical disk may have reported the error condition in response to a SMART Trip Predictive Failure Action Replace the degraded physical disk You can identify which disk is degraded by locating the disk that has a Yellow Triangle for its status Perform a rescan after replacing the disk Related Alert Number 2094 LRA Number 2070 Storage Management Message Reference 73 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2052 Physical disk OK Normal Cause This alert is for Clear Alert 901 inserted Informational informational purposes None Action None Related Alert Number 2065 2305 2367 LRA Number None 2053 Virtual disk OK Normal Cause This alert is for ClearAlert 1201 created Informational informational purposes None Action None Related Alert None LRA Number None 2054 Virtual disk Warning Cause A virtual disk Clear Alert 1203 deleted Non critical has been deleted None Performing a Reset
69. RA again Number None 2232 The controller Cause This alert is for ClearAlert 751 alarm is informational purposes None silenced Action None Related Alert None LRA Number None 2233 The Cause This alert is for Clear Alert 751 Background informational purposes None initialization Action None Related BGI rate has Al rt N n changed LRA Number None Storage Management Message Reference 157 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2234 The Patrol OK Normal Cause This alert is for Clear Alert 751 Read rate has Informational informational purposes None changed Action None Related Alert None LRA Number None 2235 The Check OK Normal Cause This alert is for ClearAlert 751 Consistency Informational informational purposes None rate has Action None Related changed Alert None LRA Number None 2236 Copyback OK Normal Cause This alert is for ClearAlert 751 modified Informational informational purposes None Action Change at least Related one controller property Alert None and run the command LRA again Number None 2237 Abort Check OK Normal Cause This alert is for Clear Alert 751 Consistencyon Informational informational purposes None Error Action Change at least Related modified one controller property Alert None property and run the command LRA again Number None
70. Related Configuration may Alert None detect that a virtual disk has been deleted LRA Number Action None 2080 2055 Virtual disk OK Normal Cause This alert is for ClearAlert 1201 configuration Informational informational purposes None changed Action None Related Alert None LRA Number None 74 l Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2056 Virtual disk Critical Cause One or more Clear Alert 1204 failed Failure Error physical disks included in None the virtual disk have Related failed If the virtual disk Alert is non redundant does Number not use mirrored or parity 7949 7949 data then the failure of 2050 2076 a single physical disk can 2079 2081 cause the virtual diskto 5499 7346 fail If the virtual disk is f redundant then more LRA physical disks have failed FE than can be rebuilt using mirrored or parity information Action Create a new virtual disk and restore from a backup Storage Management Message Reference 75 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action SNMP Trap Numbers Related Alert Information 2057 76 Virtual disk degraded Warning Non critical Cause 1 This alert message occurs when a physical disk included in a redundant virtual
71. Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2311 The firmware Warning Cause The firmware on Clear Alert 853 on the EMMs Non critical the EMM modules None is not the same is not the same version Related version It is required that both Alert None EMMO 1 modules have the same EMM1 2 version of the firmware This alert may be Number caused if you attempt to 2090 insert an EMM module that has a different firmware version than an existing module The l and 2 indicate a substitution variable The text for these substitution variables is displayed with the alert in the alert log and can vary depending on the situation Action Upgrade to the same version of the firmware on both EMM modules 2312 Apower supply Warning Cause The power Clear Alert 1003 in the Non critical supply has an AC Number enclosure has failure 2325 an AC failure Action Replace the Related power supply Alert Number 2122 2324 LRA Number 2090 190 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2313 Apower supply Warning Cause The power Clear Alert 1003 in the Non critical supply has a DC failure Number enclosure has a Action Replace the 2323 DC failure power supply Related
72. This alert is for Alert 2130 informational purposes is a clear Action None alert for alert 2127 Related Alert Number None LRA Number None 2131 Firmware Warning Cause The firmware on Clear Alert 753 version Non critical the controller is not a Number mismatch supported version None Action Install a Related supported version of the Alert firmware If you do not Number have a supported None version of the firmware LRA available you can N mber download it from 2060 support dell com or check with your support provider for information on how to obtain the most current firmware 116 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2132 Driver version Warning Cause The controller Clear Alert 753 mismatch Non critical driver is not a supported Number version None Action Install a Related supported version of the Alert driver If you do not Number have a supported driver None version available you RA can download it from Numb r support dell com or you 2060 can check with your support provider for information on how to obtain the most current driver 2135 Array Manager Warning Cause Storage Clear Alert 103 is installedon Non critical Management has been Number the system installed on a system None NOTE This is that Tasan Array Related not s
73. This alert is provided for informational LRA b purposes Tea is Action None 220 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2376 Attempted OK Normal Cause User is Clear Alert 751 import of Informational attempting to importa None Virtual Disk foreign virtual disk with Related with stale a stale physical disk Alert None physical disk This alert is provided for informational LRA purposes Number None Action None 2377 Attempted OK Normal Cause User is Clear Alert 751 import ofan Informational attempting toimport an None orphan drive orphan drive This alert Related is provided for Alert None informational purposes LRA Action None N mber None 2378 Attempted OK Normal Cause User is Clear Alert 751 import ofan Informational attempting toimport an None incompatible incompatible physical Related physical drive drive This alert is Alert None provided for informational purposes LRA Number Action None None 2379 An overflow of OK Normal Cause This alert is Clear Alert 751 the foreign Informational provided for None configuration informational purposes Related has occurred Action None Alert None You can import the foreign LRA configuration Number in multiple None attempts Storage Management Message Reference 221 Table 3 4 Storage Management Messages
74. U mismatch was asserted and the processor in use or vice versa Hdwr version err hardware Information This event is generated when an incompatibility BMC iDRAC earlier mismatch between the Firmware and CPU mismatch BMC and iDRAC firmware and was deasserted the processor is corrected SBE Log Disabled Critical This event is generated when correctable menory rri r the ECC single bit error rate is logging disabled was exceeded asserted CPU Protocol Err Critical This event is generated when Ese to the processor protocol enters a Horerecovarabie non recoverable state CPU Bus PERR Critical This event is generated when raneto t the processor bus PERR enters a Horerecoverabie non recoverable state CPU Init Err Critical This event is generated when transition es the processor initialization Aon recoverabi enters a non recoverable state CPU Machine Chk Critical This event is generated when Pet leer to the processor machine check Fie eer i eee enters a non recoverable state Logging Disabled Critical This event is generated when all all event logging disabled was asserted event logging is disabled System Event Log Messages for IPMI Systems 271 Table 4 12 BIOS Generated System Events continued Event Message Severity Cause LinkT FlexAddr Link Critical This event is generated when Tuning sensor device the PCI device option ROM for option ROM failed to a NIC does not support link suppor
75. Warning This event is generated when there is redundancy a memory failure in a RAID configured degraded memory configuration Memory RAID Critical This event is generated when redundancy redundancy lost is lost in a RAID configured memory configuration Memory RAID Information This event is generated when the redundancy regained System Event Log Messages for redundancy lost or degraded earlier is regained in a RAID configured memory configuration IPMI Systems Table 4 8 Memory Events continued Event Message Severity Cause Memory Mirrored Warning This event is generated when there is redundancy a memory failure in a mirrored degraded memory configuration Memory Mirrored redundancy lost Critical This event is generated when redundancy is lost in a mirrored memory configuration Memory Mirrored redundancy regained Information This event is generated when the redundancy lost or degraded earlier is regained in a mirrored memory configuration emory Spared redundancy degraded Warning This event is generated when there is a memory failure in a spared memory configuration emory Spared redundancy lost Critical This event is generated when redundancy is lost in a spared memory configuration emory Spared redundancy regained emory RAID is redundant emory RAID redundancy is lost Check memory device at location s lt DIMM number gt Memory RAID redundancy is
76. able 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2108 SMART Warning Cause A disk has Clear Alert 903 warning Non critical received a SMART alert Number predictive failure None The disk is likely to fail Related in the near future Alert Action Replace the Number disk that has received None the SMART alert If the pert LRA physical disk is a Number member of a 2070 non redundant virtual disk then back up the data before replacing the disk CAUTION Removing a physical disk that is included ina non redundant virtual disk causes the virtual disk to fail and may cause data loss 100 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2109 SMART Warning Cause A disk has Clear Alert 903 warning Non critical reached an Number temperature unacceptable None temperature and Related received a SMART alert 4 en predictive failure Number The disk is likely to fail None in the near future LRA Action 1 Determine Niinbex why the physical disk 2070 has reached an unacceptable temperature A variety of factors can cause the excessive temperature For example a fan may have failed the thermostat may be set too high or the room temperature may be too hot
77. ading gt If sensor type is discrete Discrete current state lt State gt in the specified system returned to a valid range after crossing a failure threshold The sensor location chassis location previous state and current sensor value information is provided Server Management Messages 33 Table 2 5 Current Sensor Messages continued Event Description Severity Cause ID 1203 Current sensor detected a Warning A current sensor warning value in the specified Sensor location lt Location in system exceeded elias tes its warning threshold Chassis location lt Name of The Sensor chassis gt location chassis Previous state was lt State gt location previous i d curr If sensor type is not discrete state anaeurrent sensor value Current sensor value in Amps are provided lt Reading gt OR Current sensor value in Watts lt Reading gt If sensor type is discrete Discrete current state lt State gt 1204 Current sensor detected a Error A current sensor failure value in the specified Sensor location lt Location in R chassis gt its failure threshold Chassis location lt Name of The sensor chassis gt location chassis Previous state was lt State gt location previous d curr If sensor type is not discrete state anoeurteni sensor value Current sensor value in Amps are provided lt Reading gt OR Current sensor value in Watts lt Reading gt If sensor type is discret
78. age Server Administrator Storage Management management information base MIB The SNMP 70 Storage Management Message Reference traps for these alerts use all of the SNMP trap variables For more information on SNMP support and the MIB see the Dell OpenManage SNMP Reference Guide To locate an alert scroll through the following table to find the alert number displayed on the Server Administrator Alert tab or search this file for the alert message text or number See Understanding Event Messages on page 8 for more information on severity levels For more information regarding alert descriptions and the appropriate corrective actions see the online help Table 3 4 Storage Management Messages Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2048 Device failed Critical Cause A storage Clear Alert 754 Failure Error component such as Number 804 a physical disk or an 2121 854 enclosure has failed Related 904 The failed component Alert 954 may have been Number 1004 identified by the 2095 2201 1054 controller while 2203 1104 performing a task such 1154 as a rescan or a check Local 1204 consistency Response f l Agent Action Replace the LRA failed component Naber You can identify which 495 296 disk has failed by 2071 2081 locating the disk that 2091 2101 has a red X for its status Perform a rescan after replacing the failed component Stora
79. an also view the event log using your operating system s event viewer Each operating system s event viewer accesses the applicable operating system event log 10 Introduction The location of the event log file depends on the operating system you are using On systems running the Microsoft Windows operating systems event messages are logged in the operating system event log and the Server Administrator event log K NOTE The Server Administrator event log file is named dcsys32 xml and is located in the lt install_path gt omsa log directory The default install_path is C Program Files Dell SysMgt On systems running the Red Hat Enterprise Linux SUSE Linux Enterprise Server Citrix XenServer VMware ESX and VMware ESXi operating systems the event messages are logged in the operating system log file and the Server Administrator event log K NOTE The default name of the operating system log file is var log messages and you can view the operating system log file using a text editor such as vi or emacs The Server Administrator event log file is named dcsys lt xx gt xml where xx is either 32 or 64 bit depending on the operating system In the Red Hat Enterprise Linux SUSE Linux Enterprise Server Citrix XenServer and VMware ESX operating systems the Server Administrator event log file is located in the opt dell srvadmin var log openmanage directory In the VMware ESXi operating system the Server Administrator event log fil
80. are You can download the current version of the driver and firmware from support dell com Rebooting the system may also resolve this problem Storage Management Message Reference 167 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2266 Controller log OK Normal Cause The 1 Clear Alert 751 801 file entry 1 Informational indicates a substitution None 851 901 variable The text for Related 951 this substitution Alert None 1991 variable is generated by 1051 the controller and is LRA 1101 displayed with the alert Number 1151 in the alert log This None 1201 text can vary depending on the situation This alert is for informational purposes Action None 2267 The controller OK Normal Cause This alert is for Clear Alert 751 reconstruct Informational informational purposes None rate has Action None Related changed Alert None LRA Number None 168 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2268 1 Storage Critical Cause Storage Clear Alert 104 Management Failure Error Management has lost None has lost communication with a Related communicatio controller This may Alert None n with the con occur if the controller troller An driver or firmware is LR
81. are was asserted System Event Log Messages for IPMI Systems is incompatible with the firmware 285 Table 4 19 Miscellaneous Events continued Hdwar version err Version Change sensor hardware incompatibility BMC firmware and CPU mismatch was asserted Critical This event is generated when the CPU and firmware are not compatible Link Tuning Version Change successful software or F W change was deasserted sensor Warning This event is generated when the link tuning setting for proper NIC operation fails to update Link Tuning Version Change sensor successful hardware change lt device slot number gt was deasserted Warning This event is generated when the link tuning setting for proper NIC operation fails to update LinkT FlexAddr Link Tuning sensor failed to program virtual MAC address Bus Device Function was asserted Critical This event is generated when Flex address can be programmed for this device LinkT FlexAddr Link Tuning sensor device option ROM failed to support link tuning or flex address Mezz lt location gt was asserted 286 Critical This event is generated when ROM does not support Flex address or link tuning System Event Log Messages for IPMI Systems Table 4 19 Miscellaneous Events continued LinkT FlexAddr Critical This event is generated when link tuning Li
82. ated following Alert None storage LRA device wwn Number 1 path 2 None FluidCache 2907 Cachinghas Information No action required Clear Alert 1201 been enabled None 1501 on the Related following Alert None storage LRA device wwn Number 1 path 2 None FluidCache 2908 The following Information No action required Clear Alert 901 cache device None has been Related disconnected Alert None wwn 1 LRA path 2 Number FluidCache None 240 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2909 The following Warning Service is required Clear Alert 1203 storage device Contact Dell Technical None 1503 is in an Support Related unknown Alert None state wwn 1 LRA path 2 Number FluidCache None 2910 Cachinghas Information No action required Clear Alert 1501 been disabled None for the Related following Alert None storage LRA device wwn Number 1 path 2 None FluidCache 2911 The following Information Service is required Clear Alert 1401 cached LUN Contact Dell Technical None has had a Support Related failure wwn Alert None 1 path 2 LRA FluidCache Number None 2912 Resilvering for Information No action required Clear Alert 901 the following None cache device is Related complete ww Alert None n 1 path LRA 2 Number
83. ated SNMP ID Alert Trap Information Numbers 2205 Adedicated OK Normal Cause The hot spare is Clear Alert 901 hot spare has Informational no longer required None been because the virtual disk Related automatically it was assigned to has Alert unassigned been deleted Number Action None 2098 2161 2196 LRA Number None 2206 The onlyhot Warning Cause The only Clear Alert 903 spare available Non critical physical disk available None is a SATA disk to be assigned asa hot Related SATA disks spare is using SATA Alert cannot replace technology The Number SAS disks physical disks in the None virtual disk are using SAS technology LRA Number Because of this difference in technology the hot spare cannot rebuild data if one of the physical disks in the virtual disk fails Action Add a SAS disk that is large enough to be used as the hot spare and assign it as a hot spare 2070 150 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2207 The only hot Warning Cause The only Clear Alert 903 spare available Non critical physical disk available None is a SAS disk to be assigned as ahot Related SAS disks spare is using SAS Alert None cannot replace technology The SATA disks physical disks in the LRA virtual disk are using Number SATA technology 207
84. ated due to processor internal error lt Processor Entity gt status Critical processor sensor Thermal The processor generates this event before it shuts down Trip because of excessive heat caused by lack of cooling or heat synchronization lt Processor Entity gt Information This event is generated when a Status processor sensor recovered from IERR processor recovers from the internal error lt Processor Entity gt status Warning processor sensor disabled This event is generated for all processors that are disabled System Event Log Messages for IPMI Systems 253 Table 4 4 Processor Status Events continued Event Message Severity Cause lt Processor Entity gt status Information This event is generated if the processor sensor terminator is missing on an terminator not present empty processor slot lt Processor Entity gt Critical This event is generated when the presence was deasserted system could not detect the processor lt Processor Entity gt Information This event is generated when the presence was asserted earlier processor detection error was corrected lt Processor Entity gt Information This event is generated when the thermal tripped processor has recovered from an was deasserted earlier thermal condition lt Processor Entity gt Critical This event is generated when the configuration error processor configuration is was asserted
85. ation Numbers 2174 The controller Warning Cause The controller Clear Alert 1153 battery has Non critical cannot communicate None been removed with the battery The Related battery may be Alert removed or the contact Number point between the 2188 2318 controller and the battery may be burnt or LRA corroded Number 2100 Action Replace the battery if it has been removed If the contact point between the battery and the controller is burnt or corroded you must replace either the battery or the controller or both See the hardware documentation for information on how to safely access remove and replace the battery 2175 The controller OK Normal Cause This alert is for ClearAlert 1151 battery has Informational informational purposes None been replaced Action None Related Alert None LRA Number None Storage Management Message Reference 137 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2176 The controller OK Normal Cause This alert is for Clear Alert 1151 battery Learn Informational informational purposes Number cycle has Action None 2177 started Related Alert None LRA Number None 2177 The controller OK Normal Cause This alert is for Clear Alert 1151 battery Learn Informational informational purposes Status cycle has Action None Alert 2177 completed is a clear alert for alert 2176
86. build Informational informational purposes Status completed Alert 2092 is a clear alert for alert 2065 Related Alert Number None LRA Number None Action None 90 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2094 Predictive Warning Cause The physical Clear Alert 903 Failure reported Non critical disk is predicted to fail Many physical disks contain Self Monitoring Analysis and Reporting Technology SMART When enabled SMART monitors the health of the disk based on indications such as the number of write operations that have been performed on the disk Action Replace the physical disk Even though the disk may not have failed yet it is strongly recommended that you replace the disk If this disk is part of a redundant virtual disk perform the Offline task on the disk replace the disk the rebuild starts automatically NOTE If you put the drive in a different slot you need to assign it as a hot spare for the rebuild to start automatically Number None Related Alert Number None LRA Number 2070 Storage Management Message Reference 91 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2094 If this disk is a hot
87. bus LRA technology You cannot Number protocols both SAS and SATA bpo use both gt a 2070 SATA SAS is physical disks in the not supported same virtual disk on the same Remove the physical virtual disk disk and insert a new physical disk that uses the correct technology If the rebuild does not start automatically after you have inserted a suitable physical disk then run the Rebuild task 2368 The SCSI OK Normal Cause This alert is for ClearAlert 851 Enclosure Informational informational purposes None Processor Action None Related SEP has been Alert rebooted as Number part of the 2049 2052 firmware 2162 2292 download operation and LRA is unavailable Number until the None operation completes 218 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2369 Virtual Disk OK Normal Cause A physical disk Clear Alert 1201 Redundancy Informational in a RAID 6 virtual disk Number has been has either failed or been 2121 degraded removed Related Action Replace the Alert missing or failed Number physical disk 2048 2049 2050 2076 2346 LRA Number None 2370 Redundant OK Normal Cause This alert is for Clear Alert 1201 Path View Informational informational purposes None cleared Action None Related Alert None LRA Number None 2371 Attempted OK Normal Cause
88. chassis Table 2 8 Power Supply Messages Event Description Severity Cause ID 1350 Power supply sensor has Error A power supply sensor failed Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt 42 Server Management Messages in the specified system failed The sensor location chassis location previous state power supply type additional power supply status and configuration error type information are provided Table 2 8 Power Supply Messages continued Event Description Severity Cause ID 1351 Power supply sensor value Information A power supply sensor unknown in the specified Sensor location lt Location eae in ehassis gt obtain a rea ing The sensor location Chassis location lt Name of chassis location chassis gt previous state power Previous state was lt State gt supply type Power Happily types Stype of additional power supply status and power supply gt A configuration error lt Additional power supply type information status information gt are provided If in configuration error state Configuration error type lt type of configuration error gt 1352 Power supply returned to Information A power
89. cond spare then unassign the hot spare perform the Prepare to Remove task on the disk replace the disk and assign the new disk as a hot spare CAUTION If this disk is part of a non redundant disk back up your data immediately If the disk fails you cannot recover the data 2095 SCSI sense OK Normal Cause A SCSI device Clear Alert 751 851 data 1 Informational experienced an error Number 901 but may have recovered None Action None Related Alert Number 2273 LRA Number None 2098 Global hot OK Normal Cause A user has Clear Alert 901 spare assigned Informational assigned a physical disk Number as a global hot spare None This alert is for Related informational purposes AJert Action None Number 2277 LRA Number None 92 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2099 Global hot OK Normal Cause A physical disk Clear Alert 901 spare Informational that was assigned asa Number unassigned hot spare has been None unassigned and is no Related longer functioningasa Alert hot spare The physical Number disk may have been None unassigned by a user or automatically LRA unassigned by Storage Number None Management Storage Management unassigns hot spares that have been used to rebuild data Once data is rebuilt the hot spare become
90. ction Related SNMP Alert Trap Information Numbers 2335 202 Controller event log 1 Warning Non critical Cause The 1 indicates a substitution variable The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log This text is from events in the controller event log that were generated while Storage Management was not running This text can vary depending on the situation Action If there is a problem review the controller event log and the Server Administrator alert log for significant events or alerts that may assist in diagnosing the problem Check the health of the storage components See the hardware documentation for more information Storage Management Message Reference Clear Alert 753 None Related Alert None LRA Number 2060 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2336 Controller Critical Cause The 1 Clear Alert 754 event log 1 Failure Error Storage Management Message Reference indicates a substitution variable The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log This text is from events in the controller event log that were generated while Storage Management was not running This text can vary depending on the s
91. ction Related SNMP ID Alert Trap Information Numbers 2168 Thenon RAID Waring Cause The version of ClearAlert 103 SCSI driver Non critical the driver does not None version is older meet the minimum Related than the requirements Storage Alert None minimum Management may not required level be able to display the LRA See readme txt storage or perform Number for the storage management 2050 validated functions until you have driver version updated the system to meet the minimum requirements Action See the Readme file for the validated driver version Update the system to meet the minimum requirements and then reinstall Storage Management 2169 The controller Critical Cause The controller Clear Alert 1154 battery needs Failure Error battery cannot be None to be replaced recharged The battery Related may be old or it may Alert have been already Nimber recharged the 2118 maximum number of times In addition the LRA battery charger may not Number be working 2101 Action Replace the battery pack 134 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2170 The controller OK Normal Cause This alert is for ClearAlert 1151 battery charge Informational informational purposes None level is normal Action None Related Alert None LRA Number None 2171 The controller Warning
92. cts a low battery condition This event is generated when an earlier battery condition was corrected This event is generated when the sensor detects a failed or missing battery 280 System Event Log Messages for IPMI Systems Power And Performance Events The power and performance events are used to detect degradation in system performance with change in power supply Table 4 17 Power And Performance Events Description Severity Cause System Board Power Normal This event is generated when Optimized system performance was Performance status restored sensor for System Board degraded lt description of why gt was deasserted System Board Power Warning This event is generated when Optimized change in power supply Performance status degrades system sensor for System performance Board degraded lt description of why gt was asserted System Board Power Warning This event is generated when Optimized change in power supply Performance status degrades system sensor for System performance Board degraded power capacity changed was asserted System Board Power Normal This event is generated when Optimized Performance status sensor for System Board degraded power capacity changed was deasserted the system performance is restored System Event Log Messages for IPMI Systems 281 Table 4 17 Power And Performance Events continued Description Se
93. d Event Description Severity Cause 1454 Fan enclosure removed Error from system for an extended amount of time Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt A fan enclosure has been removed from the specified system for a user definable length of time The sensor and chassis location information is provided 1455 Fan enclosure sensor Error A fan enclosure sensor in the detected a non specified system detected an recoverable valu error from which it cannot sensor location par ciation and a ene PRT ae NEES chassis location are provided Chassis location lt Name of chassis gt 48 Server Management Messages AC Power Cord Messages The AC power cord messages listed in Table 2 11 provide status and warning information for power cords that are part of an AC power switch if your system supports AC switching Table 2 11 AC Power Cord Messages Event Description Severity Cause ID 1500 AC power cord sensor Critical An AC power cord sensor in has failed Failure the specified system failed Sansor Tae a aia Error The oe nee lt Location in chassis gt cannot be monitore l The sensor and chassis Chassis location location information is lt Name of chassis gt provided 1501 AC power cord is not Information The AC power cord status is being monitored not being monitored eee eee eee This e when a system s lt Location in chassis gt exp
94. d an Intel Trusted a Technology TXT error during POST Execution Critical This event is generated when TXT Post failed SINIT Authenticat Code Module detected an Intel Trusted Execution Technology TXT error at boot ed Critical This event is generated when the Authenticated Code Module detected a TXT initialization failure Intel Trusted Information This event is generated when the TXT Execution returned from a previous failure Technology TXT is operating correctly Failure detected on Critical This event is generated when the SD Removable Flash card module is installed but improperly edia lt name gt configured or failed to initialize Removable Flash Warning This event is generated when the module edia lt name gt is is write protected Changes may not be write protected written to the media Internal Dual SD Information This event is generated when both SD odule is cards are functioning properly redundant Internal Dual SD Critical This event is generated when either one is 16st odule redundancy of the SD cards or both the SD cards are not functioning properly 288 System Event Log Messages for IPMI Systems Index A AC power cord messages 49 AC power cord sensor 9 AC power cord sensor has failed 265 Asset name changed 124 Asset tag changed 124 Background initialization 115 Bad bloc
95. ded removed or failed lt Entity Name gt PS Critical Power supply redundancy is lost Redundancy if only one power supply is sensor redundancy lost functional lt Entity Name gt PS Information This event is generated if the Redundancy power supply has been sensor redundancy reconnected or replaced regained lt Power Supply Sensor Critical This event is generated when the Name gt predictive failure power supply is about to fail was asserted lt Power Supply Sensor Critical This event is generated when the Name gt input lost was power supply is unplugged asserted lt Power Supply Sensor Information This event is generated when the Name gt predictive failure power supply has recovered from was deasserted an earlier predictive failure event lt Power Supply Sensor Information This event is generated when the Name gt input lost was power supply is plugged in deasserted PS 1 Status Power supply Information This event is generated when the sensor for PS 1 presence power supply is plugged in was asserted PS 1 Status Power supply Critical This event is generated when the sensor for PS 1 was deasserted presence power supply is removed System Event Log Messages for IPMI Systems Table 4 5 Power Supply Events continued Event Message Severity Cause PS 1 Status Power supply Critical This event is generated when the sensor for PS 1 failure power supply has failed was asserted PS
96. degraded Check memory device at location s lt DIMM number gt Memory is not redundant Information This event is generated when the redundancy lost or degraded earlier is regained in a spared memory configuration Information This event is generated when the memory redundancy mode has change to RAID redundant Critical This event is generated when redundancy is lost in a RAID configured memory configuration Warning This event is generated when there is a memory failure in a RAID configured memory configuration Information This event is generated when the memory redundancy mode has change to non redundant System Event Log Messages for IPMI Systems 263 Table 4 8 Memory Events continued Event Message Severity Cause Memory mirror is redundant Memory mirror redundancy is lost Check memory device at location s number gt lt DIMM Memory mirror redundancy is degraded Check memory device at location lt DIMM number gt Memory spare is redundant Memory spare redundancy is lost Check memory device at location lt DIMM number gt Memory spare redundancy is degraded Check memory device at location lt DIMM number gt Information Critical Warning Information Critical Warning This event is generated when the memory redundancy mode has change to mirror redundant This event is generated when redundancy is lost in a mirror configured memory co
97. demarks and trade names other than its own 2013 03 Contents 1 Introduction naaa 7 What s NewinthisRelease 8 Messages Not Described in This Guide 8 Understanding EventMessages 8 Sample Event Message Text 10 Viewing Alerts and EventMessages 10 Logging Messages to a Unicode Text File 11 Viewing Events in Microsoft Windows Server 2008 oana aaa 12 Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server 12 Viewing Events in VMware ESX ESXi 13 Viewing the Event Information 13 Understanding the Event Description 14 2 Server Management Messages 19 Server Administrator General Messages 19 Temperature Sensor Messages 22 Cooling Device Messages 26 Voltage Sensor Messages 29 Current Sensor Messages 32 Contents Chassis IntrusionMessages 35 Redundancy UnitMessages 38 Power Supply Messages 42 Memory Device Messages 46 Fan Enclosure Messages 47 AC Power Cord Messages 49 Hardware Log Sensor Messages 50 Processor Sensor Messages 52 Pluggable Device Messages 55 Battery Sensor Messages 57 Secure Digital SD Card Device Messages 59 Chassis Management Controller Messages
98. disk included in the virtual disk has failed or a user has cancelled the initialization Action If a physical disk has failed then replace the physical disk Clear Alert 1204 Number None Related Alert Number None LRA Number 2081 Storage Management Message Reference 83 Table 3 4 Storage Management Messages continued 84 Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2080 Physical disk Critical Cause The physical Clear Alert 904 initialization Failure Error disk has failed or is not Number failed functioning None Action Replace the Related failed or non functional Alert disk You can identify a Number disk that has failed by None locating the disk that LRA has a red X for its Number status Restart the 2071 initialization 2081 Virtual disk Critical Hardware RAID Clear Alert 1204 reconfiguratio Failure Error Cause A physical disk Number n failed included in the virtual None disk has failed or is not Related functioning A user may Alert also have cancelled Number the reconfiguration None Action Replace the LRA failed or non functional Number disk You can identifya 2081 disk that has failed by locating the disk that displays a red X in the status field If the physical disk is part of a redundant array then rebuild the physical disk When finished restart the reconfiguration Storage Management Messag
99. disk to fail and may cause data loss If the disk is part of a redundant virtual disk then any data residing on the corrupt portion of the disk is reallocated elsewhere in the virtual disk Storage Management Message Reference Clear Alert 903 Number None Related Alert Number None LRA Number None Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action ID Related SNMP Alert Trap formation Numbers 2127 Background OK Normal Cause BGI of a virtual initialization Informational disk has started This BGI started alert is for informational purposes Action None C lear Alert 1201 Status 2130 2128 BGI cancelled OK Normal Cause BGI of a virtual Informational disk has been cancelled A user or the firmware C N N lear Alert 1201 umber one may have stopped BGI Related Action None N N L N Z Alert umber one RA umber one 2129 BGI failed Critical Cause BGI of a virtual Failure Error disk has failed Action None C N N lear Alert 1204 umber one Related Storage Management Message Reference 115 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2130 BGI completed OK Normal Cause BGI of a virtual Clear Alert 1201 Informational disk has completed Number
100. disk which None cache is connected to the Related controller Alert None Action Check for LRA foreign configuration Number and import if any Check for cable fault Recover any virtual disk lost by the controller None 164 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description ID Cause and Action Related Alert Information SNMP Trap Numbers 2259 An enclosure blink operation has initiated Cause This alert is for informational purposes Action None Clear Alert Number 2260 Related Alert None LRA Number None 851 2260 An enclosure blink has ceased Cause This alert is for informational purposes None Clear Alert None Related Alert None LRA Number None 851 2261 A global rescan has initiated Cause This alert is for informational purposes Action None Clear Alert None Related Alert None LRA Number None 751 2262 SMART thermal shutdown is enabled Cause This alert is for informational purposes Action None Clear Alert None Related Alert None LRA Number None Storage Management Message Reference 101 165 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2263 SMART OK Normal Cause This alert is for ClearAlert 101 thermal Informa
101. e Discrete current state lt State gt 34 Server Management Messages Table 2 5 Current Sensor Messages continued Event Description Severity Cause ID 1205 Current sensor detected a Error A current sensor non recoverable valu Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Current sensor value in Amps lt Reading gt OR Current sensor value in Watts lt Reading gt If sensor type is discrete Discrete current state lt State gt in the specified system detected an error from which it cannot recover The sensor location chassis location previous state and current sensor value are provided Chassis Intrusion Messages The chassis intrusion messages listed in Table 2 6 are a security measure Chassis intrusion means that someone is opening the cover to a system s chassis Alerts are sent to prevent unauthorized removal of parts from a chassis Server Management Messages _ 35 Table 2 6 Chassis Intrusion Messages Event Description Severity Cause ID 1250 Chassis intrusion Error A chassis intrusion sensor sensor has failed in the specified system Sensor location o a aa lt UOCAtL OR in chassiss ocation chassis location l l previous state and Chassis location lt Name chassis intrusion state of chassis gt are provided Previous state was lt
102. e lt Reading gt Specifies the temperature in degrees Celsius for example Temperature sensor value Celsius 30 in degrees Voltage sensor value in Volts lt Reading gt Specifies the voltage sensor value in volts for example Voltage sensor value in Volts 1 693 Introduction 17 18 Introduction Server Management Messages The following tables lists in numerical order each event ID and its corresponding description along with its severity and cause K NOTE For corrective actions see the appropriate documentation Server Administrator General Messages The messages in Table 2 1 indicate that certain alert systems are up and working Table 2 1 Server Administrator General Messages Event Description Severity Cause ID 0000 Log was cleared Information User cleared the log from Server Administrator This operation does not clear the operating system event log Therefore this event is not logged in the operating system event log This is logged in the OpenManage System Administrator alert log 0001 Log backup created Information The log was full copied to backup and cleared 1000 Server Administrator Information Server Administrator is starting beginning to initialize 1001 Server Administrator Information Server Administrator startup complete completed initialization 1002 A system BIOS update Information The user has chosen to update has been scheduled for the next reb
103. e Number This alert may be None caused when a user attempts to insert an LRA EMM module that has Number 2090 Storage Management Message Reference a different firmware version than an existing module Action Download the same version of the firmware to both EMM modules 107 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2121 Device OK Normal Cause A device that Clear Alert 752 returned to Informational was previously in an Status 802 normal error state has returned Alert 2121 852 to anormal state For is a clear 902 example if an enclosure alert for 952 became too hot and alert 2048 1002 subsequently cooled Related 1052 down you may receive Ajert 1102 this alert This alert is Number 1152 for informational 2050 2065 1202 purposes 2158 7 Action None LRA Number None 108 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2122 Redundancy Warning Cause One or more of Clear Alert 1305 degraded Non critical the enclosure Status components has failed 2124 For example a fan or Related power supply may have Alert failed Although the Number enclosure is currently 2048 operational the failure of 7 pa additional components Number could cause th
104. e capacity specified system is near or at the Log typer crog types capacity of the hardware log The log type information is provided 1554 Log size is full Error The size of a hardware log on eG pert as Epas the specified system is full The log type information is provided 1555 Log sensor has failed Error A hardware log sensor in the toa typi eee es specified system failed The hardware log status cannot be monitored The log type information is provided Server Management Messages 51 Processor Sensor Messages The processor sensors monitor how well a processor is functioning Processor messages listed in Table 2 13 provide status and warning information for processors in a particular chassis Table 2 13 Processor Sensor Messages Event Description Severity Cause ID 1600 Processor sensor has Critical A processor sensor in the failed Failure specified system is not Error functioning The sensor Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt location chassis location previous state and processor sensor status information is provided 1601 Processor sensor value Critical unknown Failure Error Sensor Location 9 lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt
105. e 2090 Storage Management Message Reference enclosure to fail Action Identify and replace the failed component To identify the failed component select the enclosure in the tree view and click the Health subtab Any failed component is identified with a red X on the enclosure s Health subtab Alternatively you can select the Storage object and click the Health subtab 109 Table 3 4 Storage Management Messages continued Event ID Cause and Action Related SNMP Alert Trap Information Numbers 110 2122 contd The controller status displayed on the Health subtab indicates whether a controller has a Failed or Degraded component See the enclosure documentation for information on replacing enclosure components and for other diagnostic information Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2123 Redundancy Waming Cause A virtual disk or Clear Alert 1306 lost Non critical an enclosure has lost Number data redundancy In the 2124 case of a virtual disk Related one or more physical AJert disks included in the N mb t virtual disk have failed 2048 2049 Due to the failed 2057 physical disk or disks the virtual disk is no LRA longer maintaining Number redundant mirrored or 2080 2090 parity data The failure of an additional
106. e Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2081 Software RAID contd e Perform a backup with the Verify option e If the file backup fails try to restore the failed file from a previous backup e When the backup with the Verify option is complete without any errors delete the Virtual Disk e Recreate a new Virtual Disk with new drives e Restore the data from backup 2082 Virtual disk Critical Cause A physical disk Clear Alert 1204 rebuild failed Failure Error included in the virtual Number disk has failed or isnot None functioning A user may Related also have cancelled Alert the rebuild Number Action Replace the 2048 failed or non functional RA disk You can identity a Number disk that has failed by 2081 locating the disk that has a red X for its status Restart the virtual disk rebuild Storage Management Message Reference 85 Table 3 4 Storage Management Messages continued 86 Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2083 Physical disk Critical Cause A physical disk Clear Alert 904 rebuild failed Failure Error included in the virtual Number disk has failed or is not None functioning A user may Related also have cancelled the Alert rebuild Nomber Action Replace the None
107. e a long time The time it takes depends on the size of the physical disk or the virtual disk 80 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2070 Virtual disk OK Normal Cause The virtual disk Clear Alert 1201 initialization Informational initialization cancelled Number cancelled because a physical disk None included in the virtual Related disk has failed or Alert because a user cancelled N mb r the virtual disk None initialization LRA Action If a physical N mber disk failed then replace None the physical disk You can identify which disk has failed by locating the disk that has a red X for its status Perform a rescan after replacing the disk Restart the format physical disk operation Restart the virtual disk initialization 2074 Physical disk OK Normal Cause The user has Clear Alert 901 rebuild Informational cancelled the rebuild Number cancelled operation None Action Restart the Related rebuild operation Alert Number None LRA Number None Storage Management Message Reference 81 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2075 Copy ofdata OK Normal Cause This alert is Clear Alert 1201 completed Informati
108. e is located in the etc cim dell srvadmin log openmanage directory Logging Messages to a Unicode Text File Logging messages to a Unicode text file is optional By default the feature is disabled in the Server Administrator To enable this feature modify the Event Manager section of the dceemdy lt xx gt ini configuration file where xx is 32 or 64 bit depending on the operating system as follows On systems running Microsoft Windows operating systems you can locate the configuration file in the lt install_path gt dataeng ini directory and set the property UnitextLog enabled true The default install_path is C Program Files Dell SysMgt Restart the DSM SA Event Manager service to enable the setting The Server Administrator Unicode text event log file is named desys32 log and is located in the lt install_path gt omsa log directory On systems running the Red Hat Enterprise Linux SUSE Linux Enterprise Server Citrix XenServer and VMware ESX operating systems you can locate the configuration file in the opt dell srvadmin etc Introduction 11 srvadmin deng ini directory and set the property UnitextLog enabled true Run the etc init d dataeng restart command to restart the Server Administrator Event Manager service and enable the setting This also restarts the Server Administrator Data Manager and SNMP services The Server Administrator Unicode text event log file is named desys lt xx gt log where xx is 32 or 64 bit depending
109. e memory module event cause Single bit warning error rate exceeded Single bit error logging disabled Power Supply type lt type of power supply gt Specifies the type of power supply for example Power Supply type VRM state was lt State gt Previous redundancy Specifies the status of the previous redundancy message for example Previous redundancy state was Lost Previous state was lt State gt Specifies the previous state of the sensor for example Previous state was OK Normal Processor sensor status lt status gt Specifies the status of the processor sensor for example Processor sensor status error Configuration Redundancy unit lt Redundancy location in chassis gt 16 Introduction Specifies the location of the redundant power supply or cooling unit in the chassis for example Redundancy unit Fan Enclosure Table 1 2 Event Description Reference continued Description Line Item Explanation SD card device type lt Type of SD card device gt Specifies the type of SD card device for example SD card device typ Hypervisor SD card state lt State of SD card gt Specifies the state of the SD card for example SD card state Present Active Sensor location lt Location in chassis gt Specifies the location of the sensor in the specified chassis for example Sensor location CPU1 Temperature sensor valu
110. e the appropriate documentation Temperature Sensor Events The temperature sensor event messages help protect critical components by alerting the systems management console when the temperature rises inside the chassis These event messages use additional variables such as sensor location chassis location previous state and temperature sensor value or state Table 4 1 Temperature Sensor Events Event Message Severity Cause lt Sensor Name Location gt Critical Temperature of the backplane temperature sensor board system board or the carrier detected a failure in the specified system lt Sensor lt Reading gt where lt Sensor Name Location gt exceeded the Name Location gt is the critical threshold entity that this sensor is monitoring For example PROC Temp or Planar Temp Reading is specified in degree Celsius For example 100 C lt Sensor Name Location gt Warning Temperature of the backplane temperature sensor detected a warning lt Reading gt board system board or the carrier in the specified system lt Sensor Name Location gt exceeded the non critical threshold System Event Log Messages for IPMI Systems 247 248 Table 4 1 Temperature Sensor Events continued Event Message Severity Cause lt Sensor Name Location gt Warning Temperature of the backplane temperature sensor board system board or the carrier returned to warning state in the specified
111. ecte C powel configuration is set to Chassis Locatrons nonredundant The sensor lt Name of chassis gt and chassis location information is provided 1502 AC power has been Information Power is restored in an AC restored Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt power cord that did not have AC power The sensor and chassis location information is provided Server Management Messages 49 Table 2 11 AC Power Cord Messages continued Event Description Severity Cause 1503 AC power has been lost Critical Sensor location Failure lt Location in chassis gt Error Chassis location lt Name of chassis gt Power supply is disrupted to the AC power cord or an AC power cord is not transmitting power but there is sufficient redundancy to classify this as a warning The sensor and chassis location information is provided 1504 AC power has been lost Error Power supply is disrupted to Sensor location the AC power cord or an AC lt Location in chassis gt power cord is not transmitting Chassis location power and lack of redundancy Ndama SE chassiss requires this to be classified as an error The sensor and chassis location information is provided 1505 AC power has been lost Error An AC power cord sensor in Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt the specified system failed The AC po
112. ection Connection was asserted error was corrected The lt name gt cable or Critical This event is generated when interconnect is not the named cable or connected or is improperly interconnect is not connected connected or is incorrectly connected The lt name gt cable or Information This event is generated when interconnect is connected named cable or interconnect earlier cable or interconnect connection error was corrected System Event Log Messages for IPMI Systems 279 Battery Events Table 4 16 Battery Events Description Severity Cause lt Battery sensor Name Critical This event is generated when Location gt the sensor detects a failed or Failed was asserted pansing battery lt Battery sensor Name Information This event is generated when Location gt the earlier failed battery was Failed was deasserted corrected lt Battery sensor Name Warning This event is generated when Location gt the sensor detects a low battery is low was asserted condition lt Battery sensor Name Information This event is generated when Location gt is low was deasserted The lt Battery sensor Name Warning Location gt battery is low The lt Battery sensor Name Location gt battery is operating normally The lt Battery sensor Name Location gt battery has failed Information Critical the earlier low battery condition was corrected This event is generated when the sensor dete
113. ed LRA policy is Action Reassign the Number violated for the number of hot spares as None Virtual Disk specified in the protection policy for that RAID level Storage Management Message Reference 223 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2384 The Warning Waming Cause The number of ClearAlert 1203 level set forthe Non critical physical disks you 2195 hot spare specified for the hot Related protection spare protection policy Alert None pon gt is violated LRA AA Action Reassign the Number Virtual Disk number of hot spares as None specified in the protection policy for that RAID level 2385 The Critical Critical Cause The number of ClearAlert 1204 level set forthe Failure Error physical disks you 2195 hot spare specified for the hot Related protection spare protection policy Alert None lievi ae ane for the N n ae Virtual Disk Action Reassign the Number number of hot spares as None specified in the protection policy for that RAID level 2386 The drive Warning Cause The assignment Clear Alert 901 could not be Non critical of a Dedicated Hot 2195 assigned as a Spare fails as the disk is Related Dedicated Hot invalid Alert None Spare Action None LRA Number None 224 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event
114. en removed None Action Reinsert the Related EMM See the Alert None hardware documentatio LRA n for information Number on replacing the EMM 209 2298 The enclosure Warning Cause The enclosure Clear Alert 853 has a bad Non critical has a bad sensor The None sensor 1 enclosure sensors Related monitor the fan speeds a jet None temperature probes and so on The LRA lindicates a Number substitution variable 2090 The text for this substitution variable is displayed with the alerts in the alert log and can vary depending on the situation Action See the hardware documentation for more information 182 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2299 Bad PHY 1 Critical Cause There is a Clear Alert 854 Failure Error problem with a physical None connection or PHY The Related 1 indicates a Alert None substitution variable The text for this LRA Number substitution variable is displayed with the alert 2091 in the alert log and can vary depending on the situation Action Contact Dell technical support Storage Management Message Reference 183 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2300 The enclosure Critical is uns
115. ent Message Reference 227 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2396 The Check Critical Cause The Check Clear Alert 1204 Consistency Failure Error Consistency task None detected detects uncorrectable Related uncorrectable multiple errors Alert None multiple Action Replace the LRA medium errors failed physical disk You Number can identify the failed None disk by locating the disk that has a red X for its status Rebuild the physical disk When finished restart the check consistency operation 2397 The Check Critical Cause The Check Clear Alert 1204 Consistency Failure Error Consistency task None completed detected uncorrectable Related with multiple errors Alert None uncorrectable Action Replace the LRA SONS failed physical disk You Number can identify the failed None disk by locating the disk that has a red X for its status Rebuild the physical disk When finished restart the check consistency operation 2398 The Manage OK Normal Cause The Manage Clear Alert 751 Physical Disk Informational Physical Disk Power None Power properties are changed Related property s Action None Alert None changed LRA Number None 228 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related
116. ent documentation An example of such an alert is shown for alert 2334 in Table 3 1 Table 3 1 Alert Message Format Alert ID Message Text Displayed inthe Message Text Displayed in the Alert Log with Storage Management Service Variable Information Supplied Documentation 2127 Background Initialization Background Initialization started Virtual started Disk 3 Virtual Disk 3 Controller 1 PERC 5 E Adapter 2334 Controller event log Controller event log Current capacity of the battery is above threshold Controller 1 PERC 5 E Adapter The variables required to complete the message vary depending on the type of storage object and whether the storage object is in a SCSI or SAS configuration The following table identifies the possible variables used to identify each storage object K NOTE Some alert messages relating to an enclosure or an enclosure component such as a fan or EMM are generated by the controller when the enclosure or enclosure component ID cannot be determined 66 Storage Management Message Reference K NOTE A B C and X Y Z in the following examples are variables representing the storage object name or number Table 3 2 Message Format with Variables for Each Storage Object Storage Object Message Variables Controller Message Format Controller A Name Message Format Controller A For example 2326 A foreign configuration has been detected Controller 1 PERC 5 E Adapter NOTE
117. er None 2191 Multiple Critical Cause There are too ClearAlert 854 enclosures are Failure Error many enclosures None attached to the attached to the Related controller This controller port When jet is an the enclosure limit is Number unsupported exceeded the controller 377 configuration loses contact with all enclosures attached to LRA the port Number 091 Action Remove the last enclosure You must remove the enclosure that has been added last and is causing the enclosure limit to exceed 144 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2192 The virtual OK Normal Cause The virtual disk Clear Alert 1203 disk Check Informational Check Consistency has None Consistency identified errors and Related has made made corrections For Alert None correctionsand example the Check l completed Consistency may have LRA encountered a bad disk Number block and remapped the None disk block to restore data consistency This alert is for informational purposes Action None As a precaution monitor the alert log for other errors related to this virtual disk If problems persist contact Dell Technical Support 2193 The virtual OK Normal Cause This alert is for Clear Alert 1201 disk Informational informational purposes None reconfiguratio Action None Related n has
118. erheating CPU lt number gt is absent Critical This event is generated when the system could not detect the processor CPU lt number gt is operating Information This event is generated when the correctly processor recovered from an error CPU lt number gt is configured Information The specified CPU is configured correctly correctly Power Supply Events The power supply sensors monitor the functionality of the power supplies These messages provide status and warning information for power supplies for a particular system Table 4 5 Power Supply Events Event Message Severity Cause lt Power Supply Sensor Critical This event is generated when the Name gt power supply sensor power supply sensor is removed removed lt Power Supply Sensor Information This event is generated when the Name gt power supply sensor power supply has been replaced AC recovered System Event Log Messages for IPMI Systems 255 256 Table 4 5 Power Supply Events continued Event Message Severity Cause lt Power Supply Sensor Information This event is generated when the Name gt power supply sensor power supply that failed or returned to normal state removed was replaced and the state has returned to normal lt Entity Name gt PS Information Power supply redundancy Redundancy is degraded if one of the sensor redundancy power supply sources is degra
119. erity Description Cause and Action Related SNMP Alert Trap Information Numbers 2356 SAS SMP Critical communicatio Failure Error ns error 1 214 Cause The text for this alert is generated by the firmware and can vary depending on the situation The reference to SMP in this text refers to SAS Management Protocol Action There may be a SAS topology error See the hardware documentation for information on correct SAS topology configurations There may be problems with the cables such as a loose connection or an invalid cabling configuration See the Cables Attached Correctly section for more information on checking the cables See the hardware documentation for information on correct cabling configurations Verify that the firmware is a supported version Storage Management Message Reference Clear Alert 754 None Related Alert None LRA Number 2061 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2357 SAS expander Critical Cause The 1 Clear Alert 754 error 1 Failure Error indicates a substitution None variable The text for Related this substitution Alert None variable is generated by the firmware and is LRA displayed with the alert Number in the alert log This 2061 text can vary depending on the situation Action There may be a problem with the enclosure Chec
120. ert None moved from another controller These LRA physical disks contain Number None virtual disks that were created on the other controller See the Import Foreign Configuration and Clear Foreign Configuration section in the Dell OpenManage Server Administrator Storage Management User s Guide for more information Action None Storage Management Message Reference 197 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2327 The NVRAM Warning Cause The nonvolatile Clear Alert 753 has corrupted Non critical random access memory None data The NVRAM is corrupt Related controller is This may occur after a Alert reinitializing power surge a battery Number the NVRAM failure or for other 2266 reasons The controller is reinitializing the LRA NVRAM The controller Number properties reset to the 2060 default settings after the reinitialization is complete None The controller is taking the required corrective action If this alert is generated often such as during each reboot replace the controller 2328 The NVRAM Waming Cause The NVRAM Clear Alert 753 has corrupt Non critical has corrupt data The None data controller is unable to Related correct the situation Alert None Action Replace the LRA controller N mber 2060 198 Storage Management Message Reference Table 3 4 Stora
121. failed or non functional LRA disk You can identify a Number disk that has failed by 207 locating the disk that has a red X for its status Rebuild the virtual disk rebuild 2085 Virtual disk OK Normal Cause This alert is for Clear Alert 1201 check Informational informational purposes Status consistency Actoni None Alert 2085 completed is a clear alert for alert 2058 Related Alert Number None LRA Number None Storage Management Message Reference Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2086 Virtual disk OK Normal Cause This alert is for format Informational informational purposes completed Action None Clear Alert 1201 Status Alert 2086 is a clear alert for alert 2059 Related Alert Number None LRA Number None 2087 Copy of data OK Normal Cause This alert is for resumed from Informational informational purposes physical disk 2 to physical disk 1 Action None Clear Alert 901 Status None Related Alert Number 2060 LRA Number None Storage Management Message Reference 87 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2088 Virtual disk initialization completed OK Normal Cause This alert is for
122. ference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2901 The following Error If the device is Clear Alert 1204 storage device inaccessible restore None 1504 is either connectivity If the Related inaccessible or device has failed Alert None failed wwn replace it LRA 1 path 2 Number FluidCache None 2902 The following Information No action required Clear Alert 1201 storage device None 1501 has had Related transient Alert None failures wwn LRA 1 path 2 Number FluidCache None 2903 The following Information No action required Clear Alert 901 cache device None has been Related registered ww Alert None n 1 path LRA 2 Number FluidCache None 2904 The following Information No action required Clear Alert 901 cache device None has been Related removed wwn Alert None 1 path 2 LRA FluidCache Number None Storage Management Message Reference 239 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2905 The following Information No action required Clear Alert 901 cache device is None being Related removed wwn Alert None 1 path 2 LRA FluidCache Number None 2906 Caching is Information No action required Clear Alert 1201 being removed None 1501 for the Rel
123. ge Format Power Supply X Controller A Connector B Enclosure C For example 2312 A power supply in the enclosure has an AC failure Power Supply 1 Controller 1 Connector 0 Enclosure 2 SCSI Temperature Probe Message Format Temperature Probe X Controller A Connector B Target ID C where C is the SCSI ID number of the EMM managing the temperature probe For example 2101 Temperature dropped below the minimum warning threshold Temperature Probe 1 Controller 1 Connector 0 Target ID 6 SAS Temperature Probe Message Format Temperature Probe X Controller A Connector B Enclosure C For example 2101 Temperature dropped below the minimum warning threshold Temperature Probe 1 Controller 1 Connector 0 Enclosure 2 SCSI Fan Message Format Fan X Controller A Connector B Target ID C where C is the SCSI ID number of the EMM managing the fan For example 2121 Device returned to normal Fan 1 Controller 1 Connector 0 Target ID 6 SAS Fan SCSI EMM Message Format Fan X Controller A Connector B Enclosure C For example 2121 Device returned to normal Fan 1 Controller 1 Connector 0 Enclosure 2 Message Format EMM X Controller A Connector B Target ID C where C is the SCSI ID number of the EMM For example 2121 Device returned to normal EMM 1 Controller 1 Connector 0 Target ID 6 SAS EMM Message Format EMM X Controller A Connector B Enclosure C For example 2121 Device
124. ge Management Message Reference 71 Table 3 4 Storage Management Messages continued Cause and Action SNMP Trap Numbers Related Alert Information Cause A physical disk has been removed from the disk group This alert can also be caused by loose or defective cables or by problems with the enclosure Action If a physical disk was removed from the disk group either replace the disk or restore the original disk On some controllers a removed disk has a red X for its status On other controllers a removed disk may have an Offline status or is not displayed on the user interface Perform a rescan after replacing or restoring the disk If a disk has not been removed from the disk group then check for problems with the cables See the online help for more information on checking the cables Ensure that the enclosure is powered on If the problem persists check the enclosure documentation for further diagnostic information Storage Management Message Reference Clear Alert 903 Number 2052 Related Alert Number 2054 2057 2056 2076 2079 2081 2083 2129 2202 2204 2270 2292 2299 2369 LRA Number 2070 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2050 Physical disk Warning Cause A physical disk Clear Alert 903 offline Non critical in the disk group is
125. ge Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2329 SAS port Warning Cause The text for this Clear Alert 753 report 1 Non critical alert is generated by the controller and can vary depending on the situation The 1 indicates a substitution variable The text for this substitution variable is generated by the controller and is displayed with the alert in the alert log This text can vary depending on the situation Action Run the PHY integrity test diagnostic Make sure the cables are attached securely If the problem persists replace the cable with a valid cable according to SAS specifications If the problem still persists you may need to replace some devices such as the controller or EMM See the hardware documentation for more information None Related Alert None LRA Number 2060 Storage Management Message Reference 199 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2330 SAS port OK Normal Cause The 1 Clear Alert 751 report 1 Informational indicates a substitution None variable The text for Related this substitution Alert None variable is generated by the controller and is LRA displayed with the alert Number in the alert log This None text can vary depending on the situation This alert
126. he Rebuild task See the Dell OpenManage Server Administrator Storage Management User s Guide for more information 186 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2306 Bad block table Warning Cause The bad block Clear Alert 903 is 80 full Non critical table is used for remapping bad disk blocks This table fills as bad disk blocks are remapped When the table is full bad disk blocks can no longer be remapped and disk errors can no longer be corrected At this point data loss can occur The bad block table is now 80 full Action Back up your data Replace the disk generating this alert and restore from back up None Related Alert Number 2307 LRA Number 2070 Storage Management Message Reference 187 Table 3 4 Storage Management Messages continued 188 Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2307 Badblock table Critical Cause The bad block ClearAlert 904 is full Unable Failure Error table is used for None to log block 1 remapping bad disk Related blocks This table fills Alert as bad disk blocks are N mbet remapped When the 2048 table is full bad disk blocks can no longer be LRA remapped and disk Number errors can no longer be 2071 corrected At this poi
127. his event is generated fault state when the specified drive recovers from a faulty condition Drive lt Drive gt Informational This event is generated drivecpresence was asserted when the drive is installed Drive lt Drive gt Warning This event is generated sik pa cs cecal Farn ot he led oe the drive is about to asserted alls Drive lt Drive gt Informational This event is generated when the drive from earlier predictive failure is corrected predictive failure was deasserted hot spare was asserted Drive lt Drive gt Warning This event is generated when the drive is placed in a hot spare System Event Log Messages for IPMI Systems 265 Table 4 10 Drive Events continued Event Message Severity Cause Drive lt Drive gt Informational This event is generated when the drive is taken out of hot spare hot spare was deasserted Drive lt Drive gt Warning This event is generated when the drive is placed in consistency check in consistency check progress was asserted Drive lt Drive gt Informational This event is generated when the consistency check of the drive is completed consistency check in progress was deasserted Drive lt Drive gt Critical This event is generated when the drive is placed in in critical array was A critical array asserted Drive lt Drive gt Informational This event is generated when the drive is removed in crit
128. ical array was aan from critical array deasserted Drive lt Drive gt Critical This event is generated when the drive is placed in in failed array was asserted the fail array Drive lt Drive gt Informational This event is generated when the drive is removed in failed array was from the fail array deasserted Drive lt Drive gt Informational This event is generated rebuild in progress was when the drive is deserted rebuilding Drive lt Drive gt Warnin This event is generated 5 5 when the drive rebuilding rebuild aborted was asserted process is aborted Drive lt Drive gt is installed Informational This event is generated when the drive is installed Drive lt Drive gt is removed Critical This event is generated when the drive is removed 266 System Event Log Messages for IPMI Systems Table 4 10 Drive Events continued Event Message Severity Cause Fault detected on drive Critical This event is generated lt Drive gt when the specified drive in the array is faulty Intrusion Events The chassis intrusion messages are a security measure Chassis intrusion alerts are generated when the system s chassis is opened Alerts are sent to prevent unauthorized removal of parts from the chassis Table 4 11 Intrusion Events Event Message Severity Cause lt Intrusion sensor Critical This event is generated when the Name gt sensor detected ntrusion se
129. imum required not performed versions of the LRA The RAID controller Number configuration firmware and drivers 2060 file is out of This situation has date missing occurred because a the required configuration file is out information or of date missing the not properly required information or formatted to not properly formatted complete the to complete the comparison comparison Action Reinstall Storage Management 132 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2167 The current Warning Cause The version of ClearAlert 103 kernel version Non critical the kernel and the driver None and the do not meet the Related non RAID minimum requirements 4 it None SCSI driver Storage Management version are may not be able to LRA older than the display the storage or Number minimum perform storage 2050 required levels management functions See readme txt until you have updated for a list of the system to meet the validated minimum requirements kernel and Action See the Readme driver versions file for a list of validated kernel and driver versions Update the system to meet the minimum requirements and then reinstall Storage Management Storage Management Message Reference 133 Table 3 4 Storage Management Messages continued Event Description Severity Cause and A
130. insertion into the system and by measuring how long a fan enclosure is absent from the chassis This sensor monitors the chassis and in attached system s AC Power Cord Sensor Monitors the presence of AC power for an AC power cord Hardware Log Sensor Monitors the size of a hardware log Processor Sensor Monitors the processor status in the system Introduction 9 e Pluggable Device Sensor Monitors the addition removal or configuration errors for some pluggable devices such as memory cards e Battery Sensor Monitors the status of one or more batteries in the system SD Card Device Sensor Monitors instrumented Secure Digital SD card devices in the system Sample Event Message Text The following example shows the format of the event messages logged by Server Administrator EventID 1000 Source Server Administrator Category Instrumentation Service Type Information Date and Time Mon Oct 21 10 38 00 2002 Computer lt computer name gt Description Server Administrator starting Data Bytes in Hex Viewing Alerts and Event Messages An event log is used to record information about important events Server Administrator generates alerts that are added to the operating system event log and to the Server Administrator alert log To view these alerts in Server Administrator 1 Select the System object in the tree view 2 Select the Logs tab 3 Select the Alert tab You c
131. ion Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2153 Enclosure OK Normal Cause An enclosure Clear Alert 851 service tag Informational service tag was changed None changed In most circumstances Related this service tag should Alert None only be changed by Dell support or your service LRA provider Number None Action Ensure that the tag was changed under authorized circumstances 2154 Maximum OK Normal Cause A user has Clear Alert 1051 temperature Informational changed the value for None probe warning the maximum Related threshold value temperature probe Alert None changed warning threshold This alert is for informational LRA purposes Number None Action None 2155 Minimum OK Normal Cause A user has Clear Alert 1051 temperature Informational changed the value for None probe warning the minimum Related threshold value temperature probe Alert Non changed warning threshold This alert is for informational LRA purposes Number None Action None 2156 Controller OK Normal Cause The controller Clear Alert 751 alarm has been Informational alarm test has run None tested successfully This alert is Related for informational Alert None purposes ae Action None Nutiber None Storage Management Message Reference 125 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2157 Controller OK Norma
132. is outside of range system board system inlet or the carrier in the specified system lt Sensor Name Location gt is outside of normal operating range System Event Log Messages for IPMI Systems Table 4 1 Temperature Sensor Events continued Event Message Severity Cause The lt Sensor Name Location gt temperature is within range Information Temperature of the backplane system board system inlet or the carrier in the specified system lt Sensor Name Location gt returned to a normal operating range Voltage Sensor Events The voltage sensor event messages monitor the number of volts across critical components These messages provide status and warning information for voltage sensors for a particular chassis Table 4 2 Voltage Sensor Events Event Message Severity Cause lt Sensor Name Location gt Critical The voltage of the monitored voltage sensor detected device has exceeded the critical a failure lt Reading gt where threshold lt Sensor Name Location gt is the entity that this sensor is monitoring Reading is specified in volts For example 3 860 V lt Sensor Name Location gt Critical The voltage specified by voltage sensor state lt Sensor Name Location gt is in asserted critical state lt Sensor Name Location gt Information The voltage of a previously voltage sensor state de asserted System Event Log Messages for IPMI Systems reported lt Sen
133. is event is generated when the device absent was asserted was not detected The lt Device Name gt Information This event is generated when the device is present was detected The lt Device Name gt Critical This event is generated when the device is absent was not detected System Event Log Messages for IPMI Systems Miscellaneous The following table provides events related to hardware and software components like mezzanine cards sensors firmware etc and compatibility issues Table 4 19 Miscellaneous Events Description Severity Cause System Board Video Critical This event is generated when the Riser Module required module is removed sensor for System Board device removed was asserted Mezz B lt slot number gt Critical This event is generated when an Status Add in Card incorrect Mezzanine card is installed for sensor for Mezz I O fabric B lt slot number gt install error was asserted Mezz C lt slot number gt Critical This event is generated when an Status Add in Card incorrect Mezzanine card is installed for sensor for Mezz I O fabric C lt slot number gt install error was asserted Hdwar version err Critical This event is generated when an Version Change incompatible hardware is detected sensor hardware incompatibility was asserted Hdwar version err Critical This event is generated when a hardware Version Change sensor hardware incompatibility BMC firmw
134. is generated when the errors detected on a chipset is unable to correct the memory memory device at errors Usually more than on DIMM is location s listed because a single DIMM may or lt location gt may not be identifiable depending on the error Correctable memory Critical This event is generated when the error logging chipset in the ECC error correction rate disabled for a memory exceeds a predefined limit device at location lt location gt 260 System Event Log Messages for IPMI Systems BMC Watchdog Events The BMC watchdog operations are performed when the system hangs or crashes These messages monitor the status and occurrence of these events in a system Table 4 7 BMC Watchdog Events Event Message Severity Cause BMC OS Watchdog timer Information expired This event is generated when the BMC watchdog timer expires and no action is set This event is generated when the BMC watchdog detects that the system has crashed timer expired because no response was received from Host and the action is set to reboot This event is generated when the BMC watchdog detects that the system has crashed timer expired because no response was received from Host and the action is set to power off BMC OS Watchdog Critical performed system reboot BMC OS Watchdog Critical performed system power off BMC OS Watchdog Critical performed system power cycle This event
135. ituation Action See the hardware documentation for more information None Related Alert None LRA Number 2061 203 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2337 Thecontrolleris Critical Cause The controller ClearAlert 1154 unable to Failure Error was unable to recover None recover cached data froin the cache Related data from the This may occur when Alert None battery backup the system is without unit BBU power for an extended LRA period of time when the Number battery is discharged 2101 Action Check if the battery is charged and in good health When the battery charge is unacceptably low it cannot maintain cached data Check if the battery has reached its recharge limit The battery may need to be recharged or replaced 2338 The controller OK Normal Cause This alert is for ClearAlert 1151 has recovered Informational informational purposes None cached data Action None Related from the BBU Alert None LRA Number None 2339 The factory OK Normal Cause This alert is for Clear Alert 751 default Informational informational purposes None settings have Action None Related been restored Alert None LRA Number None 204 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP
136. k extended medium error 124 Bad block extended sense error 124 Bad block medium error 123 Bad block replacement error 123 Bad block sense error 123 Bad PHY 1 183 battery messages 280 BIOS Generated System Events 268 BIOS generated system messages 268 BMC Watchdog Events 261 BMC watchdog messages 261 C cable interconnect messages 279 Change write policy 107 chassis intrusion messages 35 Chassis intrusion sensor 255 chassis intrusion sensor 9 Communication regained 129 Communication timeout 119 Controller event log 1 201 203 Controller rebuild rate 122 cooling device messages 26 current sensor 9 Current sensor has failed 253 current sensor messages 32 D Dead disk segments 121 Diagnostic message 1 192 Drive Events 265 Driver version mismatch 117 drives messages 265 Index 289 E Enclosure alarm 120 Enclosure firmware mismatch 107 entity presence messages 281 Error occurred 1 208 event description reference 14 F fan enclosure messages 47 fan enclosure sensor 9 fan sensor 9 Fan Sensor Events 251 Fan sensor has failed 249 fan sensor messages 251 Firmware version mismatch 116 G Global hot spare 93 H hardware log sensor 9 Hardware Log Sensor Events 265 hardware log sensor messages 264 290 Index Hot spare SMART polling 176 Intrusion Events 267 intrusion messages 267 L Log monitoring 267 M
137. k the health of the enclosure and its components by selecting the enclosure object in the tree view The Health subtab displays a red X or yellow exclamation point for enclosure components that are Failed or Degraded See the enclosure documentation for more information 2358 The battery OK Normal Cause This alert is for ClearAlert 1151 charge cycle is Informational informational purposes None complete Action None Related Alert None LRA Number None Storage Management Message Reference 215 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2359 The physical Warning Cause The physical Clear Alert 903 disk is not Non critical disk does not comply None certified with the standards set Related by Dell and is not Alert None supported LRA Action Replace the Number physical disk with a 2070 physical disk that is supported 2360 A user has OK Normal Cause This alert is for Clear Alert 751 discarded data Informational informational purposes None from the Action None Related controller Alert None cache LRA Number None 2361 Physical OK Normal Cause This alert is for ClearAlert 751 disk s that are Informational informational purposes None part of a virtual Action None Related disk have been Alert None removed while the system was LRA shut down Number None This removal was discovered du
138. l Cause A user has reset Clear Alert 751 configuration Informational the controller None has been reset configuration See the Related online help for more Alert None information This alert is for informational LRA purposes Number None Action None 2158 Physical disk OK Normal Cause An offline Clear Alert 901 online Informational physical disk has been Status made online This alert Alert 2158 is for informational is a clear purposes alert for Action None alert 2050 Related Alert Number 2048 2050 2065 2099 2121 2196 2201 2203 LRA Number None 126 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2159 Virtual disk OK Normal Cause A user has Clear Alert 1201 renamed Informational renamed a virtual disk None When renaming a Related virtual disk on a PERC Alert None 4 SC 4 DC 4e DC LRA 4 Di CERC Number ATA100 4ch PERC 5 E None PERC 5 i or SAS 5 iR controller this alert displays the new virtual disk name On the PERC 4 SC 4 DC 4e DC 4 Di 4 IM 4e Si 4e Di and CERC ATA 100 4ch controllers this alert displays the original virtual disk name This alert is for informational purposes Action None 2160 Dedicated hot OK Normal spare assigned Informational Storage Management Message Reference Cause A user has a
139. le None Action Check if the disk is healthy and that it has not been removed Check the cables If necessary replace the disk and reassign the hot spare Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2203 lt A dedicated Warning Cause The controller is Clear Alert 903 hot spare Non critical unable to communicate None failed with a disk that is Related assigned as a dedicated AJert hot spare The disk may Number have failed or been 2048 removed There may also be a bad or loose LRA cable Number 2070 Action Check if the disk is healthy and that it has not been removed Check the cables If necessary replace the disk and reassign the hot spare 2204 A dedicated hot spare has been removed OK Normal Informational Storage Management Message Reference Cause The controller is unable to communicate with a disk that is assigned as a dedicated hot spare The disk may have been removed There may also be a bad or loose cable Action Check if the disk is healthy and that it has not been removed Check the cables If necessary replace the disk and reassign the hot spare Clear Alert 901 None Related Alert None LRA Number None 149 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Rel
140. learAlert 751 Controller Informational informational purposes None Property are Action You should Related A change at least one Alert None controller property and RA run the command N miber agam None 2219 Abort Check OK Normal Cause This alert is for ClearAlert 751 Consistencyon Informational informational purposes None Error Action Change at least Related Copyback one controller property Alert None AutoCopyback and run the command RA on Predictive again Failure and l Number Loadbalance None changed 2220 Copyback OK Normal Cause This alert is for ClearAlert 751 AutoCopyback Informational informational purposes None on Predictive Action Change at least Related Failure and one controller property Alert None Loadbalance and run the command LRA changed Jsa gam Number None 2221 Auto OK Normal Cause This alert is for ClearAlert 751 Copyback on Informational informational purposes None Predictive Action Change at least Related Failure Abort one controller property Alert None CC on Error and run the command LRA and again Number Loadbalance umber changed None 154 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2222 Loadbalance OK Normal Cause This alert is for Clear Alert 751 and Auto Informational informational purposes None ony Das is Action Change at least
141. ling capacity has changed The system Warning This event is generated when performance change in power supply degraded becaus degrades system power capacity has performance changed The system Warning This event is generated when performance change in power supply degraded becaus degrades system of user defined performance power capacity has changed The system halted Critical This event is generated when because system there is inefficient power for power exceeds the system capacity The system Warning This event is generated when performance system power is inefficient degraded becaus causing system performance power exceeds to degrade capacity The system Critical This event is generated when performance degraded becaus power draw exceeds the power threshold system power is inefficient causing system performance to degrade System Event Log Messages for IPMI Systems 283 284 Table 4 17 Power And Performance Events continued Description Severity Cause The system Information This event is generated when performance system performance was restored restored Entity Presence Events The entity presence messages are used for detecting different hardware devices Table 4 18 Entity Presence Events Description Severity Cause lt Device Name gt Information This event is generated when the device presence was was detected asserted lt Device Name gt Critical Th
142. loss or data Related critically corruption may be Alert degraded imminent Number Action Replace the 2321 DIMM immediately to IRA avoid data loss or data N mb t corruption The DIMM 2061 is a part of the controller battery pack See your hardware documentation for information on replacing the DIMM or contact technical support 2321 Single bit Critical Cause The DIMM is ClearAlert 754 ECC error Failure Error malfunctioning None The DIMM is Data loss or data Related critically corruption is imminent Alert None nonfunctional No further alerts are There is no generated LRA further Action Replace the er reporting DIMM immediately The DIMM is a part of the controller battery pack See your hardware documentation for information on replacing the DIMM l Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2322 The DC power Critical Cause The power Clear Alert 1004 supply is Failure Error supply unit is switched Number switched off off Either a user 2323 switched off the power Related supply unit or it is Alert None defective l LRA Action Check if the Number power switch is turned 209 off If it is turned off turn it on If the problem persists check if the power cord is attached and functional If the problem is still not corrected or if the power switch is a
143. lready turned on replace the power supply unit 2323 The power OK Normal Cause This alert is for Clear Alert 1001 supply is Informational informational purposes Status switched on Attion None Alert 2323 is a clear alert for alerts 2313 and 2322 Related Alert None LRA Number None Storage Management Message Reference 195 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2324 The AC power Critical Cause The power cable Clear Alert 1004 supply cable Failure Error may be pulled out Number has been or removed The power 2325 removed cable may also have Related overheated and become 4 ert None warped and nonfunctional LRA Number Action Replace the 2091 power cable 2325 The power OK Normal Cause This alert is for Clear Alert 1001 supply cable Informational informational purposes Status has been Acton None Alert 2325 inserted is a clear alert for alerts 2324 and 2312 Related Alert None LRA Number None 196 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2326 A foreign OK Normal Cause This alert is for Clear Alert 751 configuration Informational informational purposes None has been The controller has Related detected physical disks that were Al
144. mational informational purposes None Error Action Change at least Related Copyback and one controller property Alert None Auto and run the command RA Copyback on ain Predictive l Number Failure None changed 2228 Copybackand OK Normal Cause This alert is for Clear Alert 751 Auto Informational informational purposes None Copyback oe Action Change at least Related Predictive one controller property Alert None Failure and run the command changed LRA apain Number None 2229 Abort Check OK Normal Cause This alert is for ClearAlert 751 Consistencyon Informational informational purposes None Error and Auto Action Change at least Related Copyback on one controller property Alert None Predictive and run the command LRA Failure again changed l Number None 156 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Cause and Action Related SNMP ID Alert Trap Information Numbers 2230 Auto Cause This alert is for Clear Alert 751 Copyback on informational purposes None Predictive Action Change at least Related Failure one controller property Alert None d Property enanecds and run the command LRA again Number None 2231 Copyback and OK Normal Cause This alert is for Clear Alert 751 and Abort informational purposes None Check Action Change at least Related Consistency on one controller property Alert None Error changed and run the command L
145. mum times than the battery Related exceeded recharge limit allows Alert None Action Replace the LRA battery pack Nuniber 2100 152 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity ID Cause and Action Related Alert Information SNMP Trap Numbers 2214 Battery charge OK Normal in progress Informational Cause This alert is for informational purposes None Clear Alert None Related Alert None LRA Number None 115 2215 Battery charge OK Normal process Informational interrupted Cause This alert is for informational purposes None Clear Alert None Related Alert None LRA Number None 115 2216 The battery OK Normal learn mode has Informational changed to auto Cause This alert is for informational purposes Action None Clear Alert None Related Alert None LRA Number None 1151 2217 The battery OK Normal learn mode has Informational changed to warn Cause This alert is for informational purposes Action None Clear Alert None Related Alert None LRA Number None Storage Management Message Reference 1151 153 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2218 Noneofthe OK Normal Cause This alert is for C
146. n Storage Management Message Reference Clear Alert 1053 Number 2353 Related Alert Number 2112 LRA Number 2090 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2101 Temperature Warning Cause The physical Clear Alert 1053 dropped below Non critical the minimum warning threshold disk enclosure is too cool Action Check if the thermostat setting is too low and if the room temperature is too cool Number 2353 Related Alert Number None LRA Number 2090 Storage Management Message Reference 95 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2102 Temperature exceeded the maximum failure threshold 9 Critical Failure Error Cause The physical disk enclosure is too hot A variety of factors can cause the excessive temperature For example a fan may have failed the thermostat may be set too high or the room temperature may be too hot Action Check for factors that may cause overheating For example verify that the enclosure fan is working You should also check the thermostat settings and examine whether the enclosure is located near a heat source Make sure the enclosure has enough ventilation and that the room temperature is not too
147. n chassis location previous Previous state was lt State gt state and fan sensor value information is provided Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Fan sensor value lt Reading gt 28 Server Management Messages Voltage Sensor Messages The voltage sensors listed in Table 2 4 monitor the number of volts across critical components Voltage sensor messages provide status and warning information for voltage sensors in a particular chassis Table 2 4 Voltage Sensor Messages Event Description Severity Cause ID 1150 Voltage sensor has failed Error A voltage sensor in Sensor location lt Location ae system in chassis gt ALES HE SENSO location chassis Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt location previous state and voltage sensor value information is provided 1151 Voltage sensor value unknown Information Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt If sensor type is not discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt Server Management Messages A voltage sensor in the specified
148. n Numbers 2161 Action Although this Cont alert is provided for informational purposes you may need to assign a new hot spare to the virtual disk 2162 Communicatio OK Normal Cause Communication Clear Alert 851 n regained Informational with an enclosure has Status been restored This alert Alert 2162 is for informational is a clear purposes alert for Action None alerts 2137 and 2292 Related Alert None LRA Number None 2163 Rebuild Critical Cause During a rebuild Clear Alert 904 completed Failure Error one or more blocks of None with errors Storage Management Message Reference data was not recoverable due to missing parity information Some data loss may have occurred Action Perform a check to verify the built array Any files that are impacted should be restored from a backup See the Storage Management online help for more information Related Alert None LRA Number 2071 129 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 130 2164 See the OK Normal Readme file for Informational a list of validated controller driver versions Cause Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller drivers This alert is for informational purposes Action See the Readme file for driver and firmwa
149. ne After being split both virtual disks retain a copy LRA of the data although the Number mirror is no longer intact None The updates to the data are no longer copied to the mirror This alert is for informational purposes Action None 2117 A mirrored OK Normal Cause A user has Clear Alert 1201 virtual disk has Informational caused a mirrored Number been virtual disk to be None unmirrored unmirrored When a Related virtual disk is mirrored Alert its data is copied to Nomber another virtual disk in None order to maintain redundancy After being LRA unmirrored the disk Number formerly used as the None mirror returns to being a physical disk and becomes available for inclusion in another virtual disk This alert is for informational purposes Action None Storage Management Message Reference 106 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2118 Change write OK Normal Cause A user has Clear Alert 1201 policy Informational changed the write Number policy for a virtual disk None This alert is for Related informational purposes A Jert Action None Number None LRA Number None 2120 Enclosure Warning Cause The firmware on Clear Alert 853 firmware Non critical the EMM is not the Number mismatch same version It is None required that both Related modules have the same Alert version of the firmwar
150. ne found to be in stopped for some reason Related security locked and hence the device is Alert None state Full in security locked state RA initialization Miche Puntal Number has to be done initialization to recover None on the security h device locked drive to recover the drive in usable state 2699 Connection to Error No action required Clear Alert 1604 CFM lost None FluidCache Related Alert None LRA Number None 2700 The following Information No action required Clear Alert 1601 journal mirror None is available 1 Related Alert None LRA Number None 2701 The following Information No action required Clear Alert 1601 journal mirror None is being Related replaced wwn Alert None l LRA FluidCache Number None 236 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2702 The following Warning No action required Clear Alert 1603 journal mirror None has Related failed wwn Alert None l LRA FluidCache Number None 2703 There are not Error To resolve the issue you ClearAlert 1604 enough journal must ensure that there None mirrors are at least two journal Related available to mirrors that are Alert None operate accessible You must LRA FluidCache activate either one or Number more failed cache None devices or use the fldc_restore utilit
151. nerated System Events continued ate exceeded for location gt Event Message Severity Cause A fatal IO error detected Critical This error is generated when a on a component at bus fatal IO error is detected lt number gt device lt number gt function lt number gt A fatal IO error detected Critical This error is generated when a on a component at slot fatal IO error is detected lt number gt A non fatal PCIe error Warning This event is generated in detected on a component at association with a CPU JERR bus lt number gt device lt number gt function lt number gt A non fatal PCIe error Warning This event is generated in detected on a component at association with a CPU JERR slot lt number gt A non fatal IO error Warning This event is generated in detected on a component at association with a CPU IERR bus lt number gt device and indicates the PCI PCle lt number gt function device that caused the CPU lt number gt TERR Memory device was added at Information This event is generated when location lt location gt memory is added to the system Memory device is removed Information This event is generated when from location lt location gt memory is removed from the system Unsupported memory Critical This event is generated when configuration check memory configuration is memory device at location incorrect for the system lt location gt Correctable memory error War
152. nfiguration This event is generated when there is a memory failure in a mirror configured memory configuration This event is generated when the memory redundancy mode has change to spare redundant This event is generated when redundancy is lost in a sparer configured memory configuration This event is generated when there is a memory failure in a spare configured memory configuration Hardware Log Sensor Events The hardware logs provide hardware status messages to the system management software On particular systems the subsequent hardware messages are not displayed when the log is full These messages provide status and warning messages when the logs are full 264 System Event Log Messages for IPMI Systems Table 4 9 Hardware Log Sensor Events Event Message Severity Cause Log full Critical This event is generated when the SEL device detected detects that only one entry can be added to the SEL before it is full Log cleared Information This event is generated when the SEL is cleared Drive Events The drive event messages monitor the health of the drives in a system These events are generated when there is a fault in the drives indicated Table 4 10 Drive Events Event Message Severity Cause Drive lt Drive gt asserted Critical This event is generated fault state when the specified drive in the array is faulty Drive lt Drive gt de asserted Information T
153. nformational purposes Related Action None Alert Number None LRA Number None 2144 Controller OK Normal Cause A user has Clear Alert 751 alarm disabled Informational disabled the controller Number alarm This alert is for None informational purposes Related Action None Alert Number None LRA Number None 122 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2145 Controller Warning Cause The controller Clear Alert 1153 battery low Non critical battery charge is low None Action Recondition Related the battery See the Alert None online help for more LRA information Nunes 2100 2146 Bad block Warning Cause A portion ofa Clear Alert 753 replacement Non critical physical disk is None error damaged Related Action See the Dell Alert None OpenManage Server LRA Administrator Storage Number Management 2060 i online help for more information 2147 Bad block Warning Cause A portion ofa ClearAlert 753 sense Non critical physical disk is None error damaged Related Action See the Dell Alert None OpenManage Server LRA Administrator Storage Number Management online 2060 f help for more information 2148 Bad block Warning Cause A portion ofa ClearAlert 753 medium error Non critical physical disk is None damaged Related Action See the Dell
154. ng the ECC memory correction rate and the type of memory events that have occurred K NOTE A critical status does not always indicate a system failure or loss of data In some instances the system has exceeded the ECC correction rate Although the system continues to function you should perform system maintenance as described in Table 2 9 K NOTE In Table 2 9 lt status gt can be either critical ornon critical Table 2 9 Memory Device Messages Event Description Severity Cause ID 1403 Memory device status is Waring A memory device correction lt status gt rate exceeded an acceptable enor yide riced oca ion value er search device Llaeatkion dn Chassiss status and possible memory module event cause P 7 P Possible memory module information is provided event cause lt list of causes gt 1404 Memory device status is Error A memory device correction lt status gt rate exceeded an acceptable emory device l cation value a memory spare bank was lt location in chassis gt Possible memory module event cause lt list of causes gt activated or a multibit ECC error occurred The system continues to function normally except for a multibit error Replace the memory module identified in the message during the system s next scheduled maintenance Clear the memory error on multibit ECC error The memory device status and possible memory module event cause information is provided 46 Serve
155. ning This event is generated when rate exceeded for correctable ECC errors have lt location gt increased from a normal rate Correctable memory error Critical This event is generated when r lt correctable ECC errors reach a critical rate System Event Log Messages for IPMI Systems Table 4 12 BIOS Generated System Events continued Event Message Severity Cause Memory device at location Critical This event is generated when lt location gt is overheating system memory reaches critical temperature An OEM diagnostic event Information This event is generated when an occurred OEM event occurs OEM events can be used by Dell service team to better understand the cause of the failure CPU lt number gt protocol Critical This event is generated when error detected the processor protocol enters a non recoverable state CPU bus parity error Critical This event is generated when detected the processor bus PERR enters a non recoverable state CPU lt number gt Critical This event is generated when initialization error the processor initialization detected enters a non recoverable state CPU lt number gt machine check Critical This event is generated when error detected the processor machine check enters a non recoverable state All event logging is Critical This event is generated when all disabled event logging is disabled Logging is disabled Critical This event is generated when the
156. nk Tuning sensor or Flex address information is not failed to get link obtained from BMC iDRAC tuning or flex address data from BMC iDRAC was asserted The lt name gt is Critical This event is generated when the device removed was removed The lt name gt is Information This event is generated when the device inserted was inserted or installed A fabric mismatch Critical This event is generated when an detected between incorrect Mezzanine card is installed for IOM and mezzanine T O fabric card lt number gt Hardware Critical This event is generated when an incompatibility incorrect Mezzanine card is installed in detected with the system mezzanine card lt number gt The QuickPath Warning This event is generated when the bus is Interconnect QPI not operating at maximum speed or width degraded width The QuickPath Information This event is generated when the bus is Interconnect QPI operating at maximum speed or width width regained BIOS detected an Critical This event is generated when TXT error configuring initialization failed the Intel Trusted Execution Technology TXT Processor detected Critical This event is generated when TXT CPU Technology operation an error while performing an Intel Trusted Execution TXT microcode boot failed System Event Log Messages for IPMI Systems 287 Table 4 19 Miscellaneous Events continued BIOS Authenticated Code Module detecte
157. nsor detects an intrusion an intrusion y lt Intrusion sensor Information This event is generated when the Name gt sensor returned earlier intrusion has been corrected to normal state lt Intrusion sensor Critical This event is generated when the Name gt sensor intrusion intrusion sensor detects an intrusion was asserted while while the system is on system was ON lt Intrusion sensor Critical This event is generated when the Name gt sensor intrusion intrusion sensor detects an intrusion was asserted while while the system is off system was OFF The chassis is open Critical This event is generated when the intrusion sensor detects an intrusion The chassis is closed Information This event is generated when the earlier intrusion has been corrected o The chassis is open Critical This event is generated when the while the power is on intrusion sensor detects an intrusion while the system is on System Event Log Messages for IPMI Systems 267 Table 4 11 Intrusion Events continued Event Message Severity Cause while the power is on The chassis is closed Information This event is generated when the earlier intrusion has been corrected while the power is on The chassis is open Critical This event is generated when the while the power is intrusion sensor detects an intrusion off while the system is off The chassis is closed Information Thi
158. nt data loss can occur The 1 indicates a substitution variable The text for this substitution variable is displayed with the alert in the alert log and can vary depending on the situation Action Replace the disk generating this alert If necessary restore your data from backup Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2309 A physical disk Warning Cause You have Clear Alert 903 is Non critical attempted to replace a None incompatible disk with another disk Related that is using an Alert None incompatible technology For LRA example you may have Number replaced one side of a 2070 mirror with a SAS disk when the other side of the mirror is using SATA technology Action See the hardware documentation for information on replacing disks 2310 Avirtualdisk is Critical Cause A redundant Clear Alert 1204 permanently Failure Error virtual disk has lost None degraded redundancy This may Related occur when the virtual Alert None disk suffers the failure of multiple physical LRA disks In this case both Number the source physical disk 2081 and the target disk with redundant data have failed A rebuild is not possible because there is no redundancy Action Replace the failed disks and restore from backup Storage Management Message Reference 189 Table 3 4
159. nued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2252 The physical OK Normal Cause This alert is for Clear Alert 901 disk blink has Informational informational purposes None ceased Action None Related Alert None LRA Number None 2253 Redundant OK Normal Cause This alert is Clear Alert 751 path restored Informational provided for None informational purposes Related None Alert None LRA Number None 2254 The Clear OK Normal Cause This alert is for Clear Alert 901 operation has Informational informational purposes None cancelled Action None Related Alert None LRA Number None Storage Management Message Reference 163 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2255 The physical OK Normal Cause This alert is for Clear Alert 901 disk has been Informational informational purposes None started Action None Related Alert Number 2048 2050 2065 2099 2121 2196 2201 2203 LRA Number None 2257 Controller Warning Non Cause The controller Clear Alert 753 preserved critical cache is discarded by None cache is the user This alert is for Related discarded informational purposes Alert None Action None LRA Number None 2258 Controller has Warning Non Cause I O interrupted Clear Alert 753 preserved critical for a virtual
160. nued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2930 Eventhough Warming To resolve the issue add Clear Alert 1603 caching was a PCle SSD to the None enabled in cache pool Related write back Alert None mode it is LRA currently Number operating in None write through mode FluidCache 2931 Eventhough Warming To resolve the issue add Clear Alert 1203 caching was one or more PCIe SSDs None 1503 enabled in to the cache pool Related write back or Alert None write through LRA mode it is Number currently None operating in pass through mode FluidCache 2932 Cachingisno Warning No action required Clear Alert 1203 longer None degraded to Related write through Alert None mode and is LRA now operating Number in write back None mode FluidCache Storage Management Message Reference 245 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2933 Cachingisno Warning No action required Clear Alert 1603 longer None degraded to Related pass through Alert None mode and is LRA now operating Number in its None configured mode FluidCache 246 Storage Management Message Reference System Event Log Messages for IPMI Systems The tables in this chapter list the system event log SEL messages their severity and cause K NOTE For corrective actions se
161. ocation chassis gt Chassis location lt Name of chassis gt previous state and Previous state was lt State gt battery sensor status information is Battery sensor status provided lt status gt 1701 Battery sensor value unknown Warning A battery sensor in the specified system could not retrieve a reading The sensor location chassis location previous Previous state was lt State gt state and battery sensor status information is provided Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Battery sensor status lt status gt Server Management Messages _ 57 Table 2 15 Battery Sensor Messages continued Event Description Severity Cause ID 1702 Battery sensor returned to a Information A battery sensor in normal value the specified system Sensor Location lt Location in rete that a d chadsiss battery transitione back to a normal Chassis Location lt Name of state The sensor chassis gt location chassis Previous state was lt State gt location previous Battery sensor status state and battery sensor status lt status gt i AEA information is provided 1703 Battery sensor detected a Warning A battery sensor in warning value the specified system Sensor Location lt Location in eaa that ahaesies a pattery Is TE a predictive failure Chassis Location lt Name of state The sensor chassis gt location chassis Previous state wa
162. of the specified fan exceeded the warning threshold The speed of the specified fan exceeded the critical threshold The speed of the specified fan might not provide enough cooling to the system The speed of the specified fan is operating in a normal range A required fan was removed A fan was added The total number of fans present A required fan is missing One or more fans may have started functioning or installed and the redundancy has been regained 252 System Event Log Messages for IPMI Systems Table 4 3 Fan Sensor Events continued Event Message Severity Cause Fan redundancy is Critical One or more required fans may have lost failed or removed and hence the redundancy was lost Fan redundancy is Warning One or more fans may have failed or degraded removed and hence the redundancy has been degraded Processor Status Events The processor status messages monitor the functionality of the processors in a system These messages provide processor health and warning information of a system Table 4 4 Processor Status Events Event Message Severity Cause lt Processor Entity gt status Critical processor sensor IERR where lt Processor Entity gt is the processor that generated th vent For example PROC for a single processor system and PROC for multiprocessor system TERR internal error generated by the lt Processor Entity gt This event is gener
163. on the operating system and is located in the opt dell srvadmin var log openmanage directory e On systems running the in ESXi operating system the dcemdy32 ini file is located under etc cim dell srvadmin srvadmin deng ini and the desys lt xx gt log where xx is 32 or 64 bit depending on the operating system and is located under etc cim dell srvadmin log openmanage The following sub sections explain how to launch the Windows Server 2008 Red Hat Enterprise Linux SUSE Linux Enterprise Server VMware ESX and VMware ESXi event viewers Viewing Events in Microsoft Windows Server 2008 1 Click the Start button point to Settings and click Control Panel 2 Double click Administrative Tools and then double click Event Viewer 3 In the Event Viewer window click the Tree tab and then click System Log The System Log window displays a list of recently logged events 4 To view the details of an event double click one of the event items K NOTE You can also look up the desys lt xx gt xml file in the lt install_path gt omsa log directory to view the separate event log file where the default install_path is C Program Files Dell SysMgt and xx is 32 or 64 depending on the operating system that is installed Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server 1 Log in as root 2 Use a text editor such as vi or emacs to view the file named var log messages The following example shows the Red Hat Enterprise
164. onal details for the events gt specified system The device location chassis location and additional event details if available are provided Server Management Messages 55 Table 2 14 Pluggable Device Messages continued Event Description Severity Cause ID 1652 Device removed from Information A device was removed from the system specified system The device Paice eee ious maar n ate KLbCRELOnCIn A _ ae T T S Shasa G gt if available are provided Chassis location lt Name of chassis gt Additional details lt Additional details for the events gt 1653 Device configuration Error A configuration error was error detected Device location lt Location in chassis gt Chassis location lt Name of chassis gt Additional details lt Additional details for the events gt detected for a pluggable device in the specified system The device may have been added to the system incorrectly 56 Server Management Messages Battery Sensor Messages The battery sensors monitor how well a battery is functioning The battery messages listed in Table 2 15 provide status and warning information for batteries in a particular chassis Table 2 15 Battery Sensor Messages Event Description Severity Cause ID 1700 Battery sensor has failed Critical A battery sensor in sensor locations lt rocatreon in Tailors the specified system Error is not functioning The sensor location chassis l
165. onal provided for Number from physical informational purposes None disk 2 to Action None Related physical disk Alert 1 Number 2060 LRA Number None 2076 Virtual disk Critical Cause A physical disk Clear Alert 1204 Check Failure Error included in the virtual Number Consistency disk failed or there isan None failed error in the parity Related information A failed Alert physical disk can cause Number errors in parity None information LRA Action Replace the Number failed physical disk You 208 can identify which disk has failed by locating the disk that has a red X for its status Rebuild the physical disk When finished restart the check consistency operation 82 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2077 Virtual disk Critical Cause A physical disk Clear Alert 1204 format failed Failure Error included in the virtual disk failed Action Replace the failed physical disk You can identify which physical disk has failed by locating the disk that has a red X for its status Rebuild the physical disk When finished restart the virtual disk format operation Number None Related Alert Number None LRA Number 2081 2079 Virtual disk initialization failed Critical Failure Error Cause A physical
166. one power supply is functional Power supply redundancy Warning Power supply redundancy is is degraded degraded if one of the power supply sources is removed or failed The power supplies are Information This event is generated if the redundant power supply has been reconnected or replaced System Event Log Messages for IPMI Systems 259 Memory ECC Events The memory ECC event messages monitor the memory modules in a system These messages monitor the ECC memory correction rate and the type of memory events that occurred Table 4 6 Memory ECC Events Event Message Severity Cause ECC error correction Information This event is generated when there is a detected on Bank memory error correction on a particular DIMM A B Dual Inline Memory Module DIMM ECC uncorrectable Critical This event is generated when the error detected on chipset is unable to correct the memory Bank DIMM errors Usually a bank number is provided and DIMM may or may not be identifiable depending on the error Correctable memory Critical This event is generated when the error logging chipset in the ECC error correction rate disabled exceeds a predefined limit Persistent Warning This event is generated when there is a correctable memory memory error correction on a particular errors detected on a Dual Inline Memory Module DIMM memory device at location s lt DIMM number gt Multi bit memory Critical This event
167. ontroller to read a blockon the 3359 physical disk and marked that block as LRA invalid If the error was Number encountered ona 2071 source physical disk during a rebuild or reconfigure operation it also punctures the corresponding block on the target physical disk The invalid block is cleared during a write operation Action Back up your data If you are able to back up the data successfully initialize the disk and restore from the back up 2274 The physical disk rebuild has resumed 172 OK Normal Informational Cause This alert is for informational purposes Action None Storage Management Message Reference Clear Alert 901 None Related Alert None LRA Number None Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2276 The dedicated Warning Cause The dedicated Clear Alert 903 hot spare is too Non critical hot spare is not large None small enough to protectall Related virtual disks that reside Alert None on the disk group j LRA Action Assign a larger Number disk as the dedicated 2070 i hot spare 2277 The global hot Warning Cause The global hot Clear Alert 903 spare is too Non critical spare is not large enough None small to protect all virtual Related disks that reside on the Alert None controller LRA Action Assign a larger Nunihee disk a
168. oot Server Management Messages the flash basic input output system BIOS 19 Table 2 1 Server Administrator General Messages continued 20 Event Description Severity Cause ID 1003 A previously scheduled Information The user decides to cancel the system BIOS update has flash BIOS update or an error been canceled occurs during the flash 1004 Thermal shutdown Error This message is generated protection has been when a system is configured for initiated thermal shutdown due to an error event If a temperature sensor reading exceeds the error threshold for which the system is configured the operating system shuts down and the system powers off This event may also be initiated on certain systems when a fan enclosure is removed from the system for an extended period of time 1005 SMBIOS data is absent Error The system does not contain the required systems management BIOS version 2 2 or higher or the BIOS is corrupted 1006 Automatic System Error This message is generated Recovery ASR action when an automatic system was performed recovery action is performed Action performed was due to a hung operating lt Action gt system The action performed Date and time of and the time of action is action lt Date and provided time gt Server Management Messages Table 2 1 Server Administrator General Messages continued Event Description Severity Cause ID 1007 User initiated host
169. ormal operating range The lt Sensor Name Location gt Information Voltage of the monitored voltage is within range Entity lt Sensor Name Location gt returned to a normal operating range System Event Log Messages for IPMI Systems Fan Sensor Events The cooling device sensors monitor how well a fan is functioning These messages provide status warning and failure messages for fans for a particular chassis Table 4 3 Fan Sensor Events Event Message Severity Cause lt Sensor Name Location gt Critical Fan sensor detected a failure lt Reading gt where lt Sensor Name Location gt is the entity that this sensor is monitoring For example BMC Back Fan or BMC Front Fan Reading is specified in RPM For example 100 RPM The speed of the specified lt Sensor Name Location gt fan is not sufficient to provide enough cooling to the system lt Sensor Name Location gt Information Fan sensor returned to normal state The fan specified by lt Sensor Name Location gt has returned to its normal operating speed lt Reading gt lt Sensor Name Location gt Warning The speed of the specified lt Sensor Fan sensor detected a Name Location gt fan may not be warning lt Reading gt sufficient to provide enough cooling to the system lt Sensor Name Location gt Information The fan specified by lt Sensor Name Fan Redundancy sensor Location gt may have failed and hence
170. ormation e Date The date the event occurred Time The local time the event occurred e Type A classification of the event severity Information Warning or Error Introduction 13 e User The name of the user on whose behalf the event occurred e Computer The name of the system where the event occurred Source The software that logged the event e Category The classification of the event by the event source Event ID The number identifying the particular event type e Description A description of the event The format and contents of the event description vary depending on the event type Understanding the Event Description Table 1 2 lists in alphabetical order each line item that may appear in the event description Table 1 2 Event Description Reference Description Line Item Explanation Action performed was lt Action gt Specifies the action that was performed for example Action performed was Power cycle Action requested was lt Action gt Specifies the action that was requested for example Action requested was Reboot shutdown OS first Additional Details lt Additional details for the event gt Specifies additional details available for the hot plug event for example Memory device DIMM1 A Serial number FFFF30B1 lt Additional power supply status information gt Specifies information pertaining to
171. orrected while the controller was Related completing a Aleit Nowe background task A bad disk block was LRA identified The disk Number block has been None remapped Action Consider replacing the disk If you receive this alert frequently be sure to replace the disk You should also routinely back up your data 2281 Virtual diskhas OK Normal Cause The virtual disk Clear Alert 1201 inconsistent data Informational has inconsistent data None This may be caused Related when a power loss or Alert system shutdown occurs Number while data is being 2127 written to the virtual disk This alert is for LRA informational purposes Number None Action None Storage Management Message Reference 175 Table 3 4 Storage Management Messages continued Event ID Description Severity Related SNMP Alert Trap Information Numbers Cause and Action 2282 Critical Failure Error Hot spare SMART polling failed 1716 Clear Alert 904 None Related Alert None LRA Number 2071 Cause The controller firmware attempted a SMART polling on the hot spare but was unable to complete it The controller has lost communication with the hot spare Action Check the health of the disk assigned as a hot spare You may need to replace the disk and reassign the hot spare Make sure the cables are attached securely See the Dell OpenManage Server Administrator Storage Management User
172. peration is successful it indicates that the un recoverable medium did not affect user data If the Backup operation fails restore the file from a previous backup After restoring the file run check consistency operation e If the consistency check is successful no further action is required If the consistency check finds and un recoverable medium error it means that the medium error is located in non user data No further action is required as writing data to the location of the medium error fixes the problem Storage Management Message Reference Clear Alert 1204 None Related Alert None LRA Number None 233 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2417 NOTE If the cntd unrecoverable medium error has not been corrected it may be reported again by the system This error can be fixed by writing data on the affected area or deleting and recreating the Virtual Disk as demonstrated in the following procedure 1 Back up the data 2 Delete the Virtual Disk 3 Recreate the Virtual Disk using the same parameters like size RAID level disks etc 4 Restore data 2418 Disk medium Informational Cause This alert is for ClearAlert 1201 error on virtual informational purposes None disk has been Action None Related corrected Alert None LRA Number None 2425 Statechange Info
173. place the battery pack Monitor the battery to make sure that it recharges successfully 2247 The controller battery is charging OK Normal Informational Storage Management Message Reference Cause This alert is for informational purposes Action None Clear Alert 1151 Number 2358 Related Alert None LRA Number None 161 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2248 The controller OK Normal Cause This alert is for ClearAlert 1151 battery is Informational informational purposes None executing a Action None Related Learn cycle Alert None LRA Number None 2249 The physical OK Normal Cause This alert is for Clear Alert 901 disk Clear Informational informational purposes None operation has Action None Related started Alert None LRA Number None 2250 Redundant Warning Non Cause The redundant Clear Alert 751 Path is broken critical path is broken Number Action Check the 2370 connection to the Related enclosure which is Alert degraded Number 2370 LRA Number None 2251 The physical OK Normal Cause This alert is for Clear Alert 901 disk blink has Informational informational purposes None initiated Action None Related Alert None LRA Number None 162 Storage Management Message Reference Table 3 4 Storage Management Messages conti
174. r lt error description gt fatal error occurs during system boot See Table 4 13 for more information emory Spared Critical This event is generated when Gunner ager memory spare is no longer redundant emory Mirrored Critical This event is generated when redundancy lost memory mirroring is no longer redundant emory RAID Critical This event is generated when redundancy Lost memory RAID is no longer redundant Err Reg Pointer Information This event is generated when an OEM Diagnostic data event OEM event occurs OEM events ee A SESTE can be used by Dell service team to better understand the cause of the failure System Board PFault Fail Critical This event is generated when Safe state asserted the system board voltages are not at normal levels System Board PFault Fail Information This event is generated when Saf state deasserted System Event Log Messages for IPMI Systems earlier PFault Fail Safe system voltages return to a normal level 269 Table 4 12 BIOS Generated System Events continued Event Message Severity Cause Memory Add Information This event is generated when BANK DIMM presence was memory is added to the system asserted Memory Removed Information This event is generated when BANK DIMM presence was memory is removed from the asserted system emory Cfg Err Critical This event is generated when memory configuration is configuration error BANK incorrect for the system DIMM
175. r Management Messages Fan Enclosure Messages Some systems are equipped with a protective enclosure for fans Fan enclosure messages listed in Table 2 10 monitor whether foreign objects are present in an enclosure and how long a fan enclosure is missing from a chassis Table 2 10 Fan Enclosure Messages Event Description Severity Cause ID 1450 Fan enclosure sensor Critical The fan enclosure sensor in has failed Failure the specified system failed sheo location Error a ae chassis location Une ehassiss ocation information 1s l provided Chassis location lt Name of chassis gt 1451 Fan enclosure sensor Warning The fan enclosure sensor in value unknown the specified system could not cansor Toes on ce a ne sensor lt Location in chassis gt an e Hassle ocation information is provided Chassis location lt Name of chassis gt 1452 Fan enclosure inserted Information A fan enclosure has been into system inserted into the specified concor Thea a a sensor and chassis CTOCA Tor in Chase a gt ocation information 1s provided Chassis location lt Name of chassis gt 1453 Fan enclosure removed Warning A fan enclosure has been from system Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt removed from the specified system The sensor and chassis location information is provided Server Management Messages 47 Table 2 10 Fan Enclosure Messages continue
176. r cancelled Clear Alert 901 Copyback Informational the copyback operation Number cancelled Action None None Related Alert Number 2060 LRA Number None 2185 Physical disk Warning Non Cause This alert is Clear Alert 903 Copyback critical provided for Number stopped for spare Storage Management Message Reference informational purposes None Action None Related Alert Number 2060 LRA Number None 141 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2186 The controller Warning Cause The controller Clear Alert 753 cache has been Non critical discarded has flushed the cache and any data in the cache has been lost This may happen if the system has memory or battery problems that cause the controller to distrust the cache Although user data may have been lost this alert does not always indicate that relevant or user data has been lost Action Verify that the battery and memory are functioning properly None Related Alert None LRA Number 2060 2187 Single bit ECC error limit exceeded on the controller DIMM Warning Non critical 142 Cause The system memory is malfunctioning Action Contact Dell technical support to replace the controller memory Storage Management Message Reference Clear Alert 753 None Related Alert
177. r power Critical Temperature of specified power supply lt number gt is supply entered into critical state outside of range An under voltage fault Critical The specified power supply detected on power supply detected inefficient voltage lt number gt An over voltage fault Critical The specified power supply detected on power supply lt number gt 258 detected an over voltage condition System Event Log Messages for IPMI Systems Table 4 5 Power Supply Events continued Event Message Severity Cause An over current fault Critical The specified power supply detected on power supply detected an over current lt number gt condition Fan failure detected on Critical The specified power supply fan power supply lt number gt has failed Communication has been Information This event is generated when the restored to power supply power supply has recovered from lt number gt an earlier communication problem A power supply wattage Critical This event is generated when mismatch is detected there is more than one power power supply lt number gt is supplies in the system and the rated for lt value gt watts power supply wattage do not match Power supply lt number gt Information This event is generated when the wattage mismatch power supply has recovered from corrected an earlier power supply wattage mismatch Power supply redundancy Critical Power supply redundancy is lost if is lost only
178. ratures become too high inside a chassis also monitors the temperature in a variety of locations in the chassis and in attached system s Fan Sensor Monitors fans in various locations in the chassis and in attached system s Voltage Sensor Monitors voltages across critical components in various chassis locations and in attached system s Current Sensor Monitors the current or amperage output from the power supply or supplies in the chassis and in attached system s Chassis Intrusion Sensor Monitors intrusion into the chassis and attached system s Redundancy Unit Sensor Monitors redundant units critical units such as fans AC power cords or power supplies within the chassis also monitors the chassis and attached system s For example redundancy allows a second or nth fan to keep the chassis components at a safe temperature when another fan has failed Redundancy is normal when the intended number of critical components are operating Redundancy is degraded when a component fails but others are still operating Redundancy is lost when there is one less critical redundancy device than required Power Supply Sensor Monitors power supplies in the chassis and in attached system s Memory Prefailure Sensor Monitors memory modules by counting the number of Error Correction Code ECC memory corrections Fan Enclosure Sensor Monitors protective fan enclosures by detecting their removal from and
179. rd system board or drive carrier in the specified system detected an error from which it cannot recover The sensor location chassis location previous state and temperature sensor value information is provided Server Management Messages 25 Cooling Device Messages The cooling device sensors listed in Table 2 3 monitor how well a fan is functioning Cooling device messages provide status and warning information for fans in a particular chassis Table 2 3 Cooling Device Messages Event Description ID Cause 1100 Fan sensor has failed Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt A fan sensor in the specified system is not functioning The sensor location chassis location previous state and fan sensor value information is provided 1101 Fan sensor value unknown Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Fan sensor value lt Reading gt 26 Server Management Messages A fan sensor in the specified system could not obtain a reading The sensor location chassis location previous state anda nominal fan sensor value information is provided Table 2 3 Cooling Device Messages continued Event Description ID Cause 1102 Fan sensor returned to a normal value
180. re requirements In particular if Storage Management experiences performance problems you should verify that you have the minimum supported versions of the drivers and firmware installed Storage Management Message Reference Clear Alert 101 None Related Alert None LRA Number None Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2165 The RAID Warning Cause Storage Clear Alert 753 controller Non critical Management is unable None firmware and to determine whether Related driver l the system has the Alert None validation was minimum required not performed versions of the RAID LRA The controller firmware and Number configuration drivers This situation 2060 file cannot be may occur for a variety opened of reasons For example the installation directory path to the configuration file may not be correct The configuration file may also have been removed or renamed Action Reinstall Storage Management Storage Management Message Reference 131 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2166 The RAID Warning Cause Storage Clear Alert 753 controller Non critical Management is unable None firmware and to determine whether Related driver the system has the Alert None validation was min
181. re stat for example Satare Discrete temperature stat Good Discrete voltage Specifies the state of the voltage sensor for example state lt State gt Discrete voltage state Good Fan sensor value Specifies the fan speed in revolutions per minute RPM lt Reading gt or On Off for example Fan sensor value in RPM 2600 Fan sensor value Off Log type lt Log Specifies the type of hardware log for example type gt Log type ESM Introduction 15 Table 1 2 Event Description Reference continued Description Line Item Explanation Memory device bank location lt Bank name in chassis gt Specifies the name of the memory bank in the system that generated the message for example Memory device bank location Bank_1 Memory device location lt Device name in chassis gt Specifies the location of the memory module in the chassis for example Memory device location DIMM A Number of devices required for full Specifies the number of power supply or cooling devices required to achieve full redundancy for example Number of devices required for full redundancy 4 redundancy lt Number gt Peak value in Watts lt Reading gt Specifies the peak value in Watts for example Peak value in Watts 1 693 Possible memory module event cause lt list of causes gt Specifies a list of possible causes for the memory module event for example Possibl
182. redundancy degraded the redundancy has been degraded lt Sensor Name Location gt Critical The fan specified by lt Sensor Name Fan Redundancy sensor redundancy lost System Event Log Messages for IPMI Systems Location gt may have failed and hence the redundancy that was degraded previously has been lost 251 Table 4 3 Fan Sensor Events continued Event Message Severity Cause lt Sensor Name Location gt Information Fan Redundancy sensor redundancy regained Fan lt number gt RPM is Warning less than the lower warning threshold Fan lt number gt RPM is Critical less than the lower critical threshold Fan lt number gt RPM is Warning greater than the upper warning threshold Fan lt number gt RPM is Critical greater than the upper critical threshold Fan lt number gt RPM is Critical outside of range Fan lt number gt RPM is Information within range Fan lt number gt is Critical removed Fan lt number gt was Information inserted Fan lt number gt is Information present Fan lt number gt is Critical absent The fans are Information redundant The fan specified by lt Sensor Name Location gt may have started functioning again and hence the redundancy has been regained The speed of the specified fan might not provide enough cooling to the system The speed of the specified fan is not sufficient to provide enough cooling to the system The speed
183. returned to normal EMM 1 Controller 1 Connector 0 Enclosure 2 6 Storage Management Message Reference Alert Message Change History The following table describes the changes made to the Storage Management alerts from the previous release of Storage Management to the current release Table 3 3 Alert Message Change History Storage Management 4 1 2 Product Versions to which changes apply Storage Management 4 1 2 Dell OpenManage Server Administrator 7 1 2 New Alerts 2699 2700 2701 2702 2703 2704 2705 2874 2875 2876 2900 2901 2902 2903 2904 2905 2906 2907 2908 2909 2910 2911 2912 2913 2914 2915 2916 2917 2918 2919 2920 2921 2922 2923 2924 2930 2931 2932 2933 Deleted Alerts None Modified Alerts None Storage Management 4 1 Product Versions to which changes apply Storage Management 4 1 0 Dell OpenManage Server Administrator 7 1 0 New Alerts 2432 Deleted Alerts None Modified Alerts None Storage Management 4 0 Product Versions to which changes apply Storage Management 4 0 0 Dell OpenManage Server Administrator 7 0 0 New Alerts 2425 2426 2429 2430 2431 Deleted Alerts None Modified Alerts None Storage Management 3 5 Product Versions to which changes apply Storage Management 3 5 0 Dell OpenManage Server Administrator 6 5 0 New Alerts None Deleted Alerts None Storage Managemen
184. ring system startup 216 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2362 Physical OK Normal Cause This alert is for Clear Alert 751 disk s have Informational informational purposes None been removed Action None Related from a virtual Alert None disk The virtual disk is LRA in Failed state Number during the None next system reboot 2364 Allvirtual disks OK Normal Cause This alert is for Clear Alert 751 are missing Informational informational purposes None from the Action None Related controller This Alert None situation was discovered LRA during system Number startup None 2366 Dedicated OK Normal Cause This alert is for Clear Alert 901 spare imported Informational informational purposes None B global due Action None Related to missing Alert None arrays i LRA Number None Storage Management Message Reference 217 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2367 Rebuild is not Warning Cause The physical Clear Alert 903 possible Non critical disk is using an None because incompatible Related mixing of technology Alert different Action All physical Number media type disks in the virtual disk 2326 SSD HDD must use the same and
185. rmation Numbers 2354 Enclosure OK Normal Cause This alert is Clear Alert 851 firmware Informational provided for Status download in informational purposes None progiess Action None Related Alert None LRA Number None 2355 Enclosure Warning Cause The system was Clear Alert 853 firmware Non critical unable to download Status download firmware to the None failed enclosure The Related controller may have lost Alert None communication with the enclosure There LRA may have been Number problems with the data 2090 transfer or the download media may be corrupt 212 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2355 Action Attempt to Cont download the enclosure firmware again If problems continue verify that the controller can communicate with the enclosure Make sure that the enclosure is powered on Check the cables See the Cables Attached Correctly section for more information on checking the cables Verify the health of the enclosure and its components To verify the health of the enclosure select the enclosure object in the tree view The Health subtab displays a red X or yellow exclamation point for enclosure components that are failed or degraded Storage Management Message Reference 213 Table 3 4 Storage Management Messages continued Event Sev
186. rmational Cause User triggered Clear Alert 901 on Physical action None disk from Action Configure the Related READY to drive to be non raid Alert None Non RAID using CLI GUI ERA Number 234 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2426 State change Informational Cause User triggered Clear Alert 901 on Physical action None disk from Non Action Configure the Related RAID to drive to be ready using Alert None READY CLI GUI LRA Number None 2429 Drive Prepared Informational Cause User triggered Clear Alert 901 for Removal action None Action Execute Related Prepare to Remove Alert None task from Ul in a LRA PCleSSD setup Number None 2430 Drive Export Informational Cause User triggered Clear Alert 901 Log action None Action Execute export Related log for physical device Alert None LRA Number None 2431 Physical Informational Cause User triggered Clear Alert 901 Device Full task None Initialization Action None Related completed Alert None LRA Number None Storage Management Message Reference 235 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2432 The PCleSSD Warming Cause Last full Clear Alert 902 device was initialization was No
187. s lt State gt location previous Battery Sensor status state and battery sensor status lt status gt x ee information is provided 1704 Battery sensor detected a Error A battery sensor in failure value the specified system Sensor Location lt Location in a ney Jed chassis gt a battery has aled The sensor location chassis Location lt Name of chassis location chassis gt previous state and Previous state was lt State gt battery sensor status inf ea Battery sensor status ie ae lt status gt proves 58 Server Management Messages Table 2 15 Battery Sensor Messages continued Event Description Severity Cause ID 1705 Battery sensor detected a Error A battery sensor in non recoverable valu Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Battery sensor status lt status gt the specified system could not retrieve a value The sensor location chassis location previous state and battery sensor status information is provided Secure Digital SD Card Device Messages The SD card device sensors monitor instrumented SD card devices in the system Table 2 16 lists the messages that provide status and error information for SD card devices present in a chassis Table 2 16 SD Card Device Messages Event Description Severity Cause ID 1750 SD card device sensor has Error An SD card device failed
188. s DMA controller failure 86 Interrupt controller failure This error code indicates interrupt controller failure 87 Timer refresh failure This error code indicates timer refresh failure 88 Programmable interval This error code indicates a programmable timer error interval timer error 89 Parity error This error code indicates a parity error 8A SIO failure This error code indicates SIO failure 8B Keyboard controller failure This error code indicates keyboard controller failure 8C SMI initialization failure This error code indicates SMI System Event Log Messages for IPMI Systems initialization failure 277 Table 4 13 POST Code Errors continued Fatal Error Description Cause Code Co Shutdown test failure This error code indicates a shutdown test failure Cl POST Memory test failure This error code indicates bad memory detection C2 RAC configuration failure Check screen for the actual error message C3 CPU configuration failure Check screen for the actual error message C4 Incorrect memory Memory population order not correct configuration FE General failure after video Check screen for the actual error message Operating System Generated System Events Table 4 14 Operating System Generated Events Description Severity Cause System Event OS stop Information The operating system was event shutdown restarted normally OS graceful shutdown detected OEM Event data record
189. s a member of the virtual disk and is no longer assigned as a hot spare You need to assign a new hot spare to maintain data protection in this situation On the CERC SATA1 5 6ch and CERC SATA1 5 2s controllers if you use another application such as the BIOS to include a hot spare in a virtual disk then Storage Management unassigns the physical disk as a hot spare Storage Management Message Reference 93 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2099 Cont Action Although this alert is provided for informational purposes you may need to assign a new hot spare to the virtual disk 2100 Temperature exceeded the maximum warning threshold Warning Non critical 4 Cause The physical disk enclosure is too hot A variety of factors can cause the excessive temperature For example a fan may have failed the thermostat may be set too high or the room temperature may be too hot Action Check for factors that may cause overheating For example verify that the enclosure fan is working You should also check the thermostat settings and examine whether the enclosure is located near a heat source Make sure the enclosure has enough ventilation and that the room temperature is not too hot See the physical disk enclosure documentation for more diagnostic informatio
190. s event is generated when the while the power is earlier intrusion has been corrected off while the power is off BIOS Generated System Events The BIOS generated messages monitor the health and functionality of the chipsets I O channels and other BIOS related functions Table 4 12 BIOS Generated System Events Event Message Severity Cause System Event I O channel Critical This event is generated when a chk critical interrupt is generated in the I O Channel System Event PCI Parity Critical This event is generated when a Err parity error is detected on the PCI bus System Event Chipset Err Critical This event is generated when a chip error is detected System Event PCI System Information This event indicates historical Err data and is generated when the system has crashed and recovered System Event PCI Fatal Critical This error is generated when a Err fatal error is detected on the PCI bus 268 System Event Log Messages for IPMI Systems Table 4 12 BIOS Generated System Events continued Event Message Severity Cause System Event PCIE Fatal Critical This error is generated when a Err fatal error is detected on the PCIE bus POST Err Critical This event is generated when an error occurs during system boot See the system documentation for more information on the error code POST fatal error lt number gt Critical This event is generated when a o
191. s the global 2070 hot spare Storage Management Message Reference 173 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2278 The controller OK Normal Cause The battery is ClearAlert 1151 battery charge Informational discharging A battery None level is below discharge is a normal Related a normal activity during the Alert threshold battery Learn cycle The Number battery Learn cycle 2199 recharges the battery You should receive LRA alert 2179 when the Number recharge occurs None Action1 Check if the battery Learn cycle is in progress The battery also displays the Learn state while the Learn cycle is in progress Action2 If a Learn cycle is not in progress replace the battery pack 2279 The controller OK Normal Cause This alert Clear Alert 1151 battery charge Informational indicates that the None level is battery is recharging Related operating during the battery Aler No e within normal Learn cycle This alert is limits provided for LRA informational purposes Number None Action None 174 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2280 Adisk media OK Normal Cause A disk media Clear Alert 1201 error has been Informational error was detected None c
192. sed for Related copyback Alert Number None LRA Number None 2199 The virtual OK Normal Cause This alert is for Clear Alert 1201 disk cache Informational informational purposes None policy has Action None Related changed Alert None LRA Number None 2200 Copyback not Warning Non Cause This alert is for Clear Alert 903 possible as critical informational purposes None SAS SATA is Action None Related not supported Alert None in the same virtual disk LRA Number None Storage Management Message Reference 147 Table 3 4 Storage Management Messages continued 148 Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2201 lt Aglobalhot Warning Cause The controller is Clear Alert 903 spare failed Non critical not able to None communicate with a Related disk that is assigned as aa jeyy dedicated hot spare Number The disk may have been 2048 removed There may also be a bad or loose LRA cable Number 2070 Action Check if the disk is healthy and that it has not been removed Check the cables If necessary replace the disk and reassign the hot spare 2202 Aglobalhot OK Normal Cause The controller is Clear Alert 901 spare has been Informational unable to communicate None removed with a disk that is Related assigned as a global hot jest None spare The disk may have been removed LRA There may also be a bad Number or loose cab
193. ses if the multi bit error occurs during a read operation the data on the disk may be OK If the multi bit error occurs during a write operation data loss has occurred Action Replace the dual in line memory module DIMM The DIMM is a part of the controller battery pack See your hardware documentation for information on replacing the DIMM You may need to restore data from backup None Related Alert None LRA Number 2061 179 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2290 Single bit Warning Cause An error Clear Alert 753 ECC error on Non critical involving a single bit None controller has been encountered Related DIMM during a read or write Alert None operation The error correction algorithm LRA has corrected this error Number 2060 Action None 2291 An enclosure OK Normal Cause This alert is for ClearAlert 851 management Informational informational purposes None module Action None Related EMM has Alert None been discovered LRA Number None 2292 Communicatio Critical Cause The controller Clear Alert 854 n with the Failure Error has lost communication Number enclosure has with an EMM The 2162 been lost cables may be loose or Related defective Alert None Action Make sure the LRA cables are attached N mib r securely Reboot the 2091 system 180 Storage
194. sor Name Location gt is returned to normal state 249 250 Table 4 2 Voltage Sensor Events continued Event Message Severity Cause lt Sensor Name Location gt Warning Voltage of the monitored voltage sensor detected a entity warning lt Reading gt lt Sensor Name Location gt exceeded the warning threshold lt Sensor Name Location gt Information The voltage of a previously voltage sensor returned to reported normal lt Reading gt lt Sensor Name Location gt is returned to normal state The lt Sensor Name Location gt Warning Voltage of the monitored voltage is less than the Entity lt Sensor Name lower warning threshold Location gt exceeded the warning threshold The lt Sensor Name Location gt Critical Voltage of the monitored voltage is less than the Entity lt Sensor Name lower critical threshold Location gt exceeded the critical threshold The lt Sensor Name Location gt Warning Voltage of the monitored voltage is greater than the Entity lt Sensor Name upper warning threshold Location gt exceeded the warning threshold The lt Sensor Name Location gt Critical Voltage of the monitored voltage is greater than the Entity lt Sensor Name upper critical threshold Location gt exceeded the critical threshold The lt Sensor Name Location gt Critical Voltage of the monitored voltage is outside of Entity lt Sensor Name range Location gt is outside of n
195. ssigned a physical disk as a dedicated hot spare to a virtual disk This alert is provided for informational purposes Action None Clear Alert 901 2161 Related Alert None LRA Number None 127 Table 3 4 Storage Management Messages continued Event ID Description Severity Cause and Action Related SNMP Alert Trap Information Numbers 2161 Dedicated hot OK Normal spare Informational unassigned 1238 Cause A physical disk that was assigned as a hot spare has been unassigned and is no longer functioning as a hot spare The physical disk may have been unassigned by a user or automatically unassigned by Storage Management Storage Management unassigns hot spares that have been used to rebuild data Once data is rebuilt onto the hot spare the hot spare becomes a member of the virtual disk and is no longer assigned as a hot spare You need to assign a new hot spare to maintain data protection in this situation On the CERC SATA1 5 6ch and CERC SATA1 5 2s controllers if you use another application such as the BIOS to include a hot spare in a virtual disk then Storage Management unassigns the physical disk as a hot spare Storage Management Message Reference Clear Alert 901 None Related Alert None LRA Number None Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Informatio
196. state and the number of devices required for full redundancy information is provided 1305 Redundancy degraded Warning A redundancy sensor in Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt 40 Server Management Messages the specified system detected that one of the components of the redundancy unit has failed but the unit is still redundant The redundancy unit location chassis location previous redundancy state and the number of devices required for full redundancy information is provided Table 2 7 Redundancy Unit Messages continued Event Description Severity Cause ID 1306 Redundancy lost Error A redundancy sensor in the specified system detected that one of the components in the redundant unit has been disconnected has failed Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chasgroz or is not present Previous redundancy state The redundancy unit was lt State gt location chassis location previous redundancy state and the number of devices required for full redundancy are provided Server Management Messages 41 Power Supply Messages The power supply sensors monitor how well a power supply is functioning The power supply messages listed in Table 2 8 provide status and warning information for power supplies present in a particular
197. supply has normal Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt Server Management Messages been reconnected or replaced The sensor location chassis location previous state power supply type additional power supply status and configuration error type information are provided 43 Table 2 8 Power Supply Messages continued Event Description Severity Cause ID 1353 Power supply detected a Warning A power supply sensor warning Sensor location reading in the lt Location in chassis gt specified system Chassis location lt Name of exceeded _ a user definable chassis gt warning threshold Previous state was lt State gt The s n or location Power Supply type lt type of chassis location power supply gt previous state power wer ly ty lt Additional power supply Supp OPE additional power status information gt supply status and If in configuration error configuration error state type information Configuration error type are provided lt type of configuration error gt 1354 Power supply detected a Error A power supply has failure been disconnected or Sensor location lt Location A oA de ChAS aioe ocation c
198. system e Operating system documentation e Application program documentation Understanding Event Messages This section describes the various types of event messages generated by the Server Administrator When an event occurs on your system Server Administrator sends information about one of the following event types to the systems management console Table 1 1 Understanding Event Messages Icon Alert Severity Component Status OK Normal An event that describes the successful operation of a unit Informational The alert is provided for informational purposes and does not indicate an error condition For example the alert may indicate the normal start or stop of an operation such as power supply or a sensor reading returning to normal Warning An event that is not necessarily significant but may indicate a Non critical possible future problem For example a Warning Non critical a alert may indicate that a component such as a temperature probe in an enclosure has crossed a warning threshold Critical A significant event that indicates actual or imminent loss of QO Failure Error data or loss of function For example crossing a failure threshold or a hardware failure such as an array disk 8 Introduction Server Administrator generates events based on status changes in the following sensors Temperature Sensor Helps protect critical components by alerting the systems management console when tempe
199. system could not obtain a reading The sensor location chassis location previous state and a nominal voltage sensor value are provided 29 Table 2 4 Voltage Sensor Messages continued Event Description Severity Cause ID 1152 Voltage sensor returned to Information A voltage sensor in a normal value the specified system Sensor location lt Location apa ae nies tn ehassiss range a ter crossing a failure threshold Chassis location lt Name of The sensor location chassis gt chassis location Previous state was lt State gt previous state and If sensor type is not voltage SERO value information is discrete provided Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt 1153 Voltage sensor detected a Warning A voltage sensor in warning value the specified system Sensor location lt Location E eaten in chee thresho The sensor location chassis Chassis location lt Name of 30 chassis gt Previous state was lt State gt If sensor type is not discrete Voltage sensor value in Volts lt Reading gt If sensor type is discrete Discrete voltage state lt State gt Server Management Messages location previous state and voltage sensor value information is provided Table 2 4 Voltage Sensor Messages continued Event Description Severity Cause ID 1154 Voltage sensor detected Error A voltage
200. system lt Sensor lt Reading gt Name Location gt returned from critical state to non critical state lt Sensor Name Location gt Information Temperature of the backplane temperature sensor board system board or the carrier returned to normal state in the specified system lt Sensor lt Reading gt Name Location gt returned to normal operating range The lt Sensor Name Warning Temperature of the backplane Location gt temperature is system board system inlet or the less than the lower carrier in the specified system warning threshold lt Sensor Name Location gt entered into non critical state The lt Sensor Name Critical Temperature of the backplane Location gt temperature is system board system inlet or the less than the lower carrier in the specified system critical threshold lt Sensor Name Location gt entered into critical state The lt Sensor Name Warning Temperature of the backplane Location gt temperature is system board system inlet or the greater than the upper carrier in the specified system warning threshold lt Sensor Name Location gt entered into non critical state The lt Sensor Name Critical Temperature of the backplane Location gt temperature is system board system inlet or the greater than the upper carrier in the specified system critical threshold lt Sensor Name Location gt entered into critical state The lt Sensor Name Critical Temperature of the backplane Location gt temperature
201. t None Caching LRA functionality is Number disabled None Expired days FluidCache 2919 Running onan_ Error A valid permanent Clear Alert 1604 expired invalid license must be None license installed Related Configuration Alert None changes are LRA disabled Number FluidCache None 2920 Alicensehas Information No action required Clear Alert 1601 been installed None FluidCache Related Alert None LRA Number None Storage Management Message Reference 243 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2921 Alicensehas Information A license should be Clear Alert 1601 been removed installed None FluidCache Related Alert None LRA Number None 2922 Not enough Error You must run on a Clear Alert 1604 memory to run system with adequate None necessary memory Related services Alert None FluidCache LRA Number None 2923 Oneormore Error To resolve the issue Clear Alert 1604 cache devices insert the missing cache None are missing device If the cache Related Cache is device was unplugged Alert None hung reactivate it LRA FluidCache Number None 2924 All cache Information No action required Clear Alert 1601 devices have None been found Related and registered Alert None FluidCache LRA Number None Storage Management Message Reference 244 Table 3 4 Storage Management Messages conti
202. t Message Reference 69 Table 3 3 Alert Message Change History continued Modified Alerts 2388 2347 2081 Storage Management 3 4 Product Versions to which Storage Management 3 4 0 changes apply Dell OpenManage Server Administrator 6 4 0 New Alerts 2405 2406 2407 2408 2409 2410 2411 2412 2413 2414 2415 2416 2417 2418 NOTE The CacheCade feature is available from calendar year 2011 Deleted Alerts None Modified Alerts None Storage Management 3 3 Product Versions to which Storage Management 3 3 0 changes apply Dell OpenManage Server Administrator 6 3 0 New Alerts 2394 2395 2396 2397 2398 2399 2400 2401 2402 2403 2404 Deleted Alerts None Modified Alerts Alert severity changed for 1151 and 1351 Storage Management 3 2 Product Versions to which Storage Management 3 2 0 changes apply Dell OpenManage Server Administrator 6 2 0 New Alerts 2387 2388 2389 2390 2392 2393 Deleted Alerts None Modified Alerts None Alert Descriptions and Corrective Actions The following sections describe alerts generated by the RAID or SCSI controllers supported by Storage Management The alerts are displayed in the Server Administrator Alert tab or through Windows Event Viewer These alerts can also be forwarded as SNMP traps to other applications SNMP traps are generated for the alerts listed in the following sections These traps are included in the Dell OpenMan
203. t Messages continued Event Description Severity Cause ID 1302 Redundancy not applicable Information A redundancy sensor in Redundancy unit oe o system lt Redundancy location etected that a unit was not redundant in chassis gt l l The redundancy Chass TS location lt Name of location chassis location chassis gt previous redundancy Previous redundancy state state and the number of was lt State gt devices required for full redundancy information is provided 1303 Redundancy is offline Information A redundancy sensor in Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt Server Management Messages the specified system detected that a redundant unit is offline The redundancy unit location chassis location previous redundancy state and the number of devices required for full redundancy information is provided 39 Table 2 7 Redundancy Unit Messages continued Event Description Severity Cause ID 1304 Redundancy regained Information A redundancy sensor in Red nasney unit the specified system lt Redundancy location detected that a lost ia chassis redundancy device has been reconnected or Chass i s location lt Name of replaced full redundancy chassis gt is in effect The Previous redundancy state redundancy unit was lt State gt location chassis location previous redundancy
204. t link tuning or tuning or the Flex addressing flex address Mezz XX feature was asserted LinkT FlexAddr Link Critical This event is generated when Tuning sensor failed to BIOS fails to program virtual program virtual MAC MAC address on the given address lt location gt was NIC device asserted PCIE NonFatal Er Non Warning This event is generated in Fatal IO Group sensor association with a CPU IERR PCIe error lt location gt I O Fatal Err Fatal IO Critical This event is generated in Group sensor fatal IO association with a CPU IERR error lt location gt and indicates the PCI PCle device that caused the CPU IERR Unknown system event Critical This event is generated when an sensor unknown system unknown hardware failure is hardware failure was detected asserted An I O channel check error Critical This event is generated when a was detected critical interrupt is generated in the I O Channel A PCI parity error was Critical This event is generated when a detected on a component at parity error is detected on the bus lt number gt device PCI bus lt number gt function lt number gt A PCI parity error was Critical This event is generated when a detected on a component at parity error is detected on the slot lt number gt PCI bus 272 System Event Log Messages for IPMI Systems Table 4 12 BIOS Generated System Events continued Event Message Severity Cause A PCI system error
205. table Failure Error 184 Cause The controller is not receiving a consistent response from the enclosure There could be a firmware problem or an invalid cabling configuration If the cables are too long they degrade the signal Action Power down all enclosures attached to the system and reboot the system If the problem persists upgrade the firmware to the latest supported version You can download the most current version of the driver and firmware from support dell com Make sure the cable configuration is valid See the hardware documentation for valid cabling configurations Storage Management Message Reference Clear Alert 854 None Related Alert None LRA Number 2091 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2301 The enclosure Critical Cause The enclosure or ClearAlert 854 has a hardware Failure Error an enclosure None error component is in a Related Failed or Degraded Alert None state LRA Action Check the Number health of the enclosure 2091 and its components Replace any hardware that is in a Failed state See the hardware documentation for more information 2302 The enclosure Critical Cause The enclosure or Clear Alert 854 is not Failure Error an enclosure None responding component is in a Related Failed or Degraded Alert None state LRA Action
206. the intended number of critical components are operating Redundancy is degraded when a component fails but others are still operating Redundancy is lost when the number of components functioning falls below the redundancy threshold Table 2 7 lists the redundancy unit messages The number of devices required for full redundancy is provided as part of the message when applicable for the redundancy unit and the platform For details on redundancy computation see the respective platform documentation Table 2 7 Redundancy Unit Messages Event Description Severity Cause ID 1300 Redundancy sensor has Warning A redundancy sensor in failed the specified system Redundancy unit lt Redundancy a The Seems toGabi on ah Chasisiss unit location hassis location previous Chassis location lt Name of redundancy state and chassis gt the number of devices Previous redundancy state required for full was lt State gt redundancy are provided 1301 Redundancy sensor value Warning A redundancy sensor in unknown the specified system 38 Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt Server Management Messages could not obtain a reading The redundancy unit location chassis location previous redundancy state and the number of devices required for full redundancy are provided Table 2 7 Redundancy Uni
207. tion previous state and Chassis Locattor processor sensor status lt Name of chassis gt are provided Previous state was lt State gt Processor sensor status lt status gt 1605 Processor sensor Error A processor sensor in the detected a non recoverable valu Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt specified system has failed The sensor location chassis location previous state and processor sensor status are provided 54 Server Management Messages Pluggable Device Messages The pluggable device messages listed in Table 2 14 provide status and error information when some devices such as memory cards are added or removed Table 2 14 Pluggable Device Messages Event Description Severity Cause ID 1650 lt Device plug event Information A pluggable device event message type unknown gt of unknown type was received beries locatis S device eae ee lt Location in chassis ponies a o es 3 if avai Tapas etails if available are provided Chassis location lt Name of chassis if available gt Additional details lt Additional details for the events if available gt 1651 Device added to Information A device was added in the system Device location lt Location in chassis gt Chassis location lt Name of chassis gt Additional details lt Additi
208. tion Severity Cause and Action ID Related SNMP Alert Trap Information Numbers 2063 Virtual disk OK Normal Cause This alert is for reconfiguratio Informational informational purposes n started Action None Clear Alert 1201 Number 2090 Related Alert Number None LRA Number None 2064 Virtual disk OK Normal Cause This alert is for rebuild started Informational informational purposes Action None Clear Alert 1201 Number 2091 2065 Physical disk OK Normal Cause This alert is for rebuild started Informational informational purposes Action None Clear Alert 901 Number 2092 Related Alert Number 2099 2121 2196 LRA Number None Storage Management Message Reference 79 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2067 Virtual disk OK Normal Cause The check Clear Alert 1201 check Informational consistency operation Number consistency was cancelled because a None cancelled physical disk in the Related array has failed or Alert because a user cancelled Number the check consistency None operation LRA Action If the physical Number disk failed then replace None the physical disk You can identify which disk failed by locating the disk that has a red X for its status Perform a rescan after replacing the disk The consistency check can tak
209. tional informational purposes None cycle starts in The l indicates a Related l days substitution variable Alert None The text for this i substitution variable is LRA displayed with the alert Number in the alert log and can None vary depending on the situation Action None 2181 The controller OK Normal Cause The 1 Clear Alert 1151 battery learn Informational indicates a substitution None eyele starts variable The text for Related in 1 hours this substitution Alert None variable is displayed with the alert in the LRA alert log and can vary Number depending on the None situation This alert is for informational purposes Action None 2182 An invalid SAS Critical Cause The controller Clear Alert 754 configuration Failure Error and attached enclosures None has been are not cabled correctly Related detected Action See the Alert None hardware l LRA documentation for Nunib r information on correct 2061 cabling configurations 140 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2183 Copyback Critical Cause The physical Clear Alert 904 failed on Failure Error disk participating in the None physical disk copyback operation has Related 1 from failed Alert physical disk Action None Number ae 2060 LRA Number None 2184 Physical disk OK Normal Cause Use
210. tional informational purposes None shutdown is Action None Related disabled Alert None LRA Number None 2264 lt A device is Warning Cause The controller Clear Alert 753 missing Non critical cannot communicate None 803 with a device The Related 853 device may be removed Ajert None 293 There may also be a bad 953 or loose cable LRA 1003 i i Number 1053 Action Check if the 2050 2060 1103 device is in and not NONM 2070 2080 1153 removed If it 1S 1n 2090 2100 1203 check the cables Also check the connection to the controller battery and the battery health A battery with a weak or depleted charge may cause this alert 166 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2265 Adevice isin Warning Cause The controller Clear Alert 753 an unknown Non critical cannot communicate None 803 state with a device The state Related 853 of the device cannot be Alert 903 determined There may Number 9 be a bad or loose cable 2048 2050 1003 The system may also be 1053 experiencing problems LRA 1103 with the application Number 1153 programming interface 2050 2060 1203 API There could also 2070 2080 be a problem with the 2090 2100 driver or firmware Action Check the cables Check if the controller has a supported version of the driver and firmw
211. torage Management Message Reference 104 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2114 lt Aconsistency OK Normal Cause The check Clear Alert 1201 check on a Informational consistency operation Number virtual disk has on a virtual disk was 2115 been paused paused by a user Related suspended Action To resume the Alert check consistency Number operation right click None the virtual diskinthe pa tree view and select N mber Resume Check None Consistency 2115 A consistency OK Normal Cause The check Clear Alert 1201 check on a Informational consistency operation Status virtual disk has on a virtual disk has Alert 2115 been resumed resumed processing is a clear after being paused by alert for a user This alert is for alert 2114 informational purposes Related Action None Alert Number None LRA Number None Storage Management Message Reference 105 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2116 A virtual disk OK Normal Cause A user has caused Clear Alert 1201 and its mirror Informational a mirrored virtual disk to Number have been split be split When a virtual None disk is mirrored its data Related is copied to another Alert virtual disk in order to Nomber maintain redundancy No
212. ttached securely See the Cables Attached Correctly section for more information on checking the cables Storage Management Message Reference 207 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2346 Error occurred Warning Cause A physical Clear Alert 903 l Non critical device may have an None error The 1 indicates Related a substitution variable Alert The text for this Number substitution variable is 2048 2050 generated by the 2056 2057 firmware and is i displayed with the alert ae in the alert log This 2095 3129 text can vary depending 2201 3793 on the situation 2270 2282 Action Verify the 2369 health of attached LRA devices Review the N mber alert log for significant 2070 events Run the PHY integrity diagnostic tests You may need to replace faulty hardware Make sure the cables are attached securely See the hardware documentation for more information 208 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2347 The rebuild Critical Hardware RAID Clear Alert 904 failed due to errors on the source physical disk Failure Error Storage Management Message Reference Cause You are attempting to rebuild data that resides on a defective disk
213. type et type information is i provided The SD card of SD card device gt state is provided if an SD card state lt State of SD card is present in SD card gt the SD card device 60 Server Management Messages Table 2 16 SD Card Device Messages Event Description Severity Cause ID 1753 SD card device detected a Warning An SD card device warning sensor in the specified Sensor location lt Location system aN T in Chassis warming con ition ne sensor location chassis Chassis location lt Name of location previous state chassis gt and SD card device Previous state was type information is lt State gt provided The SD card Sb card device typen iyo state is provided if an BR eara de creg SD card is present in the SD card device SD card state lt State of SD card gt 1754 SD card device detected a Error An SD card device failure Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt SD card device typ lt Typ of SD card device gt SD card state lt State of SD card gt Server Management Messages sensor in the specified system detected an error The sensor location chassis location previous state and SD card device type information is provided The SD card state is provided if an SD card is present in the SD card device 61 Table 2 16 SD Card Device Messages Event Description Severity Cause
214. upported Manager installation Alert on Dell Action Installing Number OpenManage Storage Management None Server and Array Manageron RA Administrator the same system is nota Number version 6 0 1 supported 5 Storage Management Message Reference configuration Uninstall either Storage Management or Array Manager 117 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2136 Virtual disk OK Normal Cause Virtual disk Clear Alert 1201 initialization Informational initialization is in Number progress This alert is for 2088 informational purposes Related Action None Alert Number None LRA Number None 118 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2137 Communica Warning Cause The controller is Clear Alert 853 tion timeout Non critical Storage Management Message Reference unable to communicate with an enclosure There are several reasons why communication may be lost For example there may be a bad or loose cable An unusual amount of I O may also interrupt communication with the enclosure In addition communication loss may be caused by software hardware or firmware problems bad or failed power supplies and enclosure shutdown When
215. value in Watts is Watts lt Reading gt provided 1014 System software Warning This event is generated when event lt Description gt the systems management agent Date and time of detects a critical system action lt Date and time gt software generated event in the system event log which could have been resolved Temperature Sensor Messages The temperature sensors listed in Table 2 2 help protect critical components by alerting the systems management console when temperatures become too high inside a chassis The temperature sensor messages use additional variables sensor location chassis location previous state and temperature sensor value or state 22 Server Management Messages Table 2 2 Temperature Sensor Messages Event Description Severity Cause ID 1050 Temperature sensor has failed Error A temperature Sensor location lt Location in ae i i chassis gt ae ET a 2 Chassis location lt Name of aa Oat 7 1 chaseres ort Paras in the Previous state was lt State gt Spec led system failed The sensor If sensor type is not discrete location chassis Temperature sensor value location previous in degrees Celsius lt Reading gt state and If sensor type is discrete temperature SER value are provided Discrete temperature stat lt State gt 1051 Temperature sensor value Information A temperature unknown Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt
216. verity Cause System Board Power Warning This event is generated when Optimized a change in power supply Performance status degrades system sensor for System performance Board degraded user defined power capacity was asserted System Board Power Normal This event is generated when Optimized the system performance is Performance status restored sensor for System Board degraded user defined power capacity was deasserted System Board Power Critical This event is generated when Optimized a change in power supply Performance status degrades system sensor for System performance Board Halted system power exceeds capacity was asserted System Board Power Normal This event is generated when Optimized system performance was Performance status restored sensor for System Board Halted system power exceeds capacity was deasserted The system Warning This event is generated when performance degraded 282 System Event Log Messages for IPMI Systems a change degrades system performance Table 4 17 Power And Performance Events continued Description Severity Cause The system Warning This event is generated when performance a change in thermal degraded becaus protection degrades system of thermal performance protection The system Warning This event is generated when performance a change in cooling degrades degraded becaus system performance coo
217. version 7 1 2 and displayed in the Server Administrator alert log Server Administrator creates events in response to sensor status changes and other monitored parameters The Server Administrator event monitor uses these status change events to add descriptive messages to the operating system event log or the Server Administrator alert log Each event message that Server Administrator adds to the alert log consists of a unique identifier called the event ID for a specific event source category and a descriptive message The event message includes the severity cause of the event and other relevant information such as the event location and the previous state of the monitored item The tables in this guide list all Server Administrator event IDs in numeric order Each entry includes the description severity level and cause of the event ID The message text in angle brackets for example lt State gt describes the event specific information provided by the Server Administrator Introduction 7 What s New in this Release New Alert messages for Fluid cache for DAS Messages Not Described in This Guide This guide describes only event messages logged by Server Administrator and Storage Management that are displayed in the Server Administrator alert log For information on other messages generated by your system see one of the following sources e The Installation and Troubleshooting Guide or Hardware Owner s Manual shipped with your
218. viewed in the alert log the description for this event displays several variables These variables are controller and enclosure names type of communication problem return code and SCSI status Number 2162 Related Alert Number None LRA Number 2090 119 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2137 Action Check for contd problems with the cables See the online help for more information on checking the cables You should also check to see if the enclosure has degraded or failed components To do so select the enclosure object in the tree view and click the Health subtab The Health subtab displays the status of the enclosure components Verify that the controller has supported driver and firmware versions installed and that the EMM s are each running the same version of supported firmware 2138 Enclosure OK Normal Cause A user has Clear Alert 851 alarm enabled Informational enabled the enclosure Number alarm This alert is for None informational purposes Related Action None Alert Number None LRA Number None 120 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2139 Enclosure OK Normal Cause A user has Clear
219. was Critical This is generated when the detected on a component at system has crashed and bus lt number gt device recovered lt number gt function lt number gt A PCI system error was Critical This is generated when the detected on a component at system has crashed and slot lt number gt recovered A bus correctable error Critical This is generated when the was detected on a system has detected bus component at bus lt number gt correctable errors device lt number gt function lt number gt A bus correctable error Critical This is generated when the was detected on a system has detected bus component at slot correctable errors lt number gt A bus uncorrectable error Critical This is generated when the was detected on a system has detected bus component at bus lt number gt uncorrectable errors device lt number gt function lt number gt A bus uncorrectable error Critical This is generated when the was detected on a system has detected bus component at slot uncorrectable errors lt number gt A fatal error was detected Critical This error is generated when a on a component at bus fatal error is detected on the PCI lt number gt device lt number gt bus function lt number gt A fatal error was detected Critical This error is generated when a on a component at slot lt number gt fatal error is detected on the PCI bus System Event Log Messages for IPMI Systems 273 274 Table 4 12 BIOS Ge
220. wer cord status cannot be monitored The sensor and chassis location information is provided Hardware Log Sensor Messages The hardware logs provide hardware status messages to systems management software On certain systems the hardware log is implemented as a circular queue When the log becomes full the oldest status messages are overwritten when new status messages are logged On some systems the log is not circular On these systems when the log becomes full subsequent hardware status messages are lost Hardware log sensor messages listed in Table 2 12 provide status and warning information about the noncircular logs that may fill up resulting in lost status messages 50 Server Management Messages Table 2 12 Hardware Log Sensor Messages Event Description Severity Cause ID 1550 Log monitoring has Warning A hardware log sensor in the been disabled specified system is disabled tog type bod types The log type information is provided 1551 Log status is unknown Information A hardware log sensor in the Pog type xLog types specified system could not obtain a reading The log type information is provided 1552 Log size is no longer Information The hardware log on the near or at capacity specified system is no longer near fog tubes Shoe types or at its capacity usually as the result of clearing the log The log type information is provided 1553 Log size is near Warning The size of a hardware log on th
221. y lt number gt Critical This event is generated when the failed power supply has failed System Event Log Messages for IPMI Systems 257 Table 4 5 Power Supply Events continued Event Message Severity Cause A predictive failure Warning This event is generated when the detected on power supply power supply is about to fail lt number gt The power input for power Critical This event is generated when supply lt number gt is lost input power is removed from the power supply The input power for power Information This event is generated if the supply lt number gt has been power supply has been restored reconnected or replaced Power supply lt number gt is Critical This event is generated when an incorrectly configured Warning invalid power supply configuration is detected Power supply lt number gt is Information This event is generated when the correctly configured power supply has recovered from an earlier invalid configuration Power supply lt number gt is Information This event is generated when the operating normally power supply has recovered from an earlier failure event Cannot communicate with Critical The power supply may operate power supply lt number gt however power supply monitoring is degraded The temperature for power Warning Temperature of specified power supply lt number gt is ina supply entered into non critical warning range state The temperature fo
222. y to rebuild the node 2704 The cluster ID Error Service is required Clear Alert 1604 in the journal Contact Dell Technical None does not Support Related match the Alert None cluster ID in LRA the Number configuration None file FluidCache 2705 Thejournal Error Service is required Clear Alert 1604 could not be read written FluidCache Contact Dell Technical Support None Related Alert None LRA Number None Storage Management Message Reference 237 Table 3 4 Storage Management Messages continued Event Description Severity Cause and Action Related SNMP ID Alert Trap Information Numbers 2874 The following Warning There is a cache device Clear Alert 903 Cache Device specified in the None has no configuration with no Related associated associated cache server Alert None server in the configured LRA configuration Number l None FluidCache 2875 The following Information No action required Clear Alert 901 Disk is None beginning Related flushing wwn Alert None 1 path 2 LRA FluidCache Number None 2876 The following Information No action required Clear Alert 901 Disk has None finished Related flushing wwn Alert None 1 path 2 LRA FluidCacheg Number None 2900 The following Error Replace the failed Clear Alert 904 cache device device None has Related failed wwn Alert None 1 path 2 LRA FluidCache Number None 238 Storage Management Message Re
223. yed Action None Alert None LRA Number None 2389 The virtual OK Normal Cause Virtual disk bad Clear Alert 1201 disk bad block Informational blocks are cleared None medium error Action None Related is cleared Alert None LRA Number None 2390 The Instant OK Normal Cause Instant Encrypt Clear Alert 901 Encrypt Erase Informational Erase operation is None operation is successful on Self Related performed on Encryption Disks Alert None oe SEDs LRA BS Action None Number None 226 Storage Management Message Reference Table 3 4 Storage Management Messages continued Event Description Cause and Action Related SNMP ID Alert Trap Information Numbers 2392 The drive Cause The controller Clear Alert 753 Encryption failed to verify the None Key is invalid specified Passphrase Related Action Enter a correct Alert None Passphrase LRA Number 2060 2393 The virtual Cause The Encrypted Clear Alert 1201 disk is virtual disk operation None encrypted on normal virtual disk Related created using Self Alert None Pa disks only is LRA Number Action None None 2394 Persistent Hot Cause The Persistent Clear Alert 751 Spare is Hot Spare option is None enabled enabled Related Action None Alert None LRA Number None 2395 Persistent Hot Cause The Persistent Clear Alert 751 Spare is Hot Spare option is None disabled disabled Related Action None Alert None LRA Number None Storage Managem
Download Pdf Manuals
Related Search
Related Contents
LC-Power LC-35NAS storage enclosure REGISTER YOUR GUARANTEE TODAY 10/100 Mbps Network Card Rust-Oleum Specialty 241140 Instructions / Assembly Fujitsu Server PRIMEQUEST 2000 Series General Description MANUAL TÉCNICO PARA OBRAS PROVISIONALES DE Copyright © All rights reserved.
Failed to retrieve file