Dell OpenManage Server Administrator Version 5.5 Messages Reference Guide
Contents
1. Event Message | Severity | Cause
CPU Protocol Err transition to non-recoverable | Critical | This event is generated when the processor protocol enters a non-recoverable state.
CPU Bus PERR transition to non-recoverable | Critical | This event is generated when the processor bus PERR enters a non-recoverable state.
CPU Init Err transition to non-recoverable | Critical | This event is generated when the processor initialization enters a non-recoverable state.
CPU Machine Chk transition to non-recoverable | Critical | This event is generated when the processor machine check enters a non-recoverable state.
Logging Disabled [illegible] disabled was asserted | Critical | This event is generated when all event logging is disabled.
LinkT FlexAddr Tuning sensor device option ROM failed to support link tuning or flex address Mezz XX was asserted | Critical | This event is generated when the PCI device option ROM for a NIC does not support link tuning or the Flex addressing feature.
LinkT FlexAddr Tuning sensor failed to program virtual MAC address <location> was asserted | Critical | This event is generated when BIOS fails to program virtual MAC address on the given NIC device.

System Event Log Messages for IPMI Systems

Table 3-12. BIOS Generated System Events (continued)
Event Message | Severity | Cause
PCIE NonFatal Er Non Fatal IO Group sensor PCIe error | Warning
2. 2239 | A foreign configuration has been cleared | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751
2240 | A foreign configuration has been imported | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751
2241 | The Patrol Read mode has changed | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751
2242 | The Patrol Read has started | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: 2243. Related Alert Number: None. LRA Number: None | 751

Storage Management Message Reference

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2243 | The Patrol Read has stopped | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Status: Alert 2243 is a clear alert for alert 2242. Related Alert Number: None. LRA Number: None | 751
2244 | A virtual disk blink has been initiated | OK / Normal | Cause: This a
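The clear-alert relationship in this excerpt (alert 2243 is a clear alert for alert 2242) can be sketched as a small replay over a stream of alert IDs. This is only an illustration: the names `CLEARS`, `CLEARABLE`, and `outstanding_alerts` are hypothetical and are not part of Server Administrator, and the table holds just the one pair shown in this excerpt.

```python
# Hypothetical lookup table: clear alert ID -> alert IDs it clears.
# Only the Patrol Read pair (2243 clears 2242) is taken from the excerpt.
CLEARS = {2243: {2242}}

# Every alert ID that some clear alert can cancel.
CLEARABLE = set().union(*CLEARS.values())


def outstanding_alerts(event_ids):
    """Replay alert IDs in arrival order and return those still uncleared."""
    outstanding = set()
    for event_id in event_ids:
        if event_id in CLEARS:
            # A clear alert cancels the alerts it is paired with.
            outstanding -= CLEARS[event_id]
        elif event_id in CLEARABLE:
            # Track only alerts that have a documented clear alert.
            outstanding.add(event_id)
    return outstanding
```

Purely informational alerts such as 2239 are ignored here because the excerpt lists no clear alert for them; a fuller table would need the remaining pairs from the guide.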
3. Event Message Reference

Memory Device Messages

Memory device messages listed in Table 2-10 provide status and warning information for memory modules present in a particular system. Memory devices determine health status by monitoring the ECC memory correction rate and the type of memory events that have occurred.

NOTE: A critical status does not always indicate a system failure or loss of data. In some instances, the system has exceeded the ECC correction rate. Although the system continues to function, you should perform system maintenance as described in Table 2-9.

NOTE: In Table 2-9, <status> can be either critical or non-critical.

Table 2-10. Memory Device Messages
Event ID | Description | Severity | Cause
1403 | Memory device status is <status>. Memory device location: <location in chassis>. Possible memory module event cause: <list of causes> | Warning | A memory device correction rate exceeded an acceptable value. The memory device status and location are provided.
1404 | Memory device status is <status>. Memory device location: <location in chassis>. Possible memory module event cause: <list of causes> | Error | A memory device correction rate exceeded an acceptable value, a memory spare bank was activated, or a multibit ECC error occurred. The system continues to function normally (except for a multibit error). Replace the memory module identified in the message during the sys
4. Power Supply Events

The power supply sensors monitor the functionality of the power supplies. These messages provide status and warning information for power supplies for a particular system.

Table 3-5. Power Supply Events
Event Message | Severity | Cause
<Power Supply Sensor Name> power supply sensor removed | Critical | This event is generated when the power supply sensor is removed.
<Power Supply Sensor Name> power supply sensor AC recovered | Information | This event is generated when the power supply has been replaced.
<Power Supply Sensor Name> power supply sensor returned to normal state | Information | This event is generated when the power supply that failed or was removed has been replaced and the state has returned to normal.
<Entity Name> PS Redundancy sensor redundancy degraded | Information | Power supply redundancy is degraded if one of the power supply sources is removed or failed.
<Entity Name> PS Redundancy sensor redundancy lost | Critical | Power supply redundancy is lost if only one power supply is functional.
<Entity Name> PS Redundancy sensor redundancy regained | Information | This event is generated if the power supply has been reconnected or replaced.
<Power Supply Sensor Name> pre | Warning | This event is generated when the
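The redundancy rules in this excerpt (degraded when a power supply source is removed or fails, lost when only one supply is functional) can be sketched as a small classifier. The function name `ps_redundancy_state`, the state label `"full"`, and the choice to let "lost" take precedence over "degraded" are assumptions made for illustration; the guide does not say how the sensor resolves the overlap.

```python
def ps_redundancy_state(installed, functional):
    """Classify power supply redundancy from supply counts.

    Assumes a redundant configuration (at least two supplies installed).
    """
    if installed < 2:
        raise ValueError("redundancy needs at least two installed supplies")
    if not 0 <= functional <= installed:
        raise ValueError("functional must be between 0 and installed")
    if functional == installed:
        return "full"  # hypothetical label for the healthy state
    if functional <= 1:
        # Assumption: "lost" wins when at most one supply still works.
        return "lost"
    return "degraded"
```

With two supplies, a single failure is reported as `"lost"` under this assumption; with three, the first failure is `"degraded"` and the second is `"lost"`.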
5. Description | Severity | Cause
... Processor sensor status: <status> | ... | A processor sensor in the specified system is disabled, has a configuration error, or experienced a thermal trip. The sensor location, chassis location, previous state, and processor sensor status are provided.
Processor sensor detected a non-recoverable value. Sensor Location: <Location in chassis>. Chassis Location: <Name of chassis>. Previous state was: <State>. Processor sensor status: <status> | Error | A processor sensor in the specified system has failed. The sensor location, chassis location, previous state, and processor sensor status are provided.

Pluggable Device Messages

The pluggable device messages listed in Table 2-15 provide status and error information when some devices, such as memory cards, are added or removed.

Table 2-15. Pluggable Device Messages
Event ID | Description | Severity | Cause
1650 | <Device plug event type unknown>. Device location: <Location in chassis, if available>. Chassis location: <Name of chassis, if available>. Additional details: <Additional details for the events, if available> | Information | A pluggable device event message of unknown type was received. The device location, chassis location, and additional event details, if available, are provided.
1651 | Device added to system. Device location: <Location in | Information | A device was added in the
6. 1153 | Voltage sensor detected a warning value. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. If sensor type is not discrete: Voltage sensor value (in Volts): <Reading>. If sensor type is discrete: Discrete voltage state: <State> | Warning | A voltage sensor in the specified system exceeded its warning threshold. The sensor location, chassis location, previous state, and voltage sensor value are provided.

Table 2-4. Voltage Sensor Messages (continued)
Event ID | Description | Severity | Cause
1154 | Voltage sensor detected a failure value. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. If sensor type is not discrete: Voltage sensor value (in Volts): <Reading>. If sensor type is discrete: Discrete voltage state: <State> | Error | A voltage sensor in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and voltage sensor value are provided.
1155 | Voltage sensor detected a non-recoverable value. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State> | Error | A voltage sensor in the specified system detected an error from which it cannot recover. The sensor location, chassis location, previous state, and voltage sensor
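The voltage sensor entries in this excerpt report the reading differently for discrete and non-discrete sensors. A minimal sketch of that rule follows; `voltage_reading_line` is a hypothetical helper, and the exact punctuation of the output lines is an assumption because the extracted text has lost it.

```python
def voltage_reading_line(is_discrete, reading):
    """Build the reading line of a voltage sensor event description.

    Discrete sensors report a state (for example "Good"); non-discrete
    sensors report a numeric value in volts.
    """
    if is_discrete:
        return f"Discrete voltage state: {reading}"
    return f"Voltage sensor value (in Volts): {reading}"
```

For example, `voltage_reading_line(True, "Good")` yields the discrete form, while a numeric reading passed with `is_discrete=False` yields the volts form.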
7. ... asserted
<Cable sensor Name/Location> Connection was asserted | Information | This event is generated when the earlier cable connection error was corrected.

Battery Events

Table 3-15. Battery Events
Description | Severity | Cause
<Battery sensor Name/Location> Failed was asserted | Critical | This event is generated when the sensor detects a failed or missing battery.
<Battery sensor Name/Location> Failed was deasserted | Information | This event is generated when the earlier failed battery was corrected.
<Battery sensor Name/Location> is low was asserted | Warning | This event is generated when the sensor detects a low battery condition.
<Battery sensor Name/Location> is low was deasserted | Information | This event is generated when the earlier low battery condition was corrected.

Power And Performance Events

The power and performance events are used to detect degradation in system performance with change in power supply.

Table 3-16. Power And Performance Events
Description | Severity | Cause
System Board Power Optimized Performance status sensor for System Board degraded <description of why> was deasserted | Normal | This event is generated when system performance was restored.
System Board Power Optimized Performance status sensor for System | Warning | This event is generated when change in power supply degrades system per
8. ... <location> | ... | This event is generated in association with a CPU IERR.
I/O Fatal Err Fatal IO Group sensor fatal IO error <location> | Critical | This event is generated in association with a CPU IERR and indicates which device caused the CPU IERR.
Unknown system event sensor unknown system hardware failure was asserted | Critical | This event is generated when an unknown hardware failure is detected.

R2 Generated System Events

Table 3-13. R2 Generated Events
Description | Severity | Cause
System Event: OS stop event OS graceful shutdown detected | Information | The OS was shut down or restarted normally.
OEM Event data record (after OS graceful shutdown/restart event) | Information | Comment string accompanying an OS shutdown or restart.
System Event: OS stop event runtime critical stop | Critical | The OS encountered a critical error and was stopped abnormally.
OEM Event data record (after OS bugcheck event) | Information | OS bugcheck code and parameters.

Cable Interconnect Events

The cable interconnect messages are used for detecting errors in the hardware cabling.

Table 3-14. Cable Interconnect Events
Description | Severity | Cause
<Cable sensor Name/Location> [illegible] | Critical | This event is generated when the cable is not connected or is incorrectly connected.
11. ... Action: Replace the disk generating this alert. If necessary, restore your data from backup.

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2309 | A physical disk is incompatible | Warning / Non-critical | Cause: You have attempted to replace a disk with another disk that is using an incompatible technology. For example, you may have replaced one side of a mirror with a SAS disk when the other side of the mirror is using SATA technology. Action: See the hardware documentation for information on replacing disks. | Clear Alert Number: None. Related Alert Number: None. LRA Number: 2070 | 903
2310 | A virtual disk is permanently degraded | Critical / Failure / Error | Cause: A redundant virtual disk has lost redundancy. This may occur when the virtual disk suffers the failure of multiple physical disks. In this case, both the source physical disk and the target disk with redundant data have failed. A rebuild is not possible because there is no redundancy. Action: Replace the failed disks and restore from backup. | Clear Alert Number: None. Related Alert Number: None. LRA Number: 2081 | 1204

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2311 | Th
12. ... | ... Revertible Hot Spare and Replace Member and Load balance changed | ... Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751
2225 | Abort Check Consistency on Error and Load balance changed | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2226 | Load balance changed | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751
2227 | Abort Check Consistency on Error, Allow Revertible Hot Spare and Replace Member, and Auto Replace Member Operation on Predictive Failure changed | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2228 | Allow Revertible Hot Spare | OK / Normal | Cause: This alert is for informational purposes. Action: No | Clear Alert Number | 751
13. ... Fail Safe state asserted | ... | This event is generated when the system board voltages are not at normal levels.
System Board PFault Fail Safe state deasserted | Information | This event is generated when earlier PFault Fail Safe system voltages return to a normal level.
Memory Add BANK DIMM presence was asserted | Information | This event is generated when memory is added to the system.
Memory Removed BANK DIMM presence was asserted | Information | This event is generated when memory is removed from the system.
Memory Cfg Err configuration error BANK DIMM was asserted | Critical | This event is generated when memory configuration is incorrect for the system.
Mem Redun Gain redundancy regained | Information | This event is generated when memory redundancy is regained.
Mem ECC Warning transition to non-critical from OK | Warning | This event is generated when correctable ECC errors have increased from a normal rate.
Mem ECC Warning transition to critical from less severe | Critical | This event is generated when correctable ECC errors reach a critical rate.
Mem CRC Err transition to non-recoverable | Critical | This event is generated when CRC errors enter a non-recoverable state.
Mem Fatal SB CRC uncorrectable ECC was asserted | Critical | This event is generated when CRC errors occur while storing to memory.

Table
14. ... None
2276 | The dedicated hot spare is too small | Warning / Non-critical | Cause: The dedicated hot spare is not large enough to protect all virtual disks that reside on the disk group. Action: Assign a larger disk as the dedicated hot spare. | Clear Alert Number: None. Related Alert Number: None. LRA Number: 2070 | 903
2277 | The global hot spare is too small | Warning / Non-critical | Cause: The global hot spare is not large enough to protect all virtual disks that reside on the controller. Action: Assign a larger disk as the global hot spare. | Clear Alert Number: None. Related Alert Number: None. LRA Number: 2070 | 903

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2278 | The controller battery charge level is below a normal threshold | OK / Normal | Cause: The battery is discharging. A battery discharge is a normal activity during the battery Learn cycle. Before completing, the battery Learn cycle recharges the battery. You should receive alert 2179 when the recharge occurs. Action: Check if the battery Learn cycle is in progress. Alert 2176 indicates that the battery Learn cycle has initiated. The battery also displays the Learn state while the Learn cycle is in progress. If a Learn cycle is not in progress, replace the batter | Clear Alert Number: None. Related Alert Number: 2199. LRA Number: None | 1154
15. Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2159 | Virtual disk renamed | OK / Normal | Cause: This alert is for informational purposes. A user has renamed a virtual disk. When renaming a virtual disk on a PERC 3/SC, 3/DCL, 3/DC, 3/QC, 4/SC, 4/DC, 4e/DC, 4/Di, CERC ATA100/4ch, PERC 5/E, PERC 5/i, or SAS 5/iR controller, this alert displays the new virtual disk name. On the PERC 3/SC, 3/DCL, 3/DC, 3/QC, 4/SC, 4/DC, 4e/DC, 4/Di, 4/IM, 4e/Si, 4e/Di, and CERC ATA 100/4ch controllers, this alert displays the original virtual disk name. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 1201
2162 | Communication regained | OK / Normal | Cause: This alert is for informational purposes. Communication with an enclosure has been restored. Action: None | Clear Alert Status: Alert 2162 is a clear alert for alerts 2137 and 2292. Related Alert Number: None. LRA Number: None | 851

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2163 | Rebuild completed with errors | Critical / Failure / Error | Cause: This alert is documented in the Storage Management online help. Action: See the online help for more i | Clear Alert Number: None. Related Alert | 904
16. cause of the event and other relevant information, such as the event location and the monitored item's previous state.

Tables provided in this guide list all Server Administrator event IDs in numeric order. Each entry includes the event ID's corresponding description, severity level, and cause. Message text in angle brackets (for example, <State>) describes the event-specific information provided by the Server Administrator.

What's New in this Release

The following changes have been made for this release:
• Added new Chassis Management Controller Events. For more information, see "Chassis Management Controller Messages" on page 36.
• Updated BIOS Generated System Events and added new Power and Performance Events. For more information, see "Power And Performance Events" on page 76.
• Added new Storage Management alerts. For more information, see "Alert Message Change History" on page 81.

Introduction

Messages Not Described in This Guide

This guide describes only event messages created by Server Administrator and displayed in the Server Administrator Alert log. For information on other messages produced by your system, consult one of the following sources:
• Your system's Installation and Troubleshooting Guide
• Other system documentation
• Operating system documentation
• Application program documentation

Understanding Event Messages

This section describes the various types of event messages generated
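The angle-bracket convention described in this excerpt, where text such as <State> stands for event-specific information, can be illustrated with a small substitution routine. This is a sketch only: `fill_message` is a hypothetical name, and nothing here reflects how Server Administrator itself builds its messages.

```python
import re


def fill_message(template, values):
    """Replace each <Name> placeholder with values[Name].

    Placeholders with no supplied value are left unchanged so that
    missing data stays visible in the output.
    """
    def substitute(match):
        name = match.group(1)
        return str(values.get(name, match.group(0)))

    return re.sub(r"<([^<>]+)>", substitute, template)
```

Calling `fill_message("Previous state was: <State>", {"State": "Good"})` fills the placeholder, while a template whose placeholder has no matching key is returned as written.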
17. ... | ... sequence of SAS devices [illegible] during System startup [illegible] and monitoring is not possible | ... Error | ... monitor or manage SAS components. Action: Reboot the system. If problem persists, make sure you have supported versions of the drivers and firmware. Also, you may need to reinstall Storage Management or Server Administrator because of some missing installation components. | Clear Alert Number: None. Related Alert Number: None. LRA Number: 2051
2315 | Diagnostic message %1 | OK / Normal | Cause: This alert is for informational purposes. The %1 indicates a substitution variable. The text for this substitution variable is generated by the utility that ran the diagnostics and is displayed with the alert in the Alert Log. This text can vary depending on the situation. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2316 | Diagnostic message %1 | Critical / Failure / Error | Cause: A diagnostics test failed. The %1 indicates a substitution variable. The text for this substitution variable is generated by the utility that ran the diagnostics and is displayed with the alert in the Alert Log. This text can vary depending on the situation. Action: See the documentatio | Clear Alert Number: None. Related Alert Number: None. LRA Number: 2061 | 754
18. Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2269 | The physical disk Clear operation has completed | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 901
2270 | The physical disk Clear operation failed | Critical / Failure / Error | Cause: A Clear task was being performed on a physical disk but the task was interrupted and did not complete successfully. The controller may have lost communication with the disk. The disk may have been removed or the cables may be loose or defective. Action: Verify that the disk is present and not in a Failed state. Make sure the cables are attached securely. See the online help for more information on checking the cables. Restart the Clear task. | Clear Alert Number: None. Related Alert Number: None. LRA Number: 2971 | 904
2271 | The Patrol Read corrected a media error | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 901

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2272 | Patrol Read found an | Critical / Fai | Cause: The Patrol Read | Clear Alert | 904
19. Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2321 | Single-bit ECC error. The DIMM is critically degraded. There will be no further reporting. | Critical / Failure / Error | Cause: The DIMM is malfunctioning. Data loss or data corruption is imminent. The DIMM must be replaced immediately. No further alerts will be generated. Action: Replace the DIMM immediately. The DIMM is a part of the controller battery pack. See your hardware documentation for information on replacing the DIMM. | Clear Alert Number: None. Related Alert Number: None. LRA Number: [illegible] | 754
2322 | The DC power supply is switched off | Critical / Failure / Error | Cause: The power supply unit is switched off. Either a user switched off the power supply unit or it is defective. Action: Check if the power switch is turned off. If it is turned off, turn it on. If the problem persists, check if the power cord is attached and functional. If the problem is still not corrected, or if the power switch is already turned on, replace the power supply unit. | Clear Alert Number: 2323. Related Alert Number: None. LRA Number: 209 | 1004

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2323 | The power supply i | OK | Cause: This alert is for | Clear Alert | 1001
20. Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2220 | Allow Revertible Hot Spare and Replace Member, Auto Replace Member operation on Predictive Failure and Load balance changed | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751
2221 | Auto Replace Member operation on Predictive Failure, Abort Check Consistency on Error and Load balance changed | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751
2222 | Load balance and Auto Replace Member operation on Predictive Failure changed | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2223 | Abort Check Consistency on Error, Allow Revertible Hot Spare and Replace Member and Load balance changed | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 751
2224 | Allow | OK
21. ... | ... firmware mismatch | ... Non-critical | ... the EMM is not the same version. It is required that both modules have the same version of the firmware. This alert may be caused when a user attempts to insert an EMM module that has a different firmware version than an existing module. Action: Download the same version of the firmware to both EMM modules. | Clear Alert Number: None. Related Alert Number: None. LRA Number: 2090 | 853
2121 | Device returned to normal | OK / Normal | Cause: This alert is for informational purposes. A device that was previously in an error state has returned to a normal state. For example, if an enclosure became too hot and subsequently cooled down, then you may receive this alert. Action: None | Clear Alert Status: Alert 2121 is a clear alert for alert 2048. Related Alert Number: 2050, 2065, 2158. LRA Number: None | 752, 802, 852, 902, 952, 1002, 1052, 1102, 1152, 1202

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2122 | Redundancy degraded | Warning / Non-critical | Cause: One or more of the enclosure components has failed. For example, a fan or power supply may have failed. Although the enclosure is currently operational, the failure of additional components could cause the enclosure to fail. Action: Identif | Clear Alert Status: 2124. Related Alert Number: 2048. LRA Number: 2090 | 1305
22. ... 901. Changed severity to Warning. Changed SNMP trap number to 903.

Table 4-3. Alert Message Change History (continued)
Alert Message Change History
Obsolete Alerts | 2333, 2354, 2355, 2365, 2370 | 2354 replaced by 2368.
Documentation Changes | Severity for alert 2163 changed from Ok / Normal to Critical / Failure / Error. | Documentation change only, made in the Dell OpenManage Server Administrator Messages Reference Guide to reflect the severity displayed in the Server Administrator Alert Log and documented in the Storage Management online help.
Documentation Changes | Severity for alert 2318 changed from Critical / Failure / Error to Warning / Non-critical. | Documentation change only, made in the Dell OpenManage Server Administrator Messages Reference Guide to reflect the severity displayed in the Server Administrator Alert Log and documented in the Storage Management online help.
Documentation Changes | Removed alert 2344. Replaced by alert 2070. | Documentation change only, made in the Dell OpenManage Server Administrator Messages Reference Guide to reflect existing Storage Management online help.
Documentation Changes | Removed alert 2345. Replaced by alert 2079. | Documentation change only, made in the Dell OpenManage Server Administrator Messages Reference Guide to reflect existing Storage Management online help.

Table 4-3. Alert Message Change History (continued)
Alert Message C
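The replacements recorded in this change history (2344 by 2070, 2345 by 2079, 2354 by 2368) could be applied to stored alert filters or scripts with a small lookup. The names `REPLACED_BY` and `current_alert_id` are hypothetical, and the table below contains only the three replacements visible in this excerpt.

```python
# Obsolete alert ID -> replacement alert ID, as listed in the excerpt.
REPLACED_BY = {2344: 2070, 2345: 2079, 2354: 2368}


def current_alert_id(alert_id):
    """Follow replacement links until reaching an ID that is not replaced."""
    seen = set()
    while alert_id in REPLACED_BY and alert_id not in seen:
        seen.add(alert_id)  # guards against an accidental cycle in the table
        alert_id = REPLACED_BY[alert_id]
    return alert_id
```

IDs that the table does not mention, such as 2163, pass through unchanged.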
23. ... Action: None | Related Alert Number: None. Local Response Agent (LRA) Alert Number: None
2254 | The Clear operation has cancelled | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 901

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2255 | The physical disk has been started | OK / Normal | Cause: This alert is for informational purposes. Action: None | Clear Alert Number: None. Related Alert Number: 2048, 2050, 2065, 2099, 2121, 2196, 2201, 2203. LRA Number: None | 901
2257 | Controller preserved cache is discarded | Warning | Cause: The controller cache is discarded by the user. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 753
2258 | Controller has preserved cache | Warning | Cause: IO interrupted for a virtual disk which is connected to the controller. Action: Check for foreign configuration and import if any. Check for cable fault. Recover any virtual disk lost by the controller. | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 753

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Aler
24. ... | Background initialization (BGI) started | OK / Normal | Cause: BGI of a virtual disk has started. This alert is for informational purposes. Action: None | Clear Alert Status: 2130. Related Alert Number: None. LRA Number: None | 1201
2128 | BGI cancelled | OK / Normal | Cause: BGI of a virtual disk has been cancelled. A user or the firmware may have stopped BGI. Action: None | Clear Alert Number: None. Related Alert Number: None. LRA Number: None | 1201
2129 | BGI failed | Critical / Failure / Error | Cause: BGI of a virtual disk has failed. Action: None | Clear Alert Number: None. Related Alert Number: 2340. LRA Number: 2081 | 1204

Table 4-4. Storage Management Messages (continued)
Event ID | Description | Severity | Cause and Action | Related Alert Information | SNMP Trap Numbers
2130 | BGI completed | OK / Normal | Cause: BGI of a virtual disk has completed. This alert is for informational purposes. Action: None | Clear Alert Number: Alert 2130 is a clear alert for alert 2127. Related Alert Number: None. LRA Number: None | 1201
2131 | Firmware version mismatch | Warning / Non-critical | Cause: The firmware on the controller is not a supported version. Action: Install a supported version of the firmware. If you do not have a supported version of the firmware available, it can be downloaded from the Dell support site at support dell c | Clear Alert Number: None. Related Alert Number: None. LRA Number: 2060 | 753
25. ...exceeded the maximum warning threshold. Severity: Non-critical. Cause: ...enclosure is too hot. A variety of factors can cause the excessive temperature. For example, a fan may have failed, the thermostat may be set too high, or the room temperature may be too hot. Action: Check for factors that may cause overheating. For example, verify that the enclosure fan is working. You should also check the thermostat settings and examine whether the enclosure is located near a heat source. Make sure the enclosure has enough ventilation and that the room temperature is not too hot. See the physical disk enclosure documentation for more diagnostic information. Clear Alert Number: 2353. Related Alert Number: 2112. LRA Number: 2090. SNMP Trap Number: 1053. Table 4-4. Storage Management Messages (continued). 2101: Temperature dropped below the minimum warning threshold. Severity: Warning / Non-critical. Cause: The physical disk enclosure is too cool. Action: Check if the thermostat setting is too low and if the room temperature is too cool. Clear Alert Number: 2353. Related Alert Number: None. LRA Number: 2090. SNMP Trap Number: 1053. 2102: Temperature exceeded the maximum failure threshold. Severity: Critical / Failure / Error. Clear Alert Number: None. SNMP Trap Number: 1054. Cause: The physical disk enclosure is too hot. A variety of factors can cause the excessive temperature. For
26. Current sensor value (in Amps): <Reading>. Specifies the current sensor value in amps, for example: Current sensor value (in Amps): 7.853. Date and time of action: <Date and time>. Specifies the date and time the action was performed, for example: Date and time of action: Sat Jun 12 16:20:33 2004. Device location: <Location in chassis>. Specifies the location of the device in the specified chassis, for example: Device location: Memory Card A. Discrete current state: <State>. Specifies the state of the current sensor, for example: Discrete current state: Good. Table 1-2. Event Description Reference (continued). Discrete temperature state: <State>. Specifies the state of the temperature sensor, for example: Discrete temperature state: Good. Discrete voltage state: <State>. Specifies the state of the voltage sensor, for example: Discrete voltage state: Good. Fan sensor value: <Reading>. Specifies the fan speed in revolutions per minute (RPM) or On/Off, for example: Fan sensor value (in RPM): 2600, or Fan sensor value: Off. Log type: <Log type>. Specifies the type of hardware log, for example: Log type: ESM. Memory device bank location: <Bank name in chassis>. Memory device location: <Device name in chassis>. Specifies the name of the memory bank in
27. 2206: The only hot spare available is a SATA disk. SATA disks cannot replace SAS disks. Severity: Warning / Non-critical. Cause: The only physical disk available to be assigned as a hot spare is using SATA technology. The physical disks in the virtual disk are using SAS technology. Because of this difference in technology, the hot spare cannot rebuild data if one of the physical disks in the virtual disk fails. Action: Add a SAS disk that is large enough to be used as the hot spare and assign the new disk as a hot spare. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2070. SNMP Trap Number: 903. 2207: The only hot spare available is a SAS disk. SAS disks cannot replace SATA disks. Severity: Warning / Non-critical. Cause: The only physical disk available to be assigned as a hot spare is using SAS technology. The physical disks in the virtual disk are using SATA technology. Because of this difference in technology, the hot spare cannot rebuild data if one of the physical disks in the virtual disk fails. Action: Add a SATA disk that is large enough to be used as the hot spare and assign the new disk as a hot spare. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2070. SNMP Trap Number: 903. Table 4-4. Storage Management Messages (continued). 2210: Battery ... Severity: Warning. Cause: Battery requires
28. LRA Number: None. Table 4-4. Storage Management Messages (continued). 2334: Controller event log: %1. Severity: OK / Normal. Cause: This alert is for informational purposes. The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the Alert Log. This text is from events in the controller event log that were generated while Storage Management was not running. This text can vary depending on the situation. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2335: Controller event log: %1. Severity: Warning / Non-critical. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2060. SNMP Trap Number: 753. Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the Alert Log. This text is from events in the controller event log that were generated while Storage Management was not running. This text can vary depending on the situation. Action: If there is a problem, review the
29. ...alert for alert 2059. Related Alert Number: None. LRA Number: None. Table 4-4. Storage Management Messages (continued). 2087: Copy of data resumed from physical disk %2 to physical disk %1. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: None. Related Alert Number: 260. LRA Number: None. SNMP Trap Number: 1201. 2088: Virtual disk initialization completed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: Alert 2088 is a clear alert for alerts 2061 and 2136. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1201. 2089: Physical disk initialize completed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: Alert 2089 is a clear alert for alert 2062. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 901. 2090: Virtual disk reconfiguration completed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: Alert 2090 is a clear alert for alert 2063. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1201. 2091: Virtual disk OK re
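The "Clear Alert Status" fields quoted in this excerpt describe which earlier alerts a later alert resolves. A minimal sketch of how a monitoring script might apply that relationship is shown below. The names CLEARS and resolve are hypothetical, and the table holds only the three pairs stated in this excerpt, not the full set from the guide.

```python
# Clear-alert relationships as stated in the entries above.
# Key: the clearing alert ID; value: the alert IDs it clears.
# CLEARS and resolve() are illustrative names, not part of any Dell tool.
CLEARS = {
    2088: {2061, 2136},  # virtual disk initialization completed
    2089: {2062},        # physical disk initialize completed
    2090: {2063},        # virtual disk reconfiguration completed
}


def resolve(open_alerts, incoming_id):
    """Return the alerts still open after the alert `incoming_id` arrives."""
    return set(open_alerts) - CLEARS.get(incoming_id, set())


if __name__ == "__main__":
    still_open = resolve({2061, 2062}, 2088)
    print(sorted(still_open))
```

An alert ID that is not a key in the table clears nothing, so the open set is returned unchanged.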
30. ...<Name of chassis>. Previous state was: <State>. Fan sensor value: <Reading>. Cause: ...sensor location, chassis location, previous state, and fan sensor value are provided. 1101: Fan sensor value unknown. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. Fan sensor value: <Reading>. Severity: Information. Cause: A fan sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and a nominal fan sensor value are provided. Table 2-3. Cooling Device Messages (continued). 1102: Fan sensor returned to a normal value. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. Fan sensor value: <Reading>. Severity: Information. Cause: A fan sensor reading on the specified system returned to a valid range after crossing a warning threshold. The sensor location, chassis location, previous state, and fan sensor value are provided. 1103: Fan sensor detected a warning value. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. Fan sensor value: <Reading>. Severity: Warning. Cause: A fan sensor reading in the specified system exceeded a warning threshold. The
31. Number: None. 2292: Communication with the enclosure has been lost. Severity: Critical / Failure / Error. Cause: The controller has lost communication with an EMM. The cables may be loose or defective. Action: Make sure the cables are attached securely. Reboot the system. Clear Alert Number: 2162. Related Alert Number: None. LRA Number: 2091. SNMP Trap Number: 854. Table 4-4. Storage Management Messages (continued). 2293: The EMM has failed. Severity: Critical / Failure / Error. Cause: The failure may be caused by a loss of power to the EMM. The EMM self test may also have identified a failure. There could also be a firmware problem or a multi-bit error. Action: Replace the EMM. See the hardware documentation for information on replacing the EMM. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2091. SNMP Trap Number: 854. 2294: A device has been inserted. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 851. 2295: A device has been removed. Severity: Critical / Failure / Error. Cause: A device has been removed and the system is no longer functioning in optimal condition. Action: Replace the device. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2091. SNMP Trap Number: 854.
32. ...controller firmware and driver validation was not performed. The configuration file is out of date or corrupted. Severity: Non-critical. Cause: ...Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller firmware and drivers. This situation has occurred because a configuration file is unreadable or missing data. The configuration file may be corrupted. Action: Reinstall Storage Management. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2060. SNMP Trap Number: 753. Table 4-4. Storage Management Messages (continued). 2167: The current kernel version and the non-RAID SCSI driver version are older than the minimum required levels. See readme.txt for a list of validated kernel and driver versions. Severity: Warning / Non-critical. Cause: The version of the kernel and the driver do not meet the minimum requirements. Storage Management may not be able to display the storage or perform storage management functions until you have updated the system to meet the minimum requirements. Action: See the Readme file for a list of validated kernel and driver versions. Update the system to meet the minimum requirements and then reinstall Storage Management. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2050. SNMP Trap Number: 103. 2168: The non ... Severity: Warning. Cause: The version of the
33. ...redundancy degraded. Cause: This event is generated when there is a memory failure in a RAID-configured memory configuration. Event Message: Memory RAID redundancy lost. Severity: Critical. Cause: This event is generated when redundancy is lost in a RAID-configured memory configuration. Event Message: Memory RAID redundancy regained. Severity: Information. Cause: This event is generated when the redundancy lost or degraded earlier is regained in a RAID-configured memory configuration. Event Message: Memory Mirrored redundancy degraded. Severity: Information. Cause: This event is generated when there is a memory failure in a mirrored memory configuration. Table 3-8. Memory Events (continued). Event Message: Memory Mirrored redundancy lost. Severity: Critical. Cause: This event is generated when redundancy is lost in a mirrored memory configuration. Event Message: Memory Mirrored redundancy regained. Severity: Information. Cause: This event is generated when the redundancy lost or degraded earlier is regained in a mirrored memory configuration. Event Message: Memory Spared redundancy degraded. Severity: Information. Cause: This event is generated when there is a memory failure in a spared memory configuration. Event Message: Memory Spared redundancy lost. Severity: Critical. Cause: This event is generated when redundancy is lost in a spared memory configuration. Event Message: Memory Spared redundancy regained. Severity: Information. Cause: This event is generated when the redundancy lost or degraded earlier is regained in a spared memory configuration. Hardware Log Sensor Events. The hardware logs
34. ...processor sensor disabled. Severity: Warning. Cause: This event is generated for all processors that are disabled. Event Message: <Processor Entity> status processor sensor terminator not present. Severity: Information. Cause: This event is generated if the terminator is missing on an empty processor slot. Event Message: <Processor Entity> presence was deasserted. Severity: Critical. Cause: This event is generated when the system could not detect the processor. Event Message: <Processor Entity> presence was asserted. Severity: Information. Cause: This event is generated when the earlier processor detection error was corrected. Table 3-4. Processor Status Events (continued). Event Message: <Processor Entity> thermal tripped was deasserted. Severity: Information. Cause: This event is generated when the processor has recovered from an earlier thermal condition. Event Message: <Processor Entity> configuration error was asserted. Severity: Critical. Cause: This event is generated when the processor configuration is incorrect. Event Message: <Processor Entity> configuration error was deasserted. Severity: Information. Cause: This event is generated when the earlier processor configuration error was corrected. Event Message: <Processor Entity> throttled was asserted. Severity: Warning. Cause: This event is generated when the processor slows down to prevent overheating. Event Message: <Processor Entity> throttled was deasserted. Severity: Information. Cause: This event is generated when the earlier processor throttled event was corrected.
35. ...Write Back. None. LRA Number: None. Table 4-4. Storage Management Messages (continued). 2190: The controller has detected a hot-plugged enclosure. Severity: OK / Normal. Cause: This alert is provided for informational purposes. The SAS controller with firmware version 6.1 or later has detected a hot-plugged enclosure. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2191: Multiple enclosures are attached to the controller. This is an unsupported configuration. Severity: Critical / Failure / Error. Cause: Many enclosures are attached to the controller port. When the enclosure limit is exceeded, the controller loses contact with all enclosures attached to the port. Action: Remove the last enclosure. You must remove the enclosure that has been added last and is causing the enclosure limit to exceed. Clear Alert Number: None. Related Alert Number: 2211. LRA Number: 2091. SNMP Trap Number: 854. 2192: The virtual disk Check Consistency has made ... Severity: OK / Normal. Clear Alert Number: None. SNMP Trap Number: 1203. Cause: This alert is for informational purposes. The virtual disk Check Consistency
36. ...been recovered and is now usable. Any data residing on these dead segments has been lost. Action: None. LRA Number: None. Table 4-4. Storage Management Messages (continued). 2142: Controller rebuild rate has changed. Severity: OK / Normal. Cause: This alert is for informational purposes. A user has changed the controller rebuild rate. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2143: Controller alarm enabled. Severity: OK / Normal. Cause: This alert is for informational purposes. A user has enabled the controller alarm. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2144: Controller alarm disabled. Severity: OK / Normal. Cause: This alert is for informational purposes. A user has disabled the controller alarm. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2145: Controller battery low. Severity: Warning / Non-critical. Cause: The controller battery charge is low. Action: Recondition the battery. See the online help for more information. Clear Alert Number: None. Related Alert Number: None. SNMP Trap Number: 1153. LRA Number: 210
37. ...dedicated hot spare failed. Severity: Warning / Non-critical. Cause: The controller is unable to communicate with a disk that is assigned as a dedicated hot spare. The disk may have failed or been removed. There may also be a bad or loose cable. Action: Check if the disk is healthy and that it has not been removed. Check the cables. If necessary, replace the disk and reassign the hot spare. Clear Alert Number: None. Related Alert Number: 2048. LRA Number: 2070. SNMP Trap Number: 903. 2204: A dedicated hot spare has been removed. Severity: OK / Normal. Cause: The controller is unable to communicate with a disk that is assigned as a dedicated hot spare. The disk may have been removed. There may also be a bad or loose cable. Action: Check if the disk is healthy and that it has not been removed. Check the cables. If necessary, replace the disk and reassign the hot spare. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 901. 2205: A dedicated hot spare has been automatically unassigned. Severity: OK / Normal. Cause: The hot spare is no longer required because the virtual disk it was assigned to has been deleted. Action: None. Clear Alert Number: None. Related Alert Number: 2098, 2161, 2196. LRA Number: None. SNMP Trap Number: 901.
38. ...corrections and completed. Cause: ...has identified errors and made corrections. For example, the Check Consistency may have encountered a bad disk block and remapped the disk block to restore data consistency. Action: This alert is for informational purposes only and no additional action is required. As a precaution, monitor the Alert Log for other errors related to this virtual disk. If problems persist, contact Dell Technical Support. Related Alert Number: None. LRA Number: None. 2193: The virtual disk reconfiguration has resumed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1201. Table 4-4. Storage Management Messages (continued). 2194: The virtual disk Read policy has changed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1201. 2195: Dedicated hot spare assigned. Physical disk %1. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: 2196. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1201. 2196: Dedicated hot spare unassigned. Severity: OK / Normal. Cause: This alert is for informational purposes. SNMP Trap Number: 1201. Clear Alert Status: Alert ... Action: Non
39. ...in any attached systems. Voltage Sensor: Monitors voltages across critical components in various chassis locations and in any attached systems. Current Sensor: Monitors the current (or amperage) output from the power supply (or supplies) in the chassis and in any attached systems. Chassis Intrusion Sensor: Monitors intrusion into the chassis and any attached systems. Redundancy Unit Sensor: Monitors redundant units (critical units such as fans, AC power cords, or power supplies) within the chassis; also monitors the chassis and any attached systems. For example, redundancy allows a second or nth fan to keep the chassis components at a safe temperature when another fan has failed. Redundancy is normal when the intended number of critical components are operating. Redundancy is degraded when a component fails but others are still operating. Redundancy is lost when there is one less critical redundancy device than required. Power Supply Sensor: Monitors power supplies in the chassis and in any attached systems. Memory Prefailure Sensor: Monitors memory modules by counting the number of Error Correction Code (ECC) memory corrections. Fan Enclosure Sensor: Monitors protective fan enclosures by detecting their removal from and insertion into the system, and by measuring how long a fan enclosure is absent from the chassis. This sensor monitors the chassis and any attached systems. AC Power Cord Sensor: Monitors the p
40. ...in the drsCAMessage variable binding supplied with the alert. 2005: CMC reported a non-recoverable event. Severity: Non-Recoverable. Cause: CMC non-recoverable event, as described in the drsCAMessage variable binding supplied with the alert. Redundancy Unit Messages. Redundancy means that a system chassis has more than one of certain critical components. Fans and power supplies, for example, are so important for preventing damage or disruption of a computer system that a chassis may have extra fans or power supplies installed. Redundancy allows a second or nth fan to keep the chassis components at a safe temperature when the primary fan has failed. Redundancy is normal when the intended number of critical components are operating. Redundancy is degraded when a component fails but others are still operating. Redundancy is lost when the number of components functioning falls below the redundancy threshold. Table 2-8 lists the redundancy unit messages. The number of devices required for full redundancy is provided as part of the message, when applicable, for the redundancy unit and the platform. For details on redundancy computation, see the respective platform documentation. Table 2-8. Redundancy Unit Messages. 1300: Redundancy sensor has failed. Severity: Information. Cause: A redundancy sensor in the specified system failed. The redundancy unit location, chassis location, previous redund
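The three redundancy states defined in the Redundancy Unit Messages text can be sketched as a small classifier. This is only an illustration: the function name and its parameters are hypothetical, and the threshold is passed in as an assumption because the guide refers the actual redundancy computation to the platform documentation.

```python
def redundancy_state(functioning, intended, threshold):
    """Classify redundancy from component counts (illustrative sketch).

    functioning: number of critical components currently operating
    intended:    number of components installed for full redundancy
    threshold:   minimum count below which redundancy is lost
                 (assumed input; the real value is platform-specific)
    """
    if functioning >= intended:
        return "normal"      # the intended number of components are operating
    if functioning >= threshold:
        return "degraded"    # a component failed but others still operate
    return "lost"            # functioning count fell below the threshold


if __name__ == "__main__":
    for count in (4, 3, 1):
        print(count, redundancy_state(count, intended=4, threshold=2))
```

The order of the checks matters: a system with every intended component running is normal even when the threshold equals the intended count.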
41. ...location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. Battery sensor status: <status>. Cause: ...system is not functioning. The sensor location, chassis location, previous state, and battery sensor status are provided. 1701: Battery sensor value unknown. Sensor Location: <Location in chassis>. Chassis Location: <Name of chassis>. Previous state was: <State>. Battery sensor status: <status>. Severity: Information. Cause: A battery sensor in the specified system could not retrieve a reading. The sensor location, chassis location, previous state, and battery sensor status are provided. 1702: Battery sensor returned to a normal value. Sensor Location: <Location in chassis>. Chassis Location: <Name of chassis>. Previous state was: <State>. Battery sensor status: <status>. Severity: Information. Cause: A battery sensor in the specified system detected that a battery transitioned back to a normal state. The sensor location, chassis location, previous state, and battery sensor status are provided. Table 2-16. Battery Sensor Messages (continued). 1703: Battery sensor detected a warning value. Sensor Location: <Location in chassis>. Chassis Location: <Name of ... Severity: Warning. Cause: A battery sensor in the specified system detected that a battery is in a predictive failure state. T
42. ...location: <Name of chassis>. Previous state was: <State>. If sensor type is not discrete: Voltage sensor value (in Volts): <Reading>. If sensor type is discrete: Discrete voltage state: <State>. Cause: ...location, previous state, and voltage sensor value are provided. 1151: Voltage sensor value unknown. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. If sensor type is not discrete: Voltage sensor value (in Volts): <Reading>. If sensor type is discrete: Discrete voltage state: <State>. Severity: Information. Cause: A voltage sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and a nominal voltage sensor value are provided. Table 2-4. Voltage Sensor Messages (continued). 1152: Voltage sensor returned to a normal value. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. If sensor type is not discrete: Voltage sensor value (in Volts): <Reading>. If sensor type is discrete: Discrete voltage state: <State>. Severity: Information. Cause: A voltage sensor in the specified system returned to a valid range after crossing a failure threshold. The sensor location, chassis location, previous state, and voltage sensor value are provided.
43. ...probe warning threshold value changed. Action: None. ...Number: None. LRA Number: None. 2155: Minimum temperature probe warning threshold value changed. Severity: OK / Normal. Cause: This alert is for informational purposes. A user has changed the value for the minimum temperature probe warning threshold. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1051. Table 4-4. Storage Management Messages (continued). 2156: Controller alarm has been tested. Severity: OK / Normal. Cause: This alert is for informational purposes. The controller alarm test has run successfully. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2157: Controller configuration has been reset. Severity: OK / Normal. Cause: This alert is for informational purposes. A user has reset the controller configuration. See the online help for more information. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2158: Physical disk online. Severity: OK / Normal. Cause: This alert is for informational purposes. An offline physical disk has been made online. Action: None. Clear Alert Status: Alert 2158 is a clear alert for alert 2050. Related Alert Number: 2048, 2050, 2065, 2099, 2121, 2196, 2201, 2203. LRA Number: None. SNMP Trap Number: 901.
44. ...provide hardware status messages to the system management software. On particular systems, the subsequent hardware messages are not displayed when the log is full. These messages provide status and warning messages when the logs are full. Table 3-9. Hardware Log Sensor Events. Event Message: Log full detected. Severity: Critical. Cause: This event is generated when the SEL device detects that only one entry can be added to the SEL before it is full. Event Message: Log cleared. Severity: Information. Cause: This event is generated when the SEL is cleared. Drive Events. The drive event messages monitor the health of the drives in a system. These events are generated when there is a fault in the drives indicated. Table 3-10. Drive Events. Event Message: Drive <Drive> asserted fault state. Severity: Critical. Cause: This event is generated when the specified drive in the array is faulty. Event Message: Drive <Drive> de-asserted fault state. Severity: Information. Cause: This event is generated when the specified drive recovers from a faulty condition. Event Message: Drive <Drive> drive presence was asserted. Severity: Informational. Cause: This event is generated when the drive is installed. Event Message: Drive <Drive> predictive failure was asserted. Severity: Warning. Cause: This event is generated when the drive is about to fail. Event Message: Drive <Drive> predictive failure ... Severity: Informational. Cause: This event is generated when the drive ... earlier predicti
45. ...sensor location, chassis location, previous state, and fan sensor value are provided. Table 2-3. Cooling Device Messages (continued). 1104: Fan sensor detected a failure value. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. Fan sensor value: <Reading>. Severity: Error. Cause: A fan sensor in the specified system detected the failure of one or more fans. The sensor location, chassis location, previous state, and fan sensor value are provided. 1105: Fan sensor detected a non-recoverable value. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. Fan sensor value: <Reading>. Severity: Error. Cause: A fan sensor detected an error from which it cannot recover. The sensor location, chassis location, previous state, and fan sensor value are provided. Voltage Sensor Messages. Voltage sensors listed in Table 2-4 monitor the number of volts across critical components. Voltage sensor messages provide status and warning information for voltage sensors in a particular chassis. Table 2-4. Voltage Sensor Messages. 1150: Voltage sensor has failed. Sensor location: <Location ... Severity: Information. Cause: A voltage sensor in the specified system failed. The sensor location, chassis
46. ...task. See the "Replacing a Failed Disk" section in the Dell OpenManage Server Administrator Storage Management User's Guide for more information. Table 4-4. Storage Management Messages (continued). 2306: Bad block table is 80% full. Severity: Warning / Non-critical. Cause: The bad block table is used for remapping bad disk blocks. This table fills as bad disk blocks are remapped. When the table is full, bad disk blocks can no longer be remapped and disk errors can no longer be corrected. At this point, data loss can occur. The bad block table is now 80% full. Action: Back up your data. Replace the disk generating this alert and restore from backup. Clear Alert Number: None. Related Alert Number: 2307. LRA Number: 2070. SNMP Trap Number: 903. 2307: Bad block table is full. Unable to log block %1. Severity: Critical / Failure / Error. Clear Alert Number: None. Related Alert Number: 2048. SNMP Trap Number: 904. Cause: The bad block table is used for remapping bad disk blocks. This table fills as bad disk blocks are remapped. When the table is full, bad disk blocks can no longer be remapped and disk errors can no longer be corrected. At this point, data loss can occur. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation.
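The two bad block table alerts describe an escalation: a warning (2306) when the table is 80 percent full and a critical alert (2307) when it is full. A minimal sketch of that escalation follows. The function name and its inputs are hypothetical, and treating 80 percent as an inclusive lower bound is an assumption, since the guide does not say exactly when the controller raises 2306.

```python
def bad_block_alert(used_entries, capacity):
    """Map bad block table usage to an alert ID from this table, or None.

    used_entries and capacity are illustrative inputs; the real controller
    firmware decides when alerts 2306 and 2307 are raised.
    """
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    fill = used_entries / capacity
    if fill >= 1.0:
        return 2307  # Critical: table is full, blocks can no longer be remapped
    if fill >= 0.8:
        return 2306  # Warning: table is 80 percent full (assumed inclusive)
    return None      # below the warning level, no alert


if __name__ == "__main__":
    for used in (10, 80, 100):
        print(used, bad_block_alert(used, 100))
```

Because 2306 lists 2307 as its related alert, a script that sees 2306 can reasonably treat it as advance notice to back up data before the table fills.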
47. ...not the same. EMM0 %1 EMM1 %2. 2990. Cause: ...this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation. Action: The EMMs in the enclosure have a different SCSI rate. This is an unsupported configuration. All EMMs in the enclosure should have the same SCSI rate. Table 4-4. Storage Management Messages (continued). 2174: The controller battery has been removed. Severity: Warning / Non-critical. Cause: The controller cannot communicate with the battery; the battery may be removed, or the contact point between the controller and the battery may be burnt or corroded. Action: Replace the battery if it has been removed. If the contact point between the battery and the controller is burnt or corroded, you will need to replace either the battery or the controller, or both. See the hardware documentation for information on how to safely access, remove, and replace the battery. Clear Alert Number: None. Related Alert Number: 2188 737. LRA Number: 5199. SNMP Trap Number: 1153. 2175: The controller battery has been replaced. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1151.
48. version EMM0 %1 EMM1 %2; The global hot spare is too small; The initialization sequence of SAS components failed during system startup. SAS management and monitoring is not possible; The non-RAID SCSI driver version is older than the minimum required level. See the Readme file for the validated driver version; The NVRAM has corrupt data; The NVRAM has corrupted data. The controller is reinitializing the NVRAM; The only hot spare available is a SAS disk. SAS disks cannot replace SATA disks; The only hot spare available is a SATA disk. SATA disks cannot replace SAS disks; The Patrol Read corrected a media error; The patrol read has resumed; The Patrol Read has started; The Patrol Read has stopped; The Patrol Read is paused; The Patrol Read mode has changed; The Patrol Read rate has changed; The physical disk blink has ceased; The physical disk blink has initiated; The physical disk Clear operation failed; The physical disk Clear operation has completed; The physical disk Clear operation has started; The physical disk has been started; The physical disk is not certified; The physical disk is not supported; The physical disk is too small to be used for a rebuild; The physical disk rebuild has resumed; The power supply cable has been inserted
49. 0
2146 Bad block replacement error. Severity: Warning / Non-critical. Cause: A portion of a physical disk is damaged. Action: See the Dell OpenManage Server Administrator Storage Management online help or the Dell OpenManage Server Administrator Storage Management User's Guide for more information. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2060. SNMP Trap Numbers: 753.
2147 Bad block sense error. Severity: Warning / Non-critical. Cause: A portion of a physical disk is damaged. Action: See the Dell OpenManage Server Administrator Storage Management online help for more information. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2060. SNMP Trap Numbers: 753.
Table 4-4. Storage Management Messages (continued)
2148 Bad block medium error. Severity: Warning / Non-critical. Cause: A portion of a physical disk is damaged. Action: See the Dell OpenManage Server Administrator Storage Management online help for more information. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2060. SNMP Trap Numbers: 753.
2149 Bad block extended sense error. Severity: Warning / Non-critical. Cause: A portion of a physical disk is damaged. Action: See the Dell OpenManage Server Administrator Storage Management online help for more information. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2060. SNMP Trap Numbers: 753.
2150 Bad block Warning Cau
50. 03 and the Red Hat Enterprise Linux and SUSE Linux Enterprise Server event viewers.
Viewing Events in Windows 2000 Advanced Server and Windows Server 2003
1 Click the Start button, point to Settings, and click Control Panel.
2 Double-click Administrative Tools, and then double-click Event Viewer.
3 In the Event Viewer window, click the Tree tab and then click System Log. The System Log window displays a list of recently logged events.
4 To view the details of an event, double-click one of the event items.
NOTE: You can also look up the dcsys32.log file in the install_path\omsa\log directory to view the separate event log file. The default install_path is C:\Program Files\Dell\SysMgt.
Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server
1 Log in as root.
2 Use a text editor such as vi or emacs to view the file named /var/log/messages.
The following example shows the Red Hat Enterprise Linux and SUSE Linux Enterprise Server message log, /var/log/messages. The text in boldface type indicates the message text.
NOTE: These messages are typically displayed as one long line. In the following example, the message is displayed using line breaks to help you see the message text more clearly.
Feb 6 14:20:51 server01 Server Administrator: Instrumentation Service EventID: 1000 Server Administrator starting
Feb 6 14:20:51 server01 Server Administrator: Instrumentation Servic
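The message-log layout described above can be picked apart with a short script. The following is a minimal sketch, not a Dell-provided tool. It assumes each entry has the form "month day time host Server Administrator: service EventID: number message", with colons after "Server Administrator" and "EventID" (an assumption, since extracted copies of the sample may drop punctuation). The names parse_event and find_events are hypothetical helpers introduced only for illustration.

```python
import re

# Assumed entry layout (hypothetical, modeled on the sample message log):
# "<Mon> <day> <hh:mm:ss> <host> Server Administrator: <service> EventID: <id> <message>"
LINE_RE = re.compile(
    r"^(?P<timestamp>\w{3}\s+\d{1,2}\s+\d{2}:\d{2}:\d{2})\s+"
    r"(?P<host>\S+)\s+Server Administrator:\s+"
    r"(?P<service>.+?)\s+EventID:\s+(?P<event_id>\d+)\s+"
    r"(?P<message>.*)$"
)


def parse_event(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    event = match.groupdict()
    event["event_id"] = int(event["event_id"])
    return event


def find_events(lines, event_id=None):
    """Yield parsed Server Administrator events, optionally filtered by event ID."""
    for line in lines:
        event = parse_event(line)
        if event is None:
            continue  # not a Server Administrator entry
        if event_id is None or event["event_id"] == event_id:
            yield event


if __name__ == "__main__":
    sample = [
        "Feb 6 14:20:51 server01 Server Administrator: "
        "Instrumentation Service EventID: 1000 Server Administrator starting",
        "Feb 6 14:20:52 server01 kernel: unrelated entry",
    ]
    for event in find_events(sample):
        print(event["event_id"], event["message"])
```

On a live system the same functions could be fed the lines of /var/log/messages opened as root; entries that do not match the assumed layout are skipped rather than reported as errors.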
51. 134 sensor AC power cord 9 chassis intrusion 9 current 9 fan 9 fan enclosure 9 hardware log 9 memory prefailure 9 power supply 9 processor 10 50 redundancy unit 9 temperature 9 voltage 9 Server Administrator starting 17 Server Administrator startup complete 17 Service tag changed 131 Single bit ECC error limit exceeded 145 Single bit ECC error 182 Single bit ECC error The DIMM is critically degraded 193 Single bit ECC error The DIMM is critically degraded There will be no further reporting 194 232 Index Single bit ECC error The DIMM is degrading 193 Smart configuration change 109 Smart FPT exceeded 108 SMART thermal shutdown is disabled 170 SMART thermal shutdown is enabled 169 Smart warning 110 Smart warning degraded 113 Smart warning temperature 111 SMBIOS data is absent 18 System Event Log Messages 57 system management data manager started 19 system management data manager stopped 19 T Temperature dropped below the minimum failure threshold 107 Temperature dropped below the minimum warning threshold 106 Temperature exceeded the maximum failure threshold 106 Temperature exceeded the maximum warning threshold 105 temperature sensor 9 Temperature sensor detected a failure value 22 Temperature sensor detected a non recoverable value 22 Temperature sensor detected a warning value 21 Temperature Sensor E
52. 22 Index 2254 167 2255 168 2257 168 2258 168 2259 169 2260 169 2261 169 2262 169 2263 170 2264 170 2265 171 2266 171 2267 172 2268 172 2269 173 2270 173 2271 173 2272 174 2273 174 2274 175 2276 175 2277 175 2278 176 2279 176 2280 177 2281 177 2282 178 2283 178 2284 179 2285 179 2286 179 2287 180 2288 180 2289 181 2290 182 2291 182 2292 182 2293 183 2294 183 2295 183 2296 184 2297 184 2298 184 2299 185 2300 185 2301 186 2302 186 2303 186 2304 187 2305 187 2306 188 2307 188 2309 189 2310 189 2311 190 2312 190 2313 191 2314 191 2315 191 2316 192 2318 192 2319 193 2320 193 2321 194 2322 194 2323 195 2324 195 2325 195 2326 196 2327 196 2328 197 2329 197 2330 198 2331 198 2332 198 2334 199 2335 200 2336 201 2337 201 2338 202 2339 202 Index 223 2340 203 2341 203 2342 204 2343 204 2346 205 2347 205 2348 206 2349 206 2350 206 2351 207 2352 207 2353 207 2356 208 2357 209 2358 209 2359 210 2360 210 2361 210 2362 211 2364 211 2366 211 2367 212 2368 212 2369 213 2371 213 2372 213 232350204 224 Index 2374 214 2375 214 2376 215 2377 215 2378 215 2379 216 2380 216 2381 216 A A bad disk block could not be reassigned during a write operation 206 A bad disk block has been reassigned 198 A block on the physical disk has been p
53. 3-12 BIOS Generated System Events (continued)
Mem Fatal NB CRC: uncorrectable ECC was asserted. Severity: Critical. Cause: This event is generated when CRC errors occur while removing from memory.
Mem Overtemp: critical over temperature was asserted. Severity: Critical. Cause: This event is generated when system memory reaches critical temperature.
USB Over-current: transition to non-recoverable. Severity: Critical. Cause: This event is generated when the USB exceeds a predefined current level.
Hdwr version err: Hardware incompatibility (BMC/iDRAC Firmware and CPU mismatch) was asserted. Severity: Critical. Cause: This event is generated when there is a mismatch between the BMC and iDRAC firmware and the processor in use or vice versa.
Hdwr version err: Hardware incompatibility (BMC/iDRAC Firmware and CPU mismatch) was deasserted. Severity: Information. Cause: This event is generated when the earlier mismatch between the BMC and iDRAC firmware and the processor is corrected.
Hdwr version err: Hardware incompatibility (BMC/iDRAC Firmware and CPU mismatch) was deasserted. Severity: Information. Cause: This event is generated when an earlier hardware mismatch is corrected.
SBE Log Disabled: correctable memory error logging disabled was asserted. Severity: Critical. Cause: This event is generated when the ECC single bit error rate is exceeded.
Table 3-12. BIOS Generated System Events (continued)
54. 5 Virtual disk check consistency completed 100 Virtual disk check consistency failed 97 Virtual disk check consistency started 93 Virtual disk configuration changed 90 Virtual disk created 90 Virtual disk degraded 92 Virtual disk deleted 90 Virtual disk failed 91 Virtual disk format changed 97 Virtual disk format completed 100 Virtual disk format started 93 Virtual disk has inconsistent data 177 Virtual disk initialization 125 Virtual disk initialization cancelled 96 Virtual disk initialization completed 101 Virtual disk initialization failed 98 Virtual disk initialization started 93 Virtual disk rebuild completed 102 Virtual disk rebuild failed 99 Virtual disk rebuild started 94 Virtual disk reconfiguration completed 102 Virtual disk reconfiguration failed 99 Virtual disk reconfiguration started 94 Virtual Disk Redundancy has been degraded 213 Virtual disk renamed 133 voltage sensor 9 Voltage sensor detected a failure value 28 60 Voltage sensor detected a non recoverable value 28 Voltage sensor detected a warning value 27 Voltage Sensor Events 58 Voltage sensor has failed 26 59 voltage sensor messages 26 58 Voltage sensor returned to a normal value 27 Voltage sensor value unknown 26 59
55. 7 LRA Number: None
Table 4-4. Storage Management Messages (continued)
2053 Virtual disk created. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1201.
2054 Virtual disk deleted. Severity: Warning / Non-critical. Cause: A virtual disk has been deleted. Performing a Reset Configuration may detect that a virtual disk has been deleted and generate this alert. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2080. SNMP Trap Numbers: 1203.
2055 Virtual disk configuration changed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1201.
2056 Virtual disk failed. Severity: Critical / Failure / Error. Clear Alert Number: None. SNMP Trap Numbers: 1204. Related Alert Number: 2048, 2049, 2050, 2076, 2079, 2081, 2129. Cause: One or more physical disks included in the virtual disk have failed. If the virtual disk is non-redundant (does not use mirrored or parity data), then the failure of a single physical disk can cause the virt
56. 78 A system BIOS update has been scheduled for the next reboot 17 A user has discarded data from the controller cache 210 A virtual disk and its mirror have been split 115 A virtual disk blink has been initiated 164 A virtual disk blink has ceased 164 A virtual disk is permanently degraded 189 AC power cord is not being monitored 47 AC power cord messages 47 AC power cord sensor 9 Index 225 AC power cord sensor has failed 47 67 AC power has been lost 48 AC power has been restored 47 All virtual disks are missing from the controller This situation was discovered during system start up 211 An attempt to hot plug an EMM has been detected This type of hot plug is not supported 187 An EMM has been discovered 182 An EMM has been inserted 184 An EMM has been removed 184 An enclosure blink has ceased 169 An enclosure blink operation has initiated 169 An invalid SAS configuration has been detected 143 Array Manager is installed on the system 124 Asset name changed 131 Asset tag changed 130 Automatic System Recovery ASR action was performed 18 226 Index Background initialization cancelled 122 Background initialization completed 123 Background initialization failed 122 Background initialization started 122 Bad block extended medium error 130 Bad block extended sense error 130 Bad block medium error 130 Bad block
57. 79 The controller battery Learn cycle has been postponed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1151.
Table 4-4. Storage Management Messages (continued)
2180 The controller battery Learn cycle will start in %1 days. Severity: OK / Normal. Cause: This alert is for informational purposes. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1151.
2181 The controller battery Learn cycle will start in %1 hours. Severity: OK / Normal. Cause: This alert is for informational purposes. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1151.
2182 An invalid SAS configuration has been detected. Severity: Critical / Failure / Error. Cause: The controller and attached enclosures are not cabled correctly. Clear Alert Number: None. SNMP Trap Numbers: 754. Action: See the hardware documentation for information on correct
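The substitution-variable mechanism described for alerts 2180 and 2181 can be illustrated with a small sketch. It assumes the placeholders are written as %1, %2, and so on (an assumption: the extracted text shows only a bare "1"), and fill_substitutions is a hypothetical helper, not a Storage Management function.

```python
import re


def fill_substitutions(template, *values):
    """Replace %1, %2, ... in template with the corresponding positional value.

    Placeholders without a supplied value are left unchanged.
    """
    def replace(match):
        index = int(match.group(1)) - 1  # %1 maps to values[0]
        if 0 <= index < len(values):
            return str(values[index])
        return match.group(0)

    return re.sub(r"%(\d+)", replace, template)


if __name__ == "__main__":
    template = "The controller battery Learn cycle will start in %1 days"
    print(fill_substitutions(template, 3))
```

Leaving unmatched placeholders in place keeps a missing value visible in the Alert Log text rather than silently dropping it.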
58. 95 The power supply is switched on 195 The RAID controller firmware and driver validation was not performed The configuration file cannot be opened 135 Index 235 The RAID controller firmware and driver validation was not performed The configuration file is out of date or corrupted 135 The rebuild failed due to errors on the source physical disk 205 The rebuild failed due to errors on the target physical disk 206 The SCSI Enclosure Processor SEP has been rebooted as part of the firmware download operation and will be unavailable until the operation completes 212 The virtual disk cache policy has changed 150 The virtual disk Check Consistency has made corrections and completed 148 The virtual disk Read policy has changed 149 The virtual disk reconfiguration has resumed 148 There is a bad sensor on an enclosure 184 There was an unrecoverable disk media error during the rebuild 206 Thermal shutdown protection has been initiated 18 236 Index U understanding event description 13 Unsupported configuration detected The SCSI rate of the enclosure management modules EMMs is not the same EMM0 1 EMM1 2 139 User initiated host system reset 18 V viewing event information 13 event messages 10 events in Red Hat Linux 12 events in SUSE Linux Enterprise Server 12 events in Windows 2000 11 Virtual disk check consistency cancelled 9
59. Alert 1153 requires Non reconditioning Number dered critical Action Initiate the battery None amp learn cycle Related Alert the battery j N mber learn cycle Note LRA Number 2070 2211 The physical Warning Cause The physical disk Clear Alert 903 disk is not Non may not have a supported Number supported critical version of the firmware or None the disk may not be Related Alert supported by Dell Ninabes Action If the disk is None supported by Dell update TRA Number the firmware to a 2070 supported version If the disk is not supported by Dell replace the disk with one that is supported 2212 The OK Cause This alert is for Clear Alert 1151 controller Normal informational purposes Number battery Action None None teimperaturg Related Alert is above Number normal None LRA Number None Storage Management Message Reference 154 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2213 Recharge Warning Cause The battery has Clear Alert 1153 count Non been recharged more times Number maximum critical than the battery recharge None exceeded limit allows Related Alert Action Replace the battery Number pack None LRA Number 2100 2214 Battery OK Cause This alert is for Clear Alert 1151 charge in Normal informational purposes Number Pigetess Action None None Related Alert Number N
60. disk that was marked as missing has been replaced. Severity: Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: Alert 2352 is a clear alert for alert 2351. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 901.
2353 The enclosure temperature has returned to normal. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: Alert 2353 is a clear alert for alerts 2100 and 2101. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 851.
Table 4-4. Storage Management Messages (continued)
2356 SAS SMP communications error %1. Severity: Critical / Failure / Error. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2061. SNMP Trap Numbers: 754. Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the firmware and is displayed with the alert in the Alert Log. This text can vary depending on the situation. The reference to SMP in this text refers to SAS Management Protocol. Action: There may be a SAS topology error. See the hardware documentation for information on correct SAS topology configurations. There may be problems with the cables, such as a loose connection or an invalid cabling configuration. See the hardware documentation for information on correct cabling configurations. Check if the firmware is a supported versio
61. SMART polling failed. Severity: Failure / Error. Cause: firmware attempted a SMART polling on the hot spare but was unable to complete it. The controller has lost communication with the hot spare. Action: Check the health of the disk assigned as a hot spare. You may need to replace the disk and reassign the hot spare. Make sure the cables are attached securely. See the Cables Attached Correctly section in the Dell OpenManage Server Administrator Storage Management User's Guide for more information on checking the cables. Clear Alert Number: None. Related Alert Number: None. LRA Number: 207. SNMP Trap Numbers: 904.
2283 A redundant path is broken. Severity: Warning / Non-critical. Cause: The controller has two connectors that are connected to the same enclosure. The communication path on one connector has lost connection with the enclosure. The communication path on the other connector is reporting this loss. Action: Make sure the cables are attached securely. Make sure both EMMs are healthy. Clear Alert Number: 2284. Related Alert Number: None. LRA Number: 2070. SNMP Trap Numbers: 903.
Table 4-4. Storage Management Messages (continued)
2284 A redundant path has been restored. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. SNMP Trap Numbers: 901. Clear Alert Status: Alert 2284 is a clear alert for alert 2283. Related Alert Number: None
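The clear-alert relationship (for example, alert 2284 is a clear alert for alert 2283) lends itself to a simple bookkeeping sketch. The mapping below uses only pairs stated in this guide. The function name open_alerts is hypothetical, and the sketch deliberately ignores which controller, disk, or enclosure raised each alert, which a real monitoring script would need to track.

```python
# Clear-alert relationships quoted in this guide: clear alert ID -> alert IDs it clears.
CLEARS = {
    2243: {2242},        # Patrol Read stopped clears Patrol Read started
    2284: {2283},        # redundant path restored clears redundant path broken
    2288: {2287},        # patrol read resumed clears Patrol Read paused
    2352: {2351},
    2353: {2100, 2101},  # enclosure temperature returned to normal
}


def open_alerts(event_ids):
    """Replay alert IDs in order and return those not yet cleared.

    Simplification (assumption): alerts are not scoped to a device here.
    """
    outstanding = []
    for event_id in event_ids:
        cleared = CLEARS.get(event_id)
        if cleared is not None:
            # A clear alert removes the alerts it clears and is not itself kept open.
            outstanding = [e for e in outstanding if e not in cleared]
        else:
            outstanding.append(event_id)
    return outstanding


if __name__ == "__main__":
    # Redundant path breaks, Patrol Read pauses, then the path is restored.
    print(open_alerts([2283, 2287, 2284]))
```

In this replay only alert 2287 remains outstanding, because 2284 clears 2283 and nothing has yet cleared the paused Patrol Read.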
62. Although user data may have been lost, this alert does not always indicate that relevant or user data has been lost. Action: Verify that the battery and memory are functioning properly. LRA Number: 2060.
2187 Single bit ECC error limit exceeded. Severity: Warning / Non-critical. Cause: The system memory is malfunctioning. Action: Replace the battery pack. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2060. SNMP Trap Numbers: 753.
Table 4-4. Storage Management Messages (continued)
2188 The controller write policy has been changed to Write Through. Severity: OK / Normal. Cause: The controller battery is unable to maintain cached data for the required period of time. For example, if the required period of time is 24 hours, the battery is unable to maintain cached data for 24 hours. It is normal to receive this alert during the battery Learn cycle, as the Learn cycle discharges the battery before recharging it. When discharged, the battery cannot maintain cached data. Action: Check the health of the battery. If the battery is weak, replace the battery pack. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1151.
2189 Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. SNMP Trap Numbers: 1151. Description: The controller write policy has been changed to
63. Contact Dell technical support.
2300 The enclosure is unstable. Severity: Critical / Failure / Error. Cause: The controller is not receiving a consistent response from the enclosure. There could be a firmware problem or an invalid cabling configuration. If the cables are too long, they will degrade the signal. Action: Power down all enclosures attached to the system and reboot the system. If the problem persists, upgrade the firmware to the latest supported version. You can download the most current version of the driver and firmware from support.dell.com. Make sure the cable configuration is valid. See the hardware documentation for valid cabling configurations. Clear Alert Number: None. Related Alert Number: None. SNMP Trap Numbers: 854.
Table 4-4. Storage Management Messages (continued)
2301 The enclosure has a hardware error. Cause: The enclosure or an enclosure component is in a Failed or Degraded state. Action: Check the health of the enclosure and its components. Replace any hardware that is in a Failed state. See the hardware documentation for more information. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2091. SNMP Trap Numbers: 854.
2302 The enclosure is not responding. Clear Alert Number: None. SNMP Trap Numbers: 854. Cause: The enclosure or an enclosure component is in a Failed or Degraded state. Actio
64. initialization failed. Severity: Critical / Failure / Error. Cause: A physical disk included in the virtual disk has failed or a user has cancelled the initialization. Action: If a physical disk has failed, then replace the physical disk. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2081. SNMP Trap Numbers: 1204.
2080 Physical disk initialize failed. Severity: Critical / Failure / Error. Cause: The physical disk has failed or is corrupt. Action: Replace the failed or corrupt disk. You can identify a disk that has failed by locating the disk that has a red X for its status. Restart the initialization. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2071. SNMP Trap Numbers: 904.
Table 4-4. Storage Management Messages (continued)
2081 Virtual disk reconfiguration failed. Severity: Critical / Failure / Error. Cause: A physical disk included in the virtual disk has failed or is corrupt. A user may also have cancelled the reconfiguration. Action: Replace the failed or corrupt disk. You can identify a disk that has failed by locating the disk that has a red X for its status. If the physical disk is part of a redundant array, then rebuild the physical disk. When finished, restart the reconfiguration. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2081. SNMP Trap Numbers: 1204.
2082 Virtual disk Critical Cause A physic
65. Dell OpenManage Server Administrator Messages Reference Guide
Notes and Notices
NOTE: A NOTE indicates important information that helps you make better use of your computer.
NOTICE: A NOTICE indicates either potential damage to hardware or loss of data and tells you how to avoid the problem.
Information in this document is subject to change without notice. © 2003-2008 Dell Inc. All rights reserved.
Reproduction of these materials in any manner whatsoever without the written permission of Dell Inc. is strictly forbidden.
Trademarks used in this text: Dell, the DELL logo, and Dell OpenManage are trademarks of Dell Inc.; Microsoft, Windows, and Windows Server are either trademarks or registered trademarks of Microsoft Corporation in the United States and/or other countries; Red Hat and Red Hat Enterprise Linux are registered trademarks of Red Hat, Inc.; SUSE is a registered trademark of Novell, Inc. in the United States and other countries.
Other trademarks and trade names may be used in this document to refer to either the entities claiming the marks and names or their products. Dell Inc. disclaims any proprietary interest in trademarks and trade names other than its own.
August 2008
Contents
1 Introduction 7
What's New in this Release 7
Messages Not Described in This Guide 8
Understanding Event Messages 8
Sample Event Message Text 10
Viewing Alerts
66. LRA Number: None
2285 A disk media error was corrected during recovery. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 901.
2286 A Learn cycle start is pending while the battery charges. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1151.
Table 4-4. Storage Management Messages (continued)
2287 The Patrol Read is paused. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: 2288. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 751.
2288 The patrol read has resumed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: Alert 2288 is a clear alert for alert 2287. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 751.
2289 Multi bit ECC error. Severity: Critical / Failure / Error. Cause: An error involving multiple bits has been encountered during a read or write operation. The error correction algorithm recalculates
67. Number: None. LRA Number: None
Table 4-4. Storage Management Messages (continued)
2106 Smart FPT exceeded. Severity: Warning / Non-critical. Cause: A disk on the specified controller has received a SMART alert (predictive failure) indicating that the disk is likely to fail in the near future. Action: Replace the disk that has received the SMART alert. If the physical disk is a member of a non-redundant virtual disk, then back up the data before replacing the disk. NOTICE: Removing a physical disk that is included in a non-redundant virtual disk will cause the virtual disk to fail and may cause data loss. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2070. SNMP Trap Numbers: 903.
2107 Smart configuration change. Severity: Critical / Failure / Error. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2071. SNMP Trap Numbers: 904. Cause: A disk has received a SMART alert (predictive failure) after a configuration change. The disk is likely to fail in the near future. Action: Replace the disk that has received the SMART alert. If the physical disk is a member of a non-redundant virtual disk, then back up the da
68. Number: None
Table 4-4. Storage Management Messages (continued)
2249 The physical disk Clear operation has started. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 901.
2250 Redundant Path is broken. Severity: Warning / Non-critical. Cause: This alert is provided for informational purposes. Action: Check the connection to the enclosure, which is degraded. Clear Alert Number: None. Related Alert Number: None. Local Response Agent (LRA) Alert Number: None. SNMP Trap Numbers: 751.
2251 The physical disk blink has initiated. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 901.
2252 The physical disk blink has ceased. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 901.
2253 Redundant path restored. Severity: OK / Normal / Informational. Cause: This alert is provided for informational purposes. Clear Alert Number: None. SNMP Trap Numbers: 751.
69. Sensor location: <Location in chassis> Chassis location: <Name of chassis> Cause: The AC power cord status is not being monitored. This occurs when a system's expected AC power configuration is set to nonredundant. The sensor location and chassis location information are provided.
1502 AC power has been restored. Sensor location: <Location in chassis> Chassis location: <Name of chassis> Severity: Information. Cause: An AC power cord that did not have AC power has had the power restored. The sensor location and chassis location information are provided.
Table 2-12. AC Power Cord Messages (continued)
1503 AC power has been lost. Sensor location: <Location in chassis> Chassis location: <Name of chassis> Severity: Warning. Cause: An AC power cord has lost its power, but there is sufficient redundancy to classify this as a warning. The sensor location and chassis location information are provided.
1504 AC power has been lost. Sensor location: <Location in chassis> Chassis location: <Name of chassis> Severity: Error. Cause: An AC power cord has lost its power, and lack of redundancy requires this to be classified as an error. The sensor location and chassis location information are provided.
1505 AC power has been lost. Sensor location: <Location in chassis> Chassis location: <Name of chassis> Severity: Error. Cause: An AC po
70. Table 4-4. Storage Management Messages (continued)
2346 Error occurred: %1. Severity: Warning / Non-critical. Cause: A physical device may have an error. The %1 indicates a substitution variable. The text for this substitution variable is generated by the firmware and is displayed with the alert in the Alert Log. This text can vary depending on the situation. Action: Verify the health of attached devices. Review the Alert Log for significant events. Run the PHY integrity diagnostic tests. You may need to replace faulty hardware. Make sure the cables are attached securely. See the hardware documentation for more information. Clear Alert Number: None. Related Alert Number: 2048, 2050, 2056, 2057, 2076, 2079, 2081, 2083, 2095, 2129, 2201, 2203, 2270, 2282, 2369. LRA Number: 2070. SNMP Trap Numbers: 903.
2347 The rebuild failed due to errors on the source physical disk. Severity: Critical / Failure / Error. Cause: You are attempting to rebuild data that resides on a defective disk. Action: Replace the source disk and restore from backup. Clear Alert Number: None. Related Alert Number: 2195, 2346. LRA Number: 2071. SNMP Trap Numbers: 904.
2348 Severity: Critical. SNMP Trap Numbers: 904. Cause: You are attempting. Description: The rebuild failed due t
71. Table 4-4. Storage Management Messages (continued)
2338 The controller has recovered cached data from the BBU. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1151.
2339 The factory default settings have been restored. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 751.
2340 The BGI completed with uncorrectable errors. Severity: Critical / Failure / Error. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2081. SNMP Trap Numbers: 1204. Cause: The BGI task encountered errors that cannot be corrected. The virtual disk contains physical disks that have unusable disk space or disk errors that cannot be corrected. Action: Replace the physical disk that contains the disk errors. Review other alert messages to identify the physical disk that has errors. If the virtual disk is redundant, you can replace the physical disk and continue using the virtual disk. If the virtual disk is non-redundant, you may need to recreate the virtual dis
72. al disk included in the virtual disk has failed or is corrupt. A user may also have cancelled the rebuild. Action: Replace the failed or corrupt disk. You can identify a disk that has failed by locating the disk that has a red X for its status. Restart the virtual disk rebuild. Event description (continued): rebuild failed. Severity (continued): Failure / Error. Clear Alert Number: None. Related Alert Number: 2048. LRA Number: 2081. SNMP Trap Numbers: 1204.
Table 4-4. Storage Management Messages (continued)
2083 Physical disk rebuild failed. Severity: Critical / Failure / Error. Cause: A physical disk included in the virtual disk has failed or is corrupt. A user may also have cancelled the rebuild. Action: Replace the failed or corrupt disk. You can identify a disk that has failed by locating the disk that has a red X for its status. Restart the rebuild. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2071. SNMP Trap Numbers: 904.
2085 Virtual disk check consistency completed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: Alert 2085 is a clear alert for alert 2058. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1201.
2086 Virtual disk format completed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. SNMP Trap Numbers: 1201. Clear Alert Status: Alert 2086 is a clear
73. ancy state, and the number of devices required for full redundancy are provided. Redundancy unit: <Redundancy location in chassis> Chassis location: <Name of chassis> Previous redundancy state was: <State>
Table 2-8. Redundancy Unit Messages (continued)
Event ID 1301. Description: Redundancy sensor value unknown Redundancy unit: <Redundancy location in chassis> Chassis location: <Name of chassis> Previous redundancy state was: <State>. Severity: Information. Cause: A redundancy sensor in the specified system could not obtain a reading. The redundancy unit location, chassis location, previous redundancy state, and the number of devices required for full redundancy are provided.
Event ID 1302. Description: Redundancy not applicable Redundancy unit: <Redundancy location in chassis> Chassis location: <Name of chassis> Previous redundancy state was: <State>. Severity: Information. Cause: A redundancy sensor in the specified system detected that a unit was not redundant. The redundancy location, chassis location, previous redundancy state, and the number of devices required for full redundancy are provided.
Event ID 1303. Severity: Information. Cause: A redundancy sensor in the specified system detected that a redundant unit is offline. The redundancy unit location, chassis location, previous redundancy state ... Description: Redundancy is offline Redundancy unit: <Redundancy location in chassis> Chassis location: <Name of c
74. and Event Messages
Viewing Events in Windows 2000 Advanced Server and Windows Server 2003
Viewing Events in Red Hat Enterprise Linux and SUSE Linux Enterprise Server
Viewing the Event Information
Understanding the Event Description
2 Event Message Reference
Miscellaneous Messages
Temperature Sensor Messages
Cooling Device Messages
Voltage Sensor Messages
Current Sensor Messages
Chassis Intrusion Messages
Chassis Management Controller Messages
Redundancy Unit Messages
Power Supply Messages
Memory Device Messages
Fan Enclosure Messages
AC Power Cord Messages
Hardware Log Sensor Messages
Processor Sensor Messages
Pluggable Device Messages
Battery Sensor Messages
3 System Event Log Messages for IPMI Systems
Temperature Sensor Events
Voltage Sensor Events
Fan Sensor Events
Processor Status Events
Power Supply Events
Memory ECC Events
BMC Watchdog Events
Memory Events
Hardware Log Sensor Events
Drive Events
75. ... spare imported as global due to missing arrays. Severity: ... Normal. Cause: ... informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 901.
Table 4-4. Storage Management Messages (continued)
2367 Rebuild not possible as SAS/SATA is not supported in the same virtual disk. Severity: Warning / Non-critical. Cause: The physical disk is using an incompatible technology. Action: All physical disks in the virtual disk must use the same technology. You cannot use both SAS and SATA physical disks in the same virtual disk. Remove the physical disk and insert a new physical disk that uses the correct technology. If the rebuild does not start automatically after you have inserted a suitable physical disk, then run the Rebuild task. Clear Alert Number: None. Related Alert Number: 2326. SNMP Trap Numbers: 903.
2368 The SCSI Enclosure Processor (SEP) has been rebooted as part of the firmware download operation and will be unavailable until the operation completes. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: 2049, 2052, 2162, 2292. LRA Number: None. SNMP Trap Numbers: 851.
76. as failed. The failed component may have been identified by the controller while performing a task such as a rescan or a check consistency. Action: Replace the failed component. You can identify which disk has failed by locating the disk that has a red X for its status. Perform a rescan after replacing the disk. Related Alert Number: 2095, 2201, 2203. LRA Number: 2051, 2061, 2071, 2081, 2091, 2101. SNMP Trap Numbers (continued): 904, 954, 1004, 1054, 1104, 1154, 1204.
Table 4-4. Storage Management Messages (continued)
2049 Physical disk removed. Severity: Warning / Non-critical. Cause: A physical disk has been removed from the disk group. This alert can also be caused by loose or defective cables or by problems with the enclosure. Action: If a physical disk was removed from the disk group, either replace the disk or restore the original disk. On some controllers, a removed disk has a red X for its status. On other controllers, a removed disk may have an Offline status or is not displayed on the user interface. Perform a rescan after replacing or restoring the disk. If a disk has not been removed from the disk group, then check for problems with the cables. See the online help for more information on checking the cables. Make sure that the enclosure is powered on. If the problem persists, check the enclo
77. attached securely. If the problem persists, replace the cable with a valid cable according to SAS specifications. If the problem still persists, you may need to replace some devices such as the controller or EMM. See the hardware documentation for more information.
Table 4-4. Storage Management Messages (continued)
2330 SAS port report: %1. Severity: OK / Normal. Cause: This alert is for informational purposes. The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the Alert Log. This text can vary depending on the situation. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 751.
2331 A bad disk block has been reassigned. Severity: OK / Normal. Cause: The disk has a bad block. Data has been readdressed to another disk block and no data loss has occurred. Action: Monitor the disk for other alerts or indications of poor health. For example, you may receive alert 2306. Replace the disk if you suspect there is a problem. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 901.
2332 A controller hot plug has been detected. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. SNMP Trap Numbers: 751. Clear Alert Number: None. Related Alert Number: None
78. ... build completed. Severity: ... Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: Alert 2091 is a clear alert for alert 2064. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1201.
2092 Physical disk rebuild completed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: Alert 2092 is a clear alert for alert 2065. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 901.
Table 4-4. Storage Management Messages (continued)
2094 Predictive Failure reported. Severity: Warning / Non-critical. Cause: The physical disk is predicted to fail. Many physical disks contain Self Monitoring Analysis and Reporting Technology (SMART). When enabled, SMART monitors the health of the disk based on indications such as the number of write operations that have been performed on the disk. Action: Replace the physical disk. Even though the disk may not have failed yet, it is strongly recommended that you replace the disk. If this disk is part of a redundant virtual disk, perform the Offline task on the disk, replace the disk, and then assign a hot spare; the rebuild will start automatically. Clear Alert Number: None. Related Alert Number: Non
79. build aborted was asserted. Cause: ... drive rebuilding process is aborted.
Intrusion Events
The chassis intrusion messages are a security measure. Chassis intrusion alerts are generated when the system's chassis is opened. Alerts are sent to prevent unauthorized removal of parts from the chassis.
Table 3-11. Intrusion Events
Event Message: <Intrusion sensor Name> sensor detected an intrusion. Severity: Critical. Cause: This event is generated when the intrusion sensor detects an intrusion.
Event Message: <Intrusion sensor Name> sensor returned to normal state. Severity: Information. Cause: This event is generated when the earlier intrusion has been corrected.
Event Message: <Intrusion sensor Name> sensor intrusion was asserted while system was ON. Severity: Critical. Cause: This event is generated when the intrusion sensor detects an intrusion while the system is on.
Event Message: <Intrusion sensor Name> sensor intrusion was asserted while system was OFF. Severity: Critical. Cause: This event is generated when the intrusion sensor detects an intrusion while the system is off.
BIOS Generated System Events
The BIOS generated messages monitor the health and functionality of the chipsets, I/O channels, and other BIOS related functions.
Table 3-12. BIOS Generated System Events
Event Message: System Event I/O. Severity: Critical. Cause: Thi
80. by the Server Administrator. When an event occurs on your system, the Server Administrator sends information about one of the following event types to the systems management console.
Table 1-1. Understanding Event Messages
Alert Severity: OK / Normal. Component Status: An event that describes the successful operation of a unit. The alert is provided for informational purposes and does not indicate an error condition. For example, the alert may indicate the normal start or stop of an operation, such as power supply, or a sensor reading returning to normal.
Alert Severity: Warning / Non-critical. Component Status: An event that is not necessarily significant, but may indicate a possible future problem. For example, a Warning / Non-critical alert may indicate that a component (such as a temperature probe in an enclosure) has crossed a warning threshold.
Alert Severity: Critical / Failure / Error. Component Status: A significant event that indicates actual or imminent loss of data or loss of function. For example, crossing a failure threshold or a hardware failure such as an array disk.
Server Administrator generates events based on status changes in the following sensors:
Temperature Sensor: Helps protect critical components by alerting the systems management console when temperatures become too high inside a chassis; also monitors a variety of locations in the chassis and in any attached systems.
Fan Sensor: Monitors fans in various locations in the chassis and
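The three severity levels in Table 1-1 lend themselves to a small data model in a monitoring script. The following is a minimal Python sketch under stated assumptions, not part of Server Administrator: the Severity enum, the numeric ranking, and the worst() and needs_attention() helpers are hypothetical names introduced only for illustration.

```python
from enum import Enum


class Severity(Enum):
    """The three alert severities named in Table 1-1."""
    OK = "OK / Normal"
    WARNING = "Warning / Non-critical"
    CRITICAL = "Critical / Failure / Error"


# Hypothetical ordering used only by this sketch: higher means more severe.
_RANK = {Severity.OK: 0, Severity.WARNING: 1, Severity.CRITICAL: 2}


def worst(severities):
    """Return the most severe level in an iterable, or OK if it is empty."""
    return max(severities, key=_RANK.__getitem__, default=Severity.OK)


def needs_attention(severity):
    """Anything other than OK / Normal may call for a corrective action."""
    return severity is not Severity.OK
```

A script could, for example, roll the severities of several components up into a single status with worst() and only notify an operator when needs_attention() is true.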
81. cables. You should also check the connection to the controller battery and the battery health. A battery with a weak or depleted charge may cause this alert. LRA Number: 2050, 2060, 2070, 2080, 2090, 2100. SNMP Trap Numbers (continued): 1153, 1203.
Table 4-4. Storage Management Messages (continued)
2265 A device is in an unknown state. Severity: Warning / Non-critical. Cause: The controller cannot communicate with a device. The state of the device cannot be determined. There may be a bad or loose cable. The system may also be experiencing problems with the application programming interface (API). There could also be a problem with the driver or firmware. Action: Check the cables. Check if the controller has a supported version of the driver and firmware. You can download the most current version of the driver and firmware from support.dell.com. Rebooting the system may also resolve this problem. Clear Alert Number: None. Related Alert Number: 2048, 2050. LRA Number: 2050, 2060, 2070, 2080, 2090, 2100. SNMP Trap Numbers: 753, 803, 853, 903, 953, 1003, 1053, 1103, 1153, 1203.
2266 Controller log file entry: %1. Severity: OK / Normal. Clear Alert Number: None. Related Alert Number: None. SNMP Trap Numbers: 751, 801, 851, 901, 951, 1001, 1051, 1101. Cause: This alert is for informational purposes. The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and i
82. chassis> Chassis location: <Name of chassis> Additional details: <Additional details for the events>. Cause: ... specified system. The device location, chassis location, and additional event details, if available, are provided.
Table 2-15. Pluggable Device Messages (continued)
Event ID 1652. Description: Device removed from system Device location: <Location in chassis> Chassis location: <Name of chassis> Additional details: <Additional details for the events>. Severity: Information. Cause: A device was removed from the specified system. The device location, chassis location, and additional event details, if available, are provided.
Event ID 1653. Description: Device configuration error detected Device location: <Location in chassis> Chassis location: <Name of chassis> Additional details: <Additional details for the events>. Severity: Error. Cause: A configuration error was detected for a pluggable device in the specified system. The device may have been added to the system incorrectly.
Battery Sensor Messages
Battery sensors monitor how well a battery is functioning. Battery messages listed in Table 2-16 provide status and warning information for batteries in a particular chassis.
Table 2-16. Battery Sensor Messages
Event ID 1700. Description: Battery sensor has failed Sensor ... Severity: Information. Cause: A battery sensor in
83. controller event log and the Server Administrator Alert Log for significant events or alerts that may assist in diagnosing the problem. Check the health of the storage components. See the hardware documentation for more information.
Table 4-4. Storage Management Messages (continued)
2336 Controller event log: %1. Severity: Critical / Failure / Error. Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the controller and is displayed with the alert in the Alert Log. This text is from events in the controller event log that were generated while Storage Management was not running. This text can vary depending on the situation. Action: See the hardware documentation for more information. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2061. SNMP Trap Numbers: 754.
2337 The controller is unable to recover cached data from the battery backup unit (BBU). Severity: Critical / Failure / Error. Cause: The controller was unable to recover data from the cache. Action: Check if the battery is charged and in good health. When the battery charge is unacceptably low, it cannot maintain cached data. Check if the battery has reached its recharge limit. The battery may need to be recharged or replaced. Clear Alert Number: None. Related Alert Number: None. SNMP Trap Numbers: 1154.
84. ct the Storage object and click the Health subtab. Clear Alert Number: 2124. Related Alert Number: 2048, 2049, 2057. LRA Number: 2080, 2090. SNMP Trap Numbers: 1306.
Table 4-4. Storage Management Messages (continued)
2123 (cont'd) The controller status displayed on the Health subtab indicates whether a controller has a failed or degraded component. Click the controller that displays a Warning or Failed status. This action displays the controller Health subtab, which displays the status of the individual controller components. Continue clicking the components with a Warning or Failed status until you identify the failed component. See the online help for more information. See the enclosure documentation for information on replacing enclosure components and for other diagnostic information.
2124 Redundancy normal. Severity: OK / Normal. Cause: This alert is for informational purposes. Data redundancy has been restored to a virtual disk or an enclosure that previously suffered a loss of redundancy. Action: None. Clear Alert Status: Alert 2124 is a clear alert for alerts 2122 and 2123. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1304.
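The Clear Alert Status entries describe a simple relationship: a clear alert cancels one or more earlier alerts (for example, 2124 clears 2122 and 2123). The following Python sketch shows one way a script might replay a stream of alert IDs and keep only those not yet cleared. The CLEARS table holds only pairs quoted in this guide; the outstanding() function and the rule that every non-clearing alert counts as outstanding are assumptions of this sketch, not documented Storage Management behavior.

```python
# Clear alert -> the alerts it clears, as quoted in the Clear Alert Status
# entries of this guide. The table is illustrative and far from complete.
CLEARS = {
    2085: {2058},
    2091: {2064},
    2092: {2065},
    2124: {2122, 2123},
}


def outstanding(alert_ids):
    """Replay alert IDs in arrival order and return those not yet cleared.

    Assumption of this sketch: a clear alert is never itself outstanding,
    and any other alert stays outstanding until a clear alert names it.
    """
    active = []
    for alert_id in alert_ids:
        cleared = CLEARS.get(alert_id)
        if cleared is not None:
            active = [a for a in active if a not in cleared]
        else:
            active.append(alert_id)
    return active
```

For instance, replaying 2122, 2048, 2124 leaves only 2048 outstanding, because 2124 clears the redundancy alert but says nothing about the device failure.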
85. dictive failure was asserted. Cause: ... power supply is about to fail.
Event Message: <Power Supply Sensor Name> input lost was asserted. Severity: Critical. Cause: This event is generated when the power supply is unplugged.
Event Message: <Power Supply Sensor Name> predictive failure was deasserted. Severity: Information. Cause: This event is generated when the power supply has recovered from an earlier predictive failure event.
Event Message: <Power Supply Sensor Name> input lost was deasserted. Severity: Information. Cause: This event is generated when the power supply is plugged in.
Memory ECC Events
The memory ECC event messages monitor the memory modules in a system. These messages monitor the ECC memory correction rate and the type of memory events that occurred.
Table 3-6. Memory ECC Events
Event Message: ECC error correction detected on Bank DIMM A B. Severity: Information. Cause: This event is generated when there is a memory error correction on a particular Dual Inline Memory Module (DIMM).
Event Message: ECC uncorrectable error detected on Bank DIMM. Severity: Critical. Cause: This event is generated when the chipset is unable to correct the memory errors. Usually, a bank number is provided, and the DIMM may or may not be identifiable, depending on the error.
Event Message: Correctable memory error logging disabled. Severity: Critical. Cause: This event is generated when the chipset in the ECC error correction rate exceeds a predefined limit.
BMC Watchdog Events
The BMC watchdog operations are perfor
86. disk. SAS and SATA are not supported on the same virtual disk. Action: None. LRA Number: None. Description (continued): ... disks is not supported in the same virtual disk.
Table 4-4. Storage Management Messages (continued)
2201 A global hot spare failed. Severity: Warning / Non-critical. Cause: The controller is not able to communicate with a disk that is assigned as a dedicated hot spare. The disk may have been removed. There may also be a bad or loose cable. Action: Check if the disk is healthy and that it has not been removed. Check the cables. If necessary, replace the disk and reassign the hot spare. Clear Alert Number: None. Related Alert Number: 2048. SNMP Trap Numbers: 903.
2202 A global hot spare has been removed. Severity: OK / Normal. Cause: The controller is unable to communicate with a disk that is assigned as a global hot spare. The disk may have been removed. There may also be a bad or loose cable. Action: Check if the disk is healthy and that it has not been removed. Check the cables. If necessary, replace the disk and reassign the hot spare. Clear Alert Number: None. Related Alert Number: None. SNMP Trap Numbers: 901.
2203 A
87. e LRA Number: 2070. SNMP Trap Numbers: 903.
Table 4-4. Storage Management Messages (continued)
2094 (cont'd) If this disk is a hot spare, then unassign the hot spare, perform the Prepare to Remove task on the disk, replace the disk, and assign the new disk as a hot spare. NOTICE: If this disk is part of a nonredundant disk, back up your data immediately. If the disk fails, you will not be able to recover the data.
2095 SCSI sense data. Severity: OK / Normal. Cause: A SCSI device experienced an error, but may have recovered. Action: None. Clear Alert Number: None. Related Alert Number: 2273. LRA Number: None. SNMP Trap Numbers: 751, 851, 901.
2098 Global hot spare assigned. Severity: OK / Normal. Cause: A user has assigned a physical disk as a global hot spare. This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: 2277. LRA Number: None. SNMP Trap Numbers: 901.
2099 Global hot spare unassigned. Severity: OK / Normal. Cause: A user has unassigned a physical disk as a global hot spare. This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 901.
2100 Temperature ... Severity: Warning ... Cause: The physical disk
88. e 2196 is a clear alert for alert 2195. Description (continued): ... Physical disk. Related Alert Number: None. LRA Number: None.
2197 Replace member operation has stopped for rebuild. Severity: OK / Normal. Cause: This alert is provided for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: 260. LRA Number: None. SNMP Trap Numbers: 903.
Table 4-4. Storage Management Messages (continued)
2198 The physical disk is too small to be used for Replace member operation. Severity: OK / Normal. Cause: This alert is provided for informational purposes. Replace member operation cannot be performed on the physical disk as the target disk is smaller than the source disk. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 903.
2199 The virtual disk cache policy has changed. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1201.
2200 Severity: Warning / Non-critical. Clear Alert Number: None. Related Alert Number: None. SNMP Trap Numbers: 903. Cause: This alert is provided for informational purposes. Replace member operation cannot be performed because the target physical disk is of a different type (SAS, SATA) from the rest of the virtual ... Description: Replace member operation is not possible as combination of SAS and SATA physical
89. e EventID: 1001 Server Administrator startup complete
Feb 6 14:21:21 server01 Server Administrator: Instrumentation Service EventID: 1254 Chassis intrusion detected Sensor location: Main chassis intrusion Chassis location: Main System Chassis Previous state was: OK (Normal) Chassis intrusion state: Open
Feb 6 14:21:51 server01 Server Administrator: Instrumentation Service EventID: 1252 Chassis intrusion returned to normal Sensor location: Main chassis intrusion Chassis location: Main System Chassis Previous state was: Critical (Failed) Chassis intrusion state: Closed
Viewing the Event Information
The event log for each operating system contains some or all of the following information:
Date: The date the event occurred.
Time: The local time the event occurred.
Type: A classification of the event severity: Information, Warning, or Error.
User: The name of the user on whose behalf the event occurred.
Computer: The name of the system where the event occurred.
Source: The software that logged the event.
Category: The classification of the event by the event source.
Event ID: The number identifying the particular event type.
Description: A description of the event. The format and contents of the event description vary depending on the event type.
Understanding the Event Description
Table 1-2 lists in alphabetical order each lin
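The sample log entries above follow a regular shape: a timestamp, a host name, the logging source, an event ID, and a free-form description. A short Python sketch of how such a line might be split into those fields is given below. It is illustrative only: the exact punctuation of the log line (the colons after "Server Administrator" and "EventID", and the HH:MM:SS time) is an assumption, because the extracted text of this guide has lost its punctuation, and LINE_RE, SAMPLE, and parse_event_line() are hypothetical names.

```python
import re

# Assumed layout: "<Mon> <day> <HH:MM:SS> <host> Server Administrator: <source>
# EventID: <id> <description>". The punctuation is an assumption of this sketch.
LINE_RE = re.compile(
    r"^(?P<month>\w{3})\s+(?P<day>\d{1,2})\s+(?P<time>\d{2}:\d{2}:\d{2})\s+"
    r"(?P<host>\S+)\s+Server Administrator:\s+(?P<source>.+?)\s+"
    r"EventID:\s*(?P<event_id>\d+)\s+(?P<description>.*)$"
)

SAMPLE = (
    "Feb 6 14:21:21 server01 Server Administrator: Instrumentation Service "
    "EventID: 1254 Chassis intrusion detected Sensor location: Main chassis "
    "intrusion Chassis location: Main System Chassis Previous state was: "
    "OK (Normal) Chassis intrusion state: Open"
)


def parse_event_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["event_id"] = int(fields["event_id"])  # numeric ID for lookups
    return fields
```

Calling parse_event_line(SAMPLE) yields the host, the source, the numeric event ID, and the description text, which can then be looked up in the event tables of this guide. Lines that do not fit the assumed layout return None rather than raising an error.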
90. Table 4-4. Storage Management Messages (continued)
2296 An EMM has been inserted. Severity: OK / Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 951.
2297 An EMM has been removed. Severity: Critical / Failure / Error. Cause: An EMM has been removed. Action: Replace the EMM. See the hardware documentation for information on replacing the EMM. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2091. SNMP Trap Numbers: 954.
2298 There is a bad sensor on an enclosure. Severity: Warning / Non-critical. Cause: The enclosure has a bad sensor. The enclosure sensors monitor the fan speeds, temperature probes, etc. Action: See the hardware documentation for more information. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2090. SNMP Trap Numbers: 853.
2299 Bad PHY %1. Severity: Critical / Failure / Error. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2091. SNMP Trap Numbers: 854. Cause: There is a problem with a physical connection or PHY. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation. Action:
91. e firmware on the EMMs is not the same version. EMM0 %1 EMM1 %2. Severity: Warning / Non-critical. Cause: The firmware on the EMM modules is not the same version. It is required that both modules have the same version of the firmware. This alert may be caused if you attempt to insert an EMM module that has a different firmware version than an existing module. The %1 and %2 indicate a substitution variable. The text for these substitution variables is displayed with the alert in the Alert Log and can vary depending on the situation. Action: Upgrade to the same version of the firmware on both EMM modules. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2090. SNMP Trap Numbers: 853.
2312 A power supply in the enclosure has an AC failure. Severity: Warning / Non-critical. Cause: The power supply has an AC failure. Action: Replace the power supply. Clear Alert Number: 2325. Related Alert Number: 2122, 2324. LRA Number: 2090. SNMP Trap Numbers: 1003.
Table 4-4. Storage Management Messages (continued)
2313 A power supply in the enclosure has a DC failure. Severity: Warning / Non-critical. Cause: The power supply has a DC failure. Action: Replace the power supply. Clear Alert Number: 2323. Related Alert Number: 2122, 2322. LRA Number: 2090. SNMP Trap Numbers: 1003.
2314 The initialization ... Severity: Critical / Failure ... SNMP Trap Numbers: 104. Cause: Storage Management is unable to
92. e enclosure object in the tree view and click the Health subtab. The Health subtab displays the status of the enclosure components. Verify that the controller has supported driver and firmware versions installed and that the EMMs are each running the same version of supported firmware.
2138 Enclosure alarm enabled. Severity: OK / Normal. Cause: This alert is for informational purposes. A user has enabled the enclosure alarm. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 851.
Table 4-4. Storage Management Messages (continued)
2139 Enclosure alarm disabled. Severity: OK / Normal. Cause: A user has disabled the enclosure alarm. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 851.
2140 Dead disk segments restored. Severity: OK / Normal. Cause: This alert is for informational purposes. Disk space that was formerly dead or inaccessible to a redundant virtual disk has been restored. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1201.
2141 Physical disk dead segments recovered. Severity: OK / Normal. Clear Alert Number: None. Related Alert Number: None. SNMP Trap Numbers: 901. Cause: This alert is for informational purposes. Portions of the physical disk were formerly inaccessible. The disk space from these dead segments has
93. e item that may appear in the event description.
Table 1-2. Event Description Reference
Description Line Item: Action performed was: <Action>. Explanation: Specifies the action that was performed, for example: Action performed was: Power cycle
Description Line Item: Action requested was: <Action>. Explanation: Specifies the action that was requested, for example: Action requested was: Reboot, shutdown OS first
Description Line Item: Additional Details: <Additional details for the event>. Explanation: Specifies additional details available for the hot plug event, for example: Memory device: DIMM1_A Serial number: FFFF30B1
Table 1-2. Event Description Reference (continued)
Description Line Item: <Additional power supply status information>. Explanation: Specifies information pertaining to the event, for example: Power supply input AC is off, Power supply POK (power OK) signal is not normal, Power supply is turned off
Description Line Item: Chassis intrusion state: <Intrusion state>. Explanation: Specifies the chassis intrusion state (open or closed), for example: Chassis intrusion state: Open
Description Line Item: Chassis location: <Name of chassis>. Explanation: Specifies name of the chassis that generated the message, for example: Chassis location: Main System Chassis
Description Line Item: Configuration error type: <type of configuration error>. Explanation: Specifies the type of configuration error that occurred, for example: Configuration error type: Revision mismatch
94. ent module (EMM) managing the power supply. Example: 2122 Redundancy degraded Power Supply 1 Controller 1 Connector 0 Target ID 6
Storage Object: SAS Power Supply. Message Format: Power Supply X Controller A Connector B Enclosure C. Example: 2312 A power supply in the enclosure has an AC failure Power Supply 1 Controller 1 Connector 0 Enclosure 2
Storage Object: SCSI Temperature Probe. Message Format: Temperature Probe X Controller A Connector B Target ID C, where C is the SCSI ID number of the EMM managing the temperature probe. Example: 2101 Temperature dropped below the minimum warning threshold Temperature Probe 1 Controller 1 Connector 0 Target ID 6
Storage Object: SAS Temperature Probe. Message Format: Temperature Probe X Controller A Connector B Enclosure C. Example: 2101 Temperature dropped below the minimum warning threshold Temperature Probe 1 Controller 1 Connector 0 Enclosure 2
Storage Object: SCSI Fan. Message Format: Fan X Controller A Connector B Target ID C, where C is the SCSI ID number of the EMM managing the fan. Example: 2121 Device returned to normal Fan 1 Controller 1 Connector 0 Target ID 6
Storage Object: SAS Fan. Message Format: Fan X Controller A Connector B Enclosure C. Example: 2121 Device returned to normal Fan 1 Controller 1 Connector 0 Enclosure 2
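The message formats above differ between SCSI and SAS configurations only in the last element: SCSI messages end with a Target ID (the SCSI ID of the managing EMM), while SAS messages end with an Enclosure number. The Python sketch below builds that location string for an enclosure component. The function name and parameters are hypothetical, and the output carries no separators between elements because none survive in this text; the original guide may use commas.

```python
def location_suffix(kind, index, controller, connector, last_id, sas):
    """Build the storage-object part of an alert message.

    kind      -- component name, e.g. "Fan" or "Power Supply"
    last_id   -- Enclosure number (SAS) or Target ID of the EMM (SCSI)
    sas       -- True for a SAS configuration, False for SCSI
    """
    last_label = "Enclosure" if sas else "Target ID"
    return (
        f"{kind} {index} Controller {controller} "
        f"Connector {connector} {last_label} {last_id}"
    )
```

For example, location_suffix("Fan", 1, 1, 0, 2, sas=True) produces the SAS Fan string from the example for alert 2121, and passing sas=False with a last_id of 6 produces the SCSI form.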
95. entation prior to Storage Management 2.1 or Dell OpenManage 5.1.
Alert Descriptions and Corrective Actions
The following sections describe alerts generated by the RAID or SCSI controllers supported by Storage Management. The alerts are displayed in the Server Administrator Alert subtab or through Windows Event Viewer. These alerts can also be forwarded as SNMP traps to other applications.
SNMP traps are generated for the alerts listed in the following sections. These traps are included in the Dell OpenManage Server Administrator Storage Management management information base (MIB). The SNMP traps for these alerts use all of the SNMP trap variables. For more information on SNMP support and the MIB, see the SNMP Reference Guide.
To locate an alert, scroll through the following table to find the alert number displayed on the Server Administrator Alert tab, or search this file for the alert message text or number. See "Understanding Event Messages" for more information on severity levels.
For more information regarding alert descriptions and the appropriate corrective actions, see the online help.
Table 4-4. Storage Management Messages
Columns: Event ID, Description, Severity, Cause and Action, Related Alert Information, SNMP Trap Numbers
2048 Device failed. Severity: Critical / Failure / Error. Clear Alert Number: 2121. SNMP Trap Numbers: 754, 804. Cause: A storage component such as a physical disk or an enclosure h
96. es. User is attempting to import a foreign virtual disk with a missing span. Action: None. Description (continued): ... with missing span. Clear Alert Number: None. Related Alert Number: None. LRA Alert Number: None.
2375 Attempted import of Virtual Disk with missing physical disk. Severity: OK / Normal. Cause: This alert is provided for informational purposes. User is attempting to import a foreign virtual disk with a missing physical disk. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Alert Number: None. SNMP Trap Numbers: 751.
Table 4-4. Storage Management Messages (continued)
2376 Attempted import of Virtual Disk with stale physical disk. Severity: OK / Normal. Cause: This alert is provided for informational purposes. User is attempting to import a foreign virtual disk with a stale physical disk. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Alert Number: None. SNMP Trap Numbers: 751.
2377 Attempted import of an orphan drive. Severity: OK / Normal. Cause: This alert is provided for informational purposes. User is attempting to import an orphan drive. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Alert Number: None. SNMP Trap Numbers: 751.
2378 Severity: OK / Normal. Clear Alert Number: None. SNMP Trap Numbers: 751. Cause: This alert is provided for informational purposes. User is attempting to import an ... Description: Attempted import of an incompatible physical dri
97. es the variable information is represented with a percent sign in the Storage Management documentation. An example of such an alert is shown for alert 2334 in Table 4-1.

Table 4-1. Alert Message Format

Alert ID: 2127
Message Text Displayed in the Storage Management Service Documentation: Background Initialization started
Message Text Displayed in the Alert Log with Variable Information Supplied: Background Initialization started: Virtual Disk 3 (Virtual Disk 3) Controller 1 (PERC 5/E Adapter)

Alert ID: 2334
Message Text Displayed in the Storage Management Service Documentation: Controller event log: %1
Message Text Displayed in the Alert Log with Variable Information Supplied: Controller event log: Current capacity of the battery is above threshold. Controller 1 (PERC 5/E Adapter)

The variables required to complete the message vary depending on the type of storage object and whether the storage object is in a SCSI or SAS configuration. The following table identifies the possible variables used to identify each storage object.

NOTE: Some alert messages relating to an enclosure or an enclosure component, such as a fan or EMM, are generated by the controller when the enclosure or enclosure component ID cannot be determined.

Table 4-2. Message Format with Variables for Each Storage Object
Columns: Storage Object, Message Variables

A, B, C and X, Y, Z in the following examples are variables representing the storage object name or number.

Controller
Message Format: Controller A (Name)
Message Format: Controller A
Example: 2326 A foreign c
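The percent-sign substitution described above can be sketched in a few lines of Python. This is only an illustration, not how Storage Management itself builds its log text: the helper name `fill_alert_template` is hypothetical, and the placeholder syntax (`%1`, `%2`, and so on) is an assumption.

```python
import re


def fill_alert_template(template: str, values: list[str]) -> str:
    """Replace %1, %2, ... placeholders with the supplied values.

    Hypothetical helper: the %N placeholder syntax is an assumption.
    """
    def substitute(match: re.Match) -> str:
        index = int(match.group(1)) - 1  # %1 refers to values[0]
        if index >= len(values):
            raise ValueError(f"no value supplied for {match.group(0)}")
        return values[index]

    # Only match %1 and higher; a bare % or %0 is left untouched.
    return re.sub(r"%([1-9]\d*)", substitute, template)


documented = "Controller event log: %1"
logged = fill_alert_template(
    documented, ["Current capacity of the battery is above threshold"]
)
print(logged)
```

The documented text stays fixed while the controller-supplied text varies, which is why searching the guide by alert number is more reliable than searching by the full logged message.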
98. everity Cause and Action Related Alert SNMP ID Information Trap Numbers 2125 Controller Warning Cause Virtual disk Clear Alert 753 cache controller was Number No preserved for disconnected during IO Related Alert mo or operation Namber No Orme Action Import foreign virtual disk disks if a Check if the 3 uae enclosure containing the virtual disk is disconnected from the controller 2126 SCSI sense Warning Cause A sector of the Clear Alert 903 sector Non physical disk is corrupted Number reassign critical and data cannot be None maintained on this portion Related Alert of the disk This alert is for N mber informational purposes None NOTICE Any data LRA Number residing on the None corrupt portion of the disk may be lost and you may need to restore your data from backup Action If the physical disk is part of a nonredundant virtual disk then back up the data and replace the physical disk NOTICE Removing a physical disk that is included in a nonredundant virtual disk will cause the virtual disk to fail and may cause data loss Storage Management Message Reference 121 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2126 If the disk is part of a conte redundant virtual disk then any data residing on the corrupt portion of the disk will be reallocated elsewhere in the virtual disk 2127
99. example a fan may N mber threshold have failed the thermostat None may be set too high or the room temperature maybe LRA Number too hot 2091 Action Check for factors that may cause overheating For example verify that the enclosure fan is working You should also check the thermostat settings and examine whether the enclosure is located near a heat source Make sure the enclosure has enough ventilation and that the room temperature is not too hot See the physical disk enclosure documentation for more diagnostic information Storage Management Message Reference 106 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2103 Temperature Critical Cause The physical disk Clear Alert 1054 dropped Failure enclosure is too cool Number below the Error Action Check if the None Cea thermostat setting is too Related Alert failure low and if the room Number 2112 threshold tem a perature is too cool LRA Number 2091 2104 Controller OK Cause This alert is for Clear Alert 1151 battery is Normal informational purposes Number 2105 recondition Action None Related Alert ng Number None LRA Number None 2105 Controller Ok Cause This alert is for Clear Alert 1151 battery Normal informational purposes Status Alert recondition Action Noie 2105 is a clear is completed i alert for alert 2104 Related Alert
100. for more information Action None 2327 The Warning Cause The NVRAM has Clear Alert 753 NVRAM has Non corrupted data This may Number corrupted critical occur after a power surge a None data The l battery failure or for other Related Alert controller is reasons The controller is Number 2266 reinitializing reinitializing the NVRAM the Action None The LPA ties NVRAM esac be 2060 controller is taking the required corrective action If this alert is generated often such as during each reboot replace the controller Storage Management Message Reference Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2328 The Warming Cause The NVRAM has Clear Alert 753 NVRAM has Non corrupt data The Number corrupt data critical controller is unable to None correct the situation Related Alert Action Replace the Number controller None LRA Number 2060 2329 SAS port Waring Cause The text for this Clear Alert 753 report 1 Non alert is generated by the Number critical controller and can vary None depending on ihe ine Related Alert situation The 701 indicates N mber a substitution variable The None text for this substitution variable is generated by the LRA Number 2060 controller and is displayed with the alert in the Alert Log This text can vary depending on the situation Action Make sure the cables are
101. formance Board degraded <description of why> was asserted

Entity Presence Events

The entity presence messages are used for detecting different hardware devices.

Table 3-17. Entity Presence Events
Columns: Description, Severity, Cause

<Device Name> presence was asserted
Severity: Information
Cause: This event is generated when the device was detected.

<Device Name> absent was asserted
Severity: Critical
Cause: This event is generated when the device was not detected.

Storage Management Message Reference

The Dell OpenManage Server Administrator Storage Management's alert or event management features let you monitor the health of storage resources such as controllers, enclosures, physical disks, and virtual disks.

Alert Monitoring and Logging

The Storage Management Service performs alert monitoring and logging. By default, the Storage Management Service starts when the managed system starts up. If you stop the Storage Management Service, then alert monitoring and logging stops. Alert monitoring does the following:

• Updates the status of the storage object that generated the alert.
• Propagates the storage object's status to all the related higher objects in the storage hierarchy. For example, the status of a lower-level object will be propagated up to the status displayed on the Health tab for the top-level Storage object.
• Logs an alert in the Alert log and
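The status propagation described above can be pictured as a roll-up over a tree of storage objects. The following is a minimal sketch under stated assumptions: the class name `StorageObject`, the three status labels, and the "worst status wins" rule are illustrative choices, not taken from the Storage Management implementation.

```python
from dataclasses import dataclass, field

# Assumed ordering: a higher rank means a worse status.
SEVERITY_RANK = {"OK": 0, "Warning": 1, "Critical": 2}


@dataclass
class StorageObject:
    """Hypothetical node in the storage hierarchy."""
    name: str
    status: str = "OK"
    children: list["StorageObject"] = field(default_factory=list)

    def rolled_up_status(self) -> str:
        """Return the worst status among this object and its descendants."""
        statuses = [self.status]
        statuses += [child.rolled_up_status() for child in self.children]
        return max(statuses, key=SEVERITY_RANK.__getitem__)


disk = StorageObject("Physical Disk 0:1", status="Critical")
enclosure = StorageObject("Enclosure 2", children=[disk])
controller = StorageObject("Controller 1", children=[enclosure])
storage = StorageObject("Storage", children=[controller])

print(storage.rolled_up_status())  # Critical
```

Computing the roll-up on demand keeps each object's own status separate from the status it inherits from lower-level objects.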
102. from the disk group, either replace the disk or restore the original disk. You can identify which disk has been removed by locating the disk that has a red X for its status. Perform a rescan after replacing the disk.

2058 Virtual disk check consistency started
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2085
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 1201

2059 Virtual disk format started
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2086
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 1201

2060 Copy of data started on physical disk %1 from physical disk %2
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: 2075
LRA Number: None
SNMP Trap Numbers: 1201

2061 Virtual disk initialization started
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2088
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 1201

2062 Physical disk initialization started 2063 Virtual dis
103. gt Fan Redundancy sensor redundancy lost Critical The fan specified by lt Sensor Name Location gt may have failed and hence the redundancy that was degraded previously has been lost lt Sensor Name Location gt Fan Redundancy sensor redundancy regained Information The fan specified by lt Sensor Name Location gt may have started functioning again and hence the redundancy has been regained 60 System Event Log Messages for IPMI Systems Processor Status Events The processor status messages monitor the functionality of the processors in a system These messages provide processor health and warning information of a system Table 3 4 Processor Status Events Event Message Severity Cause lt Processor Entity gt status Critical processor sensor IERR where lt Processor Entity gt is the processor that generated the event For example PROC for a single processor system and PROC for multiprocessor IERR internal error generated by the lt Processor Entity gt system lt Processor Entity gt status Critical The processor generates this processor sensor Thermal event before it shuts down Trip because of excessive heat caused by lack of cooling or heat synchronization lt Processor Entity gt status Information This event is generated when a processor sensor recovered processor recovers from the from IERR internal error lt Processor Entity gt status
104. hange History Storage Management 2 1 Comments Product Versions to which Changes Apply Storage Management 2 1 Server Administrator 2 4 Dell OpenManage 5 1 New Alerts 2062 see note The alert numbers for the new alerts 2173 2062 2260 were previously unassigned 2195 Alert numbers 2370 and 2371 are new 2196 K NOTE Alerts 2062 and 2260 2212 were previously undocumented 2213 in the Storage Management 2214 online help Dell OpenManage Server Administrator Storage 2215 Management User s Guide and 2260 see note the Dell OpenManage Server 2370 Administrator Messages Reference Guide 2371 Modified Alerts 2049 2050 2051 2052 2065 The term array disk has been 2074 2080 2083 2089 2092 changed to physical disk 2141 2158 2249 2251 2252 throughout Storage Management 2255 2269 2270 2274 2303 This change affects the message text 2305 2309 2361 2362 2363 of the modified alerts Obsolete Alerts 2160 2160 replaced by 2195 2161 2161 replaced by 2196 Documentation Documentation updated to Starting with Dell OpenManage 5 0 Changes indicate clear alert status Array Manager is no longer an Reference to SNMP trap variables removed Corresponding Array Manager event numbers removed see comments installable option If you have an Array Manager installation and wish to see how the Array Manager events correspond to the Storage Management alerts refer to the product docum
105. hassis gt and the number of devices Previous redundancy required for full redundancy state was lt State gt are provided Event Message Reference Table 2 8 Redundancy Unit Messages continued Event Description Severity Cause ID 1304 Redundancy regained Information A redundancy sensor in the Redundancy nit specified system detected that Redundancy locaton a lost redundancy device has in ohassies been reconnected or replaced full redundancy is in effect Chassis location The redundancy unit location lt Name of chassis gt chassis location previous Previous redundancy redundancy state and the state was lt State gt number of devices required for full redundancy are provided 1305 Redundancy degraded Warning A redundancy sensor in Redundancy unit the specified system detected lt Redundancy location that one of the components of arene ee tas the redundancy unit has failed but the unit is still redundant Chassis location The redundancy unit location lt Name of chassis gt chassis location previous Previous redundancy redundancy state and state was lt State gt the number of devices required for full redundancy are provided 1306 Redundancy lost Error A redundancy sensor in the Redundancy unit lt Redundancy location in chassis gt Chassis location lt Name of chassis gt Previous redundancy state was lt State gt specified system detected that one of the components in the redunda
106. hat has a red X for its status Perform a rescan after replacing the disk When performing a consistency check be aware that the consistency check can take along time The time it takes depends on the size of the physical disk or the virtual disk Storage Management Message Reference 95 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2070 Virtual disk OK Cause The virtual disk Clear Alert 1201 initialization Normal initialization cancelled Number cancelled because a physical disk None included in the virtual disk Related Alert has failed or because auser Number cancelled the virtual disk one initialization i i LRA Number Action If a physical disk None failed then replace the physical disk You can identify which disk has failed by locating the disk that has a red X for its status Perform a rescan after replacing the disk Restart the format physical disk operation Restart the virtual disk initialization 2074 Physical disk OK Cause A user has cancelled Clear Alert 901 rebuild Normal the rebuild operation Number cancelled Action Restart the rebuild None operation Related Alert Number None LRA Number None 2075 Copyofdata Ok Cause This alert is Clear Alert 1201 completed Normal provided for informational Number on physical Taforma PUTPoses None disk 2 from tional Act
107. he Microsoft Windows 2000 Advanced Server and Windows Server 2003 operating systems, messages are logged to the system event log and optionally to a unicode text file, dcsys32.log (viewable using Notepad), that is located in the install_path\omsa\log directory. The default install_path is C:\Program Files\Dell\SysMgt.

In the Red Hat Enterprise Linux and SUSE Linux Enterprise Server operating systems, messages are logged to the system log file. The default name of the system log file is /var/log/messages. You can view the messages file using a text editor such as vi or emacs.

NOTE: Logging messages to a unicode text file is optional. By default, the feature is disabled. To enable this feature, modify the Event Manager section of the dcemdy32.ini file as follows:

• In Windows, locate the file at <install_path>\dataeng\ini and set UnitextLog.enabled=True. The default install_path is C:\Program Files\Dell\SysMgt. Restart the DSM SA Event Manager service.
• In Red Hat Enterprise Linux and SUSE Linux Enterprise Server, locate the file at <install_path>/dataeng/ini and set UnitextLog.enabled=True. The default install_path is /opt/dell/srvadmin. Issue the /etc/init.d/dataeng restart command to restart the Server Administrator event manager service. This will also restart the Server Administrator data manager and SNMP services.

The following subsections explain how to open the Windows 2000 Advanced Server Windows Server 20
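As a rough illustration of the edit described above, the sketch below flips the setting in the text of an ini file. It assumes the setting is stored as a single `UnitextLog.enabled=<value>` line; the exact key spelling, the separator, and the helper name `enable_unitext_log` are assumptions. It works on a string so it can be tried without touching a real installation, and the event manager service still has to be restarted afterwards as described.

```python
def enable_unitext_log(ini_text: str) -> str:
    """Return ini_text with the UnitextLog.enabled key set to True.

    Hypothetical helper: assumes a 'UnitextLog.enabled=<value>' line exists.
    """
    key = "UnitextLog.enabled"
    lines = ini_text.splitlines()
    for i, line in enumerate(lines):
        name, sep, _ = line.partition("=")
        if sep and name.strip() == key:
            lines[i] = f"{key}=True"
            return "\n".join(lines) + "\n"
    # Refuse to guess where the key belongs if it is not already present.
    raise KeyError(f"{key} not found")


sample = "UnitextLog.enabled=False\n"
print(enable_unitext_log(sample), end="")
```

Raising an error when the key is missing avoids appending the setting to the wrong section of the file.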
108. he sensor chassis gt location chassis Previous state was lt State gt location previous Battery sensor status state and battery sensor status are lt status gt provided 1704 Battery sensor detected a Error A battery sensor in failure value the specified system Sensor Location lt Location in on ne ea chasals a attery has ailed The sensor location Chassis Location lt Name of chassis location chassis gt previous state and Previous state was lt State gt battery sensor status Battery sensor status arg provided lt status gt 1705 Battery sensor detected a Error A battery sensor in non recoverable value the specified system Sensor Location lt Location in aes ion i hasas as battery has failed The sensor location Chassis Location lt Name of chassis location chassis gt previous state and Previous state was lt State gt battery sensor status ided Battery sensor status peace ee lt status gt Event Message Reference System Event Log Messages for IPMI Systems The following tables list the system event log SEL messages their severity and cause Z NOTE For corrective actions see the appropriate documentation Temperature Sensor Events The temperature sensor event messages help protect critical components by alerting the systems management console when the temperature rises inside the chassis These event messages use additional variables such as sensor location chassis location previo
109. ine whether the enclosure is located near a heat source Make sure the enclosure has enough ventilation and that the room temperature is not too hot or too cold See the enclosure documentation for more diagnostic information 2114 A OK Cause The check Clear Alert 1201 consistency Normal consistency operation on a Number 2115 check on a virtual disk was paused bya Related Alert virtual disk user Number has aa Action To resume the None EE eae nse X LRA Number peration right click the None 114 virtual disk in the tree view and select Resume Check Consistency Storage Management Message Reference Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2115 A OK Cause This alert is for Clear Alert 1201 consistency Normal informational purposes Status Alert check on a The check consistency 2115 is a clear virtual disk operation on a virtual disk alert for alert has been has resumed processing 2114 resumed after being paused by Related Alert LET Number Action None None LRA Number None 2116 A virtual OK Cause This alert is for Clear Alert 1201 disk and its Normal informational purposes A Number mirror have user has caused a mirrored None been split virtual disk to be split Related Alert W hen a virtual disk is N rnbei mirrored its data is copied Nang to another virtual disk in order to ma
110. intain LRA Number redundancy After being None split both virtual disks retain a copy of the data although because the mirror is no longer intact updates to the data are no longer copied to the mirror Action None Storage Management Message Reference 115 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2117 Amirrored OK Cause This alert is for Clear Alert 1201 virtual disk Normal informational purposes A Number has been user has caused a mirrored None unmirrored virtual disk to be Related Alert unmuirrored When a Number virtual disk is mirrored its Note data is copied to another virtual disk in order to LRA Number maintain redundancy After None being unmirrored the disk formerly used as the mirror returns to being a physical disk and becomes available for inclusion in another virtual disk Action None 2118 The write OK Cause This alert is for Clear Alert 1201 policy Normal informational purposes Number change write A user has changed None policy the write policy for a Related Alert virtual disk Namber Action None None LRA Number None 116 Storage Management Message Reference Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2120 Enclosure Warning Cause The firmware on Clear Alert
111. ion None Related Alert physical disk Number 2060 z LRA Number None 96 Storage Management Message Reference Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2076 Virtual disk Critical Cause A physical disk Clear Alert 1204 Check Failure included in the virtual disk Number Consistency Error failed or there is an error in None failed the parity information A Related Alert failed physical disk can Number cause errors in parity None information LRA Number Action Replace the failed 59 physical disk You can identify which disk has failed by locating the disk that has a red X for its status Rebuild the physical disk When finished restart the check consistency operation 2077 Virtual disk Critical Cause A physical disk Clear Alert 1204 format failed Failure included in the virtual disk Number Error failed None Action Replace the failed Related Alert physical disk You can Number identify which physical disk None has failed by locating the TRA Number disk that has a red X for 2081 its status Rebuild the physical disk When finished restart the virtual disk format operation Storage Management Message Reference 97 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2079 Virtual disk
112. iption Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2171 The Warning Cause The battery may be Clear Alert 1153 controller Non recharging the room Number 2172 battery critical temperature may be too Rotated Alert temperature hot or the fan in the Number is above system may be degraded or None normal failed LRA Number Action If this alert was 2100 generated due to a battery recharge the situation will correct when the recharge is complete You should also check if the room temperature is normal and that the system components are functioning properly 2172 The OK Cause This alert is for Clear Alert 1151 controller Normal informational purposes Status Alert battery Acton None 2172 is a clear temperature alert for alert is normal 2171 Related Alert Number None LRA Number None Storage Management Message Reference 138 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2173 Unsupported Waming Cause The EMMs inthe Clear Alert 853 configuratio Non enclosure have a different Number n detected critical SCSI rate This is an None The SCSI unsupported configuration Related Alert rate of the All EMMs in the enclosure Number enclosure should have the same SCSI None managemen rate The percent sign t modules indicates a substitution LRA Number EMM s is variable The text for
113. isk is degraded Clear Alert 903 warning Non and has received a SMART Number degraded critical alert predictive failure None The disk is likely to fail in Related Alert the near future N mber Action Replace the disk None that has received the LRA Number SMART alert If the 2070 physical disk is a member of a non redundant virtual disk then back up the data before replacing the disk NOTICE Removing a physical disk that is included in a non redundant virtual disk will cause the virtual disk to fail and may cause data loss 2111 Failure Warning Cause A disk has received Clear Alert 903 prediction Non a SMART alert predictive Number threshold critical failure due to test None exceeded conditions Related Alert due to test 7 Action None Number No action None needed LRA Number 2070 Storage Management Message Reference 113 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2112 Enclosure Critical Cause The physical disk Clear Alert 854 was shut Failure enclosure is either hotter or Number down Error cooler than the maximum None or minimum allowable Related Alert temperature range Number Action Check for factors None that may cause overheating 1 RA Number or excessive cooling For 209 example verify that the enclosure fan is working You should also check the thermostat settings and exam
114. k after replacing the physical disk After replacing the physical disk run Check Consistency to check the data 2341 The Check OK Cause This alert is for Clear Alert 1201 Consistency Normal informational purposes Number made Action None None corrections Related Alert and Number completed None LRA Number None Storage Management Message Reference 203 204 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2342 The Check Warming Cause The data ona Clear Alert 1203 Consistency Non source disk and the Number found critical redundant data on a target None inconsistent disk is inconsistent Related Alert parity data Action Restart the Check Number 2341 ae F Consistency task If you 2343 A receive this alert again LRA Number i check the health of the 2080 physical disks included in the virtual disk Review the alert messages for significant alerts related to the physical disks If you suspect that a physical disk has a problem replace it and restore from backup 2343 The Check Warning Cause The Check Clear Alert 1203 Consistency Non Consistency can no longer Number logging of critical report errors in the parity None inconsistent data Related Alert parity data is Action See the hardware Number disabled documentation for more None information LRA Number 2080 Storage Management Message Reference
115. k reconfiguration started OK Normal OK Normal Cause This alert is for informational purposes Action None Cause This alert is for informational purposes Action None Clear Alert Number 2089 Related Alert Number None LRA Number None Clear Alert Number 2090 Related Alert Number None LRA Number None 901 1201

2064 Virtual disk rebuild started
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2091
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 1201

2065 Physical disk rebuild started
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2092
Related Alert Number: 2099, 2121, 2196
LRA Number: None
SNMP Trap Numbers: 901

2067 Virtual disk check consistency cancelled
Severity: OK / Normal
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 1201
Cause: The check consistency operation cancelled because a physical disk in the array has failed or because a user cancelled the check consistency operation.
Action: If the physical disk failed, then replace the physical disk. You can identify which disk failed by locating the disk t
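The Clear Alert Number column pairs an alert with the later alert that clears it. A small sketch of how a monitoring script might use that pairing follows; the function name `open_alerts` and the idea of replaying a list of received alert numbers are assumptions for illustration, and only the two pairings listed for alerts 2064 and 2065 are included.

```python
# Clear Alert Numbers for alerts 2064 and 2065, as listed in Table 4-4.
CLEARED_BY = {2064: 2091, 2065: 2092}


def open_alerts(received: list[int]) -> set[int]:
    """Return alert numbers that were raised and have not been cleared yet."""
    clears = {clear: raised for raised, clear in CLEARED_BY.items()}
    active: set[int] = set()
    for number in received:
        if number in CLEARED_BY:
            active.add(number)
        elif number in clears:
            active.discard(clears[number])
    return active


print(open_alerts([2064, 2065, 2092]))  # {2064}
```

Here the physical disk rebuild (2065) has been cleared by 2092, while the virtual disk rebuild (2064) is still waiting for 2091.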
116. le None and Replace Related Alert Member and Number Auto None Replace Member LRA Number operation on None Predictive Failure changed 2229 Abort Check OK Cause This alert is for Clear Alert 751 Consistency Normal informational purposes Number on Error and Acton None None Auto Related Alert Replace Number Member None operation on Predictive LRA Number Failure None changed 2230 Auto OK Cause This alert is for Clear Alert 751 Replace Normal informational purposes Number Member Action None None operation on Related Alert Predictive Nu tmb r Failure None changed LRA Number None 160 Storage Management Message Reference Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2231 Allow OK Cause This alert is for Clear Alert 751 Revertible Normal informational purposes Number Hot Spare Action Nene None and Replace Related Alert Member and Number Abort Check None Consistency Enoi LRA Number changed None 2232 The OK Cause This alert is for Clear Alert 751 controller Normal informational purposes Number alarm is Action None None silenced Related Alert Number None LRA Number None 2233 The OK Cause This alert is for Clear Alert 751 background Normal informational purposes Number initialization Action Nene None BGI rate Related Alert has changed Naber No
117. le device 53 70 power supply 0 62 processor sensor 50 processor status 61 r2 generated system 74 redundancy unit 36 storage management 87 temperature sensor 19 57 voltage sensor 26 58 Minimum temperature probe warning threshold value changed 131 Multi bit ECC error 181 Multiple enclosures are attached to the controller This is an unsupported configuration 147 P Patrol Read found an uncorrectable media error 174 Physical disk dead segments recovered 127 Physical disk degraded 89 Physical disk initialization started 94 230 Index Physical disk initialize completed 101 Physical disk initialize failed 98 Physical disk inserted 89 Physical disk offline 89 Physical disk online 132 Physical disk rebuild cancelled 96 Physical disk rebuild completed 102 Physical disk rebuild failed 100 Physical disk rebuild started 94 Physical disk removed 88 Physical disk s have been removed from a virtual disk The virtual disk will be in Failed state during the next system reboot 211 Physical disk s that are part of a virtual disk have been removed while the system was shut down This removal was discovered during system start up 210 pluggable device sensor 10 Power And Performance Events 76 Power supply detected a failure 42 Power supply detected a warning 42 65 Power Supply Events 62 power supply messages 40 62 Power supply returned to nor
118. lert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Numbers: 1201.

2245 A virtual disk blink has ceased
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 1201

2246 The controller battery is degraded
Severity: Warning / Non-critical
Cause: The controller battery charge is weak.
Action: As the charge weakens, the charger should automatically recharge the battery. If the battery has reached its recharge limit, replace the battery pack. Monitor the battery to make sure that it recharges successfully. If the battery does not recharge, replace the battery pack.
Clear Alert Number: None
Related Alert Number: None
LRA Number: 2100
SNMP Trap Numbers: 1153

2247 The controller battery is charging
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2358
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 1151

2248 The OK Cause This alert is for Clear Alert 1151 controller Normal informational purposes Number battery s Action None None executing a Related Alert Learn cycle Naniber None LRA
119. les A, B, C and X, Y, Z in the following examples are variables representing the storage object name or number.

SCSI EMM
Message Format: EMM X Controller A Connector B Target ID C, where C is the SCSI ID number of the EMM
Example: 2121 Device returned to normal EMM 1 Controller 1 Connector 0 Target ID 6

SAS EMM
Message Format: EMM X Controller A Connector B Enclosure C
Example: 2121 Device returned to normal EMM 1 Controller 1 Connector 0 Enclosure 2

Alert Message Change History

The following table describes changes made to the Storage Management alerts from the previous release of Storage Management to the current release.

Table 4-3. Alert Message Change History

Alert Message Change History: Storage Management 3.0

Product Versions to which Changes Apply: Storage Management 3.0, Server Administrator 5.5, Dell OpenManage 5.5

New Alerts: 2060, 2075, 2087, 2125, 2183, 2184, 2185, 2190, 2197, 2198, 2200, 2210, 2216, 2217, 2218, 2219, 2220, 2221, 2222, 2223, 2224, 2225, 2226, 2227, 2228, 2229, 2230, 2231, 2236, 2237, 2257, 2258, 2381

Modified Alerts: 2060, 2075, 2087
Comments: Updated the alert description and changed the SNMP trap number to 1201.

Obsolete Alerts: None

Documentation Changes: Documentation updated to reflect change in SNMP t
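The two EMM message formats above differ only in their last field: a SCSI EMM ends with a Target ID and a SAS EMM ends with an Enclosure number. The sketch below builds the object portion of such a message; the function name `emm_identifier` and its keyword arguments are hypothetical, chosen only to mirror the X, A, B, and C variables.

```python
def emm_identifier(emm, controller, connector, *, target_id=None, enclosure=None) -> str:
    """Build 'EMM X Controller A Connector B ...' for a SCSI or SAS EMM.

    Hypothetical helper: pass target_id for SCSI or enclosure for SAS.
    """
    if (target_id is None) == (enclosure is None):
        raise ValueError("give exactly one of target_id (SCSI) or enclosure (SAS)")
    if target_id is not None:
        last = f"Target ID {target_id}"
    else:
        last = f"Enclosure {enclosure}"
    return f"EMM {emm} Controller {controller} Connector {connector} {last}"


print(emm_identifier(1, 1, 0, target_id=6))
print(emm_identifier(1, 1, 0, enclosure=2))
```

Both calls reproduce the object text shown in the 2121 examples for the SCSI and SAS configurations.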
120. location <Name of chassis> Previous state was <State> Chassis intrusion state <Intrusion state> in the specified system detected an error from which it cannot recover. The sensor location, chassis location, previous state, and chassis intrusion state are provided.

Chassis Management Controller Messages

Alerts sent by Dell's M1000e Chassis Management Controller (CMC) are organized by severity. That is, the event ID of the CMC trap indicates the severity (informational, warning, critical, or non-recoverable) of the alert. Each CMC alert includes the originating system name, location, and event message text. The alert message text matches the corresponding Chassis Event Log message text that is logged by the sending CMC for that event. Table 2-7 lists the Chassis Management Controller messages.

Table 2-7. Chassis Management Controller Messages
Columns: Event ID, Description, Severity, Cause

2000 CMC generated a test trap
Severity: Informational
Cause: A user-initiated test trap was issued through the CMC GUI or racadm CLI.

2002 CMC reported a return to normal or informational
Severity: Informational
Cause: CMC informational event as described in the drsCAMessage variable binding supplied with the alert.

2003 CMC reported a warning
Severity: Warning
Cause: CMC warning event as described in the drsCAMessage variable supplied with the alert.

2004 CMC reported a Critical CMC critical event as critical event described
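Because the CMC event ID itself encodes the severity, a trap receiver can classify an alert before reading the message text. The lookup below is a minimal sketch that covers only the four event IDs listed in this part of Table 2-7; the names `CMC_SEVERITY` and `cmc_severity` are hypothetical, and any other ID is reported as unknown rather than guessed.

```python
# Event ID to severity, limited to the rows of Table 2-7 shown above.
CMC_SEVERITY = {
    2000: "Informational",
    2002: "Informational",
    2003: "Warning",
    2004: "Critical",
}


def cmc_severity(event_id: int) -> str:
    """Return the severity for a CMC trap event ID, or 'Unknown'."""
    return CMC_SEVERITY.get(event_id, "Unknown")


for event_id in (2000, 2003, 2004, 9999):
    print(event_id, cmc_severity(event_id))
```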
121. location lt Name of exceeded its failure chassis gt threshold The sensor Previous state was lt State gt location chassis location previous If sensor type is not P i state and discrete temperature sensor Temperature sensor value in value are provided degrees Celsius lt Reading gt If sensor type is discrete Discrete temperature state lt State gt 1055 Temperature sensor detected Error A temperature sensor a non recoverable value on the backplane Sensor location lt Location son system poi Jr chaasies or rive carrier 1n the specified system Chassis location lt Name of detected an error from chassis gt which it cannot Previous state was lt State gt recover The sensor location chassis If sensor type is not ae i i location previous discrete state and Temperature sensor value in temperature sensor degrees Celsius lt Reading gt value are provided If sensor type is discrete Discrete temperature state lt State gt Event Message Reference 22 Cooling Device Messages Cooling device sensors listed in Table 2 3 monitor how well a fan is functioning Cooling device messages provide status and warning information for fans in a particular chassis Table 2 3 Cooling Device Messages Event Description Severity Cause ID 1100 Fan sensor has Information A fan sensor in the specified failed system is not functioning The Sensor location lt Location in chassis gt Chassis location lt
122. location of the redundant power supply or cooling unit in the chassis for example Redundancy unit Fan Enclosure Sensor location lt Location in chassis gt Specifies the location of the sensor in the specified chassis for example Sensor location CPU1 Temperature sensor value lt Reading gt Specifies the temperature in degrees Celsius for example Temperature sensor value in degrees Celsius 30 Voltage sensor value in Volts lt Reading gt Specifies the voltage sensor value in volts for example Voltage sensor value in Volts 1 693 16 Introduction Event Message Reference The following tables lists in numerical order each event ID and its corresponding description along with its severity and cause Z NOTE For corrective actions see the appropriate documentation Miscellaneous Messages Miscellaneous messages in Table 2 1 indicate that certain alert systems are up and working Table 2 1 Miscellaneous Messages Event Description Severity Cause ID 0000 Log was cleared Information User cleared the log from Server Administrator 0001 Log backup created Information The log was full copied to backup and cleared 1000 Server Administrator Information Server Administrator is starting beginning to initialize 1001 Server Administrator Information Server Administrator startup complete completed its initialization 1002 A system BIOS update Information The
123. lure task has encountered an Number uncorrectabl Error error that cannot be None e media corrected There may be a Related Alert error bad disk block that cannot Number be remapped None Action Back up your data ERA Number If you are able to back up 207 the data successfully then fully initialize the disk and then restore from back up 2273 Ablockon Critical Cause The controller Clear Alert 904 the physical Failure encountered an Number disk has Error unrecoverable medium None been error when attempting to Related Alert punctured read a block on the physical Number by the disk and marked that block None controller as invalid If the controller encountered the LRA Number unrecoverable medium 2071 error on a source physical disk during a rebuild or reconfigure operation it will also puncture the corresponding block on the target physical disk The invalid block will be cleared on a write operation Action Back up your data If you are able to back up the data successfully then fully initialize the disk and then restore from back up Storage Management Message Reference Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2274 The physical OK Cause This alert is for Clear Alert 901 disk rebuild Normal informational purposes Number has Action None None resumed Related Alert Number None LRA Number
124. mal 41 65 power supply sensor 9 Power supply sensor detected a non recoverable value 43 Power supply sensor has failed 40 Power supply sensor value unknown 41 Predictive Failure reported 103 Problems with the battery or the battery charger have been detected The battery health is poor 192 processor sensor 10 Processor sensor detected a failure value 52 70 Processor sensor detected a non recoverable value 52 Processor sensor detected a warning value 51 70 Processor sensor has failed 50 70 Processor sensor returned to a normal state 51 70 Processor sensor value unknown 50 70 Processor Status Events 61 processor status messages 61 r2 generated system messages 74 Rebuild completed with errors 134 Rebuild not possible as SAS SATA is not supported in the same virtual disk 212 Recharge count maximum exceeded 155 Redundancy degraded 39 118 Redundancy is offline 38 Redundancy lost 39 119 Redundancy normal 120 Redundancy not applicable 38 64 Redundancy regained 39 Redundancy sensor has failed 37 Redundancy sensor value unknown 38 64 redundancy unit messages 36 redundancy unit sensor 9 S SAS expander error 1 209 SAS port report 1 197 198 SAS SMP communications error 1 208 Index 231 SCSI sense data 104 SCSI sense sector reassign 121 See the Readme file for a list of validated controller driver versions
125. med out 142 The controller battery Learn cycle will start in days 143 The controller battery needs to be replaced 137 The controller battery temperature is above normal 138 The controller battery temperature is above normal 154 The controller battery temperature is normal 138 The controller cache has been discarded 145 The controller debug log file has been exported 162 The controller has recovered cached data from the BBU 202 The controller is unable to recover cached data from the battery backup unit BBU 201 The controller reconstruct rate has changed 172 The controller write policy has been changed to Write Back 146 234 Index The controller write policy has been changed to Write Through 146 The current kernel version and the non RAID SCSI driver version are older than the minimum required levels See the Readme file for a list of validated kernel and driver versions 136 The DC power supply is switched off 194 The dedicated hot spare is too small 175 The EMM has failed 183 The enclosure cannot support both SAS and SATA physical disks Physical disks may be disabled 186 The enclosure has a hardware error 186 The enclosure is not responding 186 The enclosure is unstable 185 The enclosure temperature has returned to normal 207 The factory default settings have been restored 202 The firmware on the EMMs is not the same
126. med when the system hangs or crashes. These messages monitor the status and occurrence of these events in a system. Table 3-7. BMC Watchdog Events BMC OS Watchdog timer expired. Severity: Information. Cause: This event is generated when the BMC watchdog timer expires and no action is set. BMC OS Watchdog performed system reboot. Severity: Critical. Cause: This event is generated when the BMC watchdog detects that the system has crashed (timer expired because no response was received from Host) and the action is set to reboot. 64 System Event Log Messages for IPMI Systems Table 3-7. BMC Watchdog Events (continued) BMC OS Watchdog performed system power off. Severity: Critical. Cause: This event is generated when the BMC watchdog detects that the system has crashed (timer expired because no response was received from Host) and the action is set to power off. BMC OS Watchdog performed system power cycle. Severity: Critical. Cause: This event is generated when the BMC watchdog detects that the system has crashed (timer expired because no response was received from Host) and the action is set to power cycle. Memory Events The memory modules can be configured in different ways in particular systems. These messages monitor the status, warning, and configuration information about the memory modules in the system. Table 3-8. Memory Events Event Message Severity Cause Memory RAID Information
127. n 208 Storage Management Message Reference Table 4-4. Storage Management Messages (continued) 2357 SAS expander error: %1. Severity: Critical/Failure/Error. Cause: The %1 indicates a substitution variable. The text for this substitution variable is generated by the firmware and is displayed with the alert in the Alert Log. This text can vary depending on the situation. Action: There may be a problem with the enclosure. Check the health of the enclosure and its components by selecting the enclosure object in the tree view. The Health subtab displays a red X or yellow exclamation point for enclosure components that are failed or degraded. See the enclosure documentation for more information. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2061. SNMP Trap Number: 754. 2358 The battery charge cycle is complete. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1151. Storage Management Message Reference 209 Table 4-4. Storage Management Messages (continued) 2359 The physical disk is not certified. Severity: Warning/Non-critical. Clear Alert Number: None. SNMP Trap Number: 903. Cause: The physical disk does not comply with the standards set by Dell and is not s
128. n Check the health of the enclosure and its components. Replace any hardware that is in a Failed state. See the hardware documentation for more information. Related Alert Number: None. LRA Number: 2091. 2303 The enclosure cannot support both SAS and SATA physical disks. Physical disks may be disabled. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 851. 186 Storage Management Message Reference Table 4-4. Storage Management Messages (continued) 2304 An attempt to hot plug an EMM has been detected. This type of hot plug is not supported. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: 2211. LRA Number: None. SNMP Trap Number: 751. 2305 The physical disk is too small to be used for a rebuild. Severity: Warning/Non-critical. Clear Alert Number: None. Related Alert Number: 2326. LRA Number: 2070. SNMP Trap Number: 903. Cause: The physical disk is too small to rebuild the data. Action: Remove the physical disk and insert a new physical disk that is the same size or larger than the disk that is being rebuilt. The new physical disk must also use the same technology (for example, SAS or SATA) as the disk being rebuilt. If the rebuild does not start automatically after you have inserted a suitable physical disk, then run the Rebuild
129. n for the utility that ran the diagnostics for more information. 2318 Problems with the battery or the battery charger have been detected. The battery health is poor. Severity: Warning/Non-critical. Cause: The battery or the battery charger is not functioning properly. Action: Replace the battery pack. Clear Alert Number: None. Related Alert Number: 2188. LRA Number: 2100. SNMP Trap Number: 1154. 192 Storage Management Message Reference Table 4-4. Storage Management Messages (continued) 2319 Single bit ECC error. The DIMM is degrading. Severity: Warning/Non-critical. Cause: The DIMM is beginning to malfunction. Action: Replace the DIMM to avoid data loss or data corruption. The DIMM is a part of the controller battery pack. See your hardware documentation for information on replacing the DIMM. Clear Alert Number: None. Related Alert Number: 2320. LRA Number: 2060. SNMP Trap Number: 753. 2320 Single bit ECC error. The DIMM is critically degraded. Severity: Critical/Failure/Error. Cause: The DIMM is malfunctioning. Data loss or data corruption may be imminent. Action: Replace the DIMM immediately to avoid data loss or data corruption. The DIMM is a part of the controller battery pack. See your hardware documentation for information on replacing the DIMM. Clear Alert Number: None. Related Alert Number: 2321. LRA Number: 206. SNMP Trap Number: 754. Storage Management Message Reference 193 Table 4
130. n gt returned to normal operating range Voltage Sensor Events The voltage sensor event messages monitor the number of volts across critical components These messages provide status and warning information for voltage sensors for a particular chassis Table 3 2 Voltage Sensor Events Event Message Severity Cause lt Sensor Name Location gt Critical The voltage of the monitored voltage sensor detected device has exceeded the critical a failure lt Reading gt where threshold lt Sensor Name Location gt is the entity that this sensor is monitoring Reading is specified in volts For example 3 860 V lt Sensor Name Location gt Critical The voltage specified by voltage sensor state asserted lt Sensor Name Location gt is in critical state lt Sensor Name Location gt voltage sensor state de asserted 58 Information The voltage of a previously reported lt Sensor Name Location gt is returned to normal state System Event Log Messages for IPMI Systems Table 3 2 Voltage Sensor Events continued Event Message Severity Cause lt Sensor Name Location gt Warning Voltage of the monitored voltage sensor detected a warning lt Reading gt entity lt Sensor Name Location gt exceeded the warning threshold lt Sensor Name Location gt Information The voltage of a previously voltage sensor returned to reported normal lt Reading gt lt Sensor Name Location gt i
131. ne LRA Number: None. 2234 The Patrol Read rate has changed. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. Storage Management Message Reference 161 Table 4-4. Storage Management Messages (continued) 162 2235 The Check Consistency rate has changed. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2236 Allow Revertible Hot Spare and Replace Member property changed. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2237 Abort Check Consistency on Error modified. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2238 The controller debug log file has been exported. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. Storage Management Message Reference Table 4-4. Storage Management Messages (continued)
132. neration redundant or inappropriate shutdown may have caused the alerts posted to the Alert Log after an unexpected system shutdown controller to repost a large number of alerts to the Alert Log when restarting the system Modified Alerts 2095 2153 2188 2192 2202 2204 2205 2266 Severity changed to Informational SNMP trap changed to 901 Severity changed to Informational SNMP trap changed to 851 Severity changed to Informational SNMP trap changed to 1151 Changed documentation for cause and corrective action Severity changed to Informational SNMP trap changed to 901 everity changed to Informational NMP trap changed to 901 everity changed to Informational NMP trap changed to 901 SNMP traps changed to 751 801 851 901 951 1001 1051 1101 1151 1201 S S S S Storage Management Message Reference 83 Table 4 3 Alert Message Change History continued Alert Message Change History 84 2272 2273 2279 2299 2305 2331 2367 Severity changed to Critical SNMP trap changed to 904 Changed corrective action information in the documentation Changed alert message text and documentation for cause and corrective action Changed alert message text Changed corrective action information in the documentation Changed severity to Warning Changed SNMP trap number to 903 Changed severity to Informational Changed SNMP trap number to
133. nformation Number: None. LRA Number: 2071. 2164 See the Readme file for a list of validated controller driver versions. Severity: OK/Normal. Cause: This alert is for informational purposes. Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller drivers. Action: See the Readme file for driver and firmware requirements. In particular, if Storage Management experiences performance problems, you should verify that you have the minimum supported versions of the drivers and firmware installed. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 101. 134 Storage Management Message Reference Table 4-4. Storage Management Messages (continued) 2165 The RAID controller firmware and driver validation was not performed. The configuration file cannot be opened. Severity: Warning/Non-critical. Cause: Storage Management is unable to determine whether the system has the minimum required versions of the RAID controller firmware and drivers. This situation may occur for a variety of reasons. For example, the installation directory path to the configuration file may not be correct. The configuration file may also have been removed or renamed. Action: Reinstall Storage Management. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2060. SNMP Trap Number: 753. 2166 The RAID Warning Cause
134. nt ID, Description, Severity, Cause and Action, Related Alert Information, SNMP Trap Numbers 2176 The controller battery Learn cycle has started. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: 2177. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1151. 2177 The controller battery Learn cycle has completed. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Status: Alert 2177 is a clear alert for alert 2176. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1151. Storage Management Message Reference 141 Table 4-4. Storage Management Messages (continued) 2178 The controller battery Learn cycle has timed out. Severity: Warning/Non-critical. Cause: The controller battery must be fully charged before the Learn cycle can begin. The battery may be unable to maintain a full charge, causing the Learn cycle to time out. Additionally, the battery must be able to maintain cached data for a specified period of time in the event of a power loss. For example, some batteries maintain cached data for 24 hours. If the battery is unable to maintain cached data for the required period of time, then the Learn cycle will time out. Action: Replace the battery pack, as the battery is unable to maintain a full charge. Clear Alert Number: None. Related Alert Number: None. LRA Number: 2100. SNMP Trap Number: 1153. 21
135. nt sensor detected a non recoverable value 32 Current sensor detected a warning value 31 Current sensor has failed 29 61 current sensor messages 29 Current sensor returned to a normal value 30 61 Current sensor value unknown 30 D Dead disk segments restored 127 Dedicated hot spare assigned Physical disk 1 149 Index 227 Dedicated hot spare unassigned Physical disk 1 149 Dedicated spare imported as global due to missing arrays 211 Device failed 87 Device returned to normal 117 Diagnostic message 1 191 192 Drive Events 67 Driver version mismatch 124 drives messages 67 E Enclosure alarm disabled 127 Enclosure alarm enabled 126 Enclosure firmware mismatch 117 Enclosure was shut down 114 entity presence messages 76 Error occurred 1 205 event description reference 13 F Failure prediction threshold exceeded due to test 113 Fan enclosure inserted into system 45 228 Index fan enclosure messages 45 Fan enclosure removed from system 45 Fan enclosure removed from system for an extended amount of time 46 fan enclosure sensor 9 Fan enclosure sensor detected a non recoverable value 46 Fan enclosure sensor has failed 45 Fan enclosure sensor value unknown 45 fan sensor 9 Fan sensor detected a failure value 25 Fan sensor detected a non recoverable value 25 Fan sensor detected a warning value 24 Fan Sensor Event
136. nt unit has been disconnected has failed or is not present The redundancy unit location chassis location previous redundancy state and the number of devices required for full redundancy are provided Event Message Reference 39 Power Supply Messages Power supply sensors monitor how well a power supply is functioning Power supply messages listed in Table 2 9 provide status and warning information for power supplies present in a particular chassis Table 2 9 Power Supply Messages Event Description Severity Cause ID 1350 Power supply sensor has Information A power supply sensor failed Sensor location in the specified system lt Location in chassis gt failed The sensor location chassis location previous state and additional power supply status information are provided Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt 40 Event Message Reference Table 2 9 Power Supply Messages continued Event Description ID 1351 1352 Severity Power supply sensor value Information unknown Sensor location lt Location in chassis gt Chassis location of chassis gt lt Name Previous state was lt State gt Power Supply type of power supply g
137. o Failure to rebuild data on a disk Number errors on the Error that is defective None target Action Replace the target Related Alert physical disk If a rebuild does not Number 2195 disk automatically start after 2346 replacing the disk initiate LRA Number the Rebuild task You may 207 l need to assign the new disk as a hot spare to initiate the rebuild 2349 Abaddisk Critical Cause A write operation Clear Alert 904 block could Failure could not complete Number not be Error because the disk contains None reassigned bad disk blocks that could Related Alert during a not be reassigned Data loss Number 2346 write may have occurred and operation data redundancy may also LRA Number be lost 2071 Action Replace the disk 2350 There was Critical Cause The rebuild Clear Alert 904 an Failure encountered an Number unrecoverabl Error unrecoverable disk media None e disk media error Related Alert eror during Action Replace the disk Number 2095 the rebuild 2273 LRA Number 2071 206 Storage Management Message Reference Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2351 Aphysical OK Cause This alert is for Clear Alert 901 disk is Normal informational purposes Number 2352 marked as Action None Related Alert missing Number None LRA Number None 2352 A physical OK Cause This alert is for Clear
138. ocation revious s and processor Chassis Location previous state anaip a Name of chasses sensor status are provided Previous state was lt State gt Processor sensor status lt status gt Event Message Reference 50 Table 2 14 Processor Sensor Messages continued Event Description ID 1602 1603 Severity Processor sensor returned to a normal value Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt Processor sensor detected a warning value Warning Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt Processor sensor status lt status gt Information Cause A processor sensor in the specified system transitioned back to a normal state The sensor location chassis location previous state and processor sensor status are provided A processor sensor in the specified system is in a throttled state The sensor location chassis location previous state and processor sensor status are provided Event Message Reference 51 Table 2 14 Processor Sensor Messages continued Event Description 1604 1605 Processor sensor Error detected a failure value Sensor Location lt Location in chassis gt Chassis Location lt Name of chassis gt Previous state was lt State gt
139. om If you do not have a supported version of the firmware available check with your support provider for information on how to obtain the most current firmware Storage Management Message Reference 123 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2132 Driver Warning Cause The controller Clear Alert 753 version Non driver is not a supported Number mismatch critical version None Action Install a supported Related Alert version of the driver If you Number do not have a supported None driver version available it LRA Number can be downloaded from 2060 the Dell support site at support dell com If you do not have a supported version of the driver available check with your support provider for information on how to obtain the most current driver 2135 Array Warning Cause Storage Clear Alert 103 Manager is Non Management has been Number installed on critical installed on a system that None the system has an Array Manager Related Alert installation Number Action Installing Storage None Management and Array LRA Number Manager on the same 2050 124 system is not a supported configuration Uninstall either Storage Management or Array Manager Storage Management Message Reference Table 4 4 Storage Management Messages continued Event Description Severity Cause and Ac
140. one LRA Number: None. 2215 Battery charge process interrupted. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1151. 2216 The battery learn mode has changed to auto. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1151. Storage Management Message Reference 155 Table 4-4. Storage Management Messages (continued) 2217 The battery learn mode has changed to warn. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 1151. 2218 None of the Controller Property are changed. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: You should change at least one controller property and run the command again. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. 2219 Abort Check Consistency on Error, Allow Revertible Hot Spare and Replace Member, Auto Replace Member on Predictive Failure, and Load balance changed. Severity: OK/Normal. Cause: This alert is for informational purposes. Action: None. Clear Alert Number: None. Related Alert Number: None. LRA Number: None. SNMP Trap Number: 751. Storage Management Message Reference 156 Table
141. one cabling configurations LRA Number 2061 Storage Management Message Reference 143 Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2183 Replace Critical Cause The physical disk Clear Alert 904 member Failure being replaced has failed Number operation Error Action None None failed oa Related Alert physical Number 2060 disk 1 LRA Number None 2184 Replace OK Cause User cancelled the Clear Alert 901 member Normal replace member operation Number None operation Action None Related Alert cancelled on Number 2060 physical disk LRA Number None 2185 Replace Warning Cause This alert is Clear Alert 903 member Non provided for informational Number None operation critical purposes Related Alert stopped for Action None Number 2060 rebuild of hot spare on LRA Number None physical disk 14 Storage Management Message Reference Table 4 4 Storage Management Messages continued Event Description Severity Cause and Action Related Alert SNMP ID Information Trap Numbers 2186 The Warning Cause The controller has Clear Alert 753 controller Non flushed the cache and any Number cache has critical data in the cache has been None been lost This may happen if Related Alert discarded the system has memory or Number battery problems that None cause the controller to distrust the cache
142. onfiguration has been detected Controller 1 PERC 5 E Adapter Z NOTE The controller name is not always displayed Battery Message Format Battery X Controller A Example 2174 The controller battery has been removed Battery 0 Controller 1 SCSI Physical Message Format Physical Disk X Y Controller A Connector B Disk Example 2049 Physical disk removed Physical Disk 0 14 Controller 1 Connector 0 SAS Physical Message Format Physical Disk X Y Z Controller A Connector B Disk Example 2049 Physical disk removed Physical Disk 0 0 14 Controller 1 Connector 0 Virtual Disk Message Format Virtual Disk X Name Controller A Name Message Format Virtual Disk X Controller A Example 2057 Virtual disk degraded Virtual Disk 11 Virtual Disk 11 Controller 1 PERC 5 E Adapter K NOTE The virtual disk and controller names are not always displayed Enclosure Message Format Enclosure X Y Controller A Connector B Example 2112 Enclosure shutdown Enclosure 0 2 Controller 1 Connector 0 Storage Management Message Reference 79 Table 4 2 Message Format with Variables for Each Storage Object continued Storage Object Message Variables A B C and X Y Z in the following examples are variables representing the storage object name or number SCSI Power Supply Message Format Power Supply X Controller A Connector B Target ID C where C is the SCSI ID number of the enclosure managem
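The battery, physical disk, and enclosure formats quoted in the excerpt above each end an alert with the storage object's IDs followed by its controller and connector numbers. A minimal sketch of pulling those numbers out of the alert text follows. The function name `parse_storage_object` and the result keys are hypothetical, and the patterns match the formats as they appear in this excerpt, where the original punctuation has been lost; accepting ":" and "," as separators is an assumption about how the live alert text may look.

```python
import re

# Separators between fields: whitespace in this excerpt; ":" and "," are
# accepted as an assumption about the unstripped alert text.
_SEP = r"[\s:,]+"

# One pattern per storage object format shown in the excerpt.
_PATTERNS = {
    "battery": re.compile(rf"Battery (\d+){_SEP}Controller (\d+)"),
    "physical_disk": re.compile(
        rf"Physical Disk (\d+(?:[\s:]\d+){{1,2}}){_SEP}"
        rf"Controller (\d+){_SEP}Connector (\d+)"
    ),
    "enclosure": re.compile(
        rf"Enclosure (\d+[\s:]\d+){_SEP}Controller (\d+){_SEP}Connector (\d+)"
    ),
}


def parse_storage_object(alert_text):
    """Return the object kind, its IDs, and controller/connector, or None."""
    for kind, pattern in _PATTERNS.items():
        match = pattern.search(alert_text)
        if match is None:
            continue
        ids = [int(part) for part in re.split(r"[\s:]", match.group(1))]
        result = {"kind": kind, "ids": ids, "controller": int(match.group(2))}
        if match.lastindex == 3:  # formats that also carry a connector
            result["connector"] = int(match.group(3))
        return result
    return None


if __name__ == "__main__":
    print(parse_storage_object(
        "2049 Physical disk removed Physical Disk 0 0 14 Controller 1 Connector 0"
    ))
```

A SCSI physical disk yields two IDs (X Y) and a SAS physical disk yields three (X Y Z), so the length of `ids` distinguishes the two formats without a separate pattern.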
143. ontents IntrusionEvents 69 BIOS Generated SystemEvents 70 R2 Generated SystemEvents 74 Cable Interconnect Events 75 BatteryEvents 004 75 Power And Performance Events 76 Entity Presence Events 76 4 Storage Management Message Reference 77 Alert Monitoring andLogging 71 Alert Message Format with Substitution Variables 78 Alert Message Change History 81 Alert Descriptions and Corrective Actions 87 LAO at ash Lea he ath al ORs Blea etl Mee hae 217 Contents Contents Introduction Dell OpenManage Server Administrator produces event messages stored primarily in the operating system or Server Administrator event logs and sometimes in SNMP traps This document describes the event messages created by Server Administrator version 5 3 or later and displayed in the Server Administrator Alert log Server Administrator creates events in response to sensor status changes and other monitored parameters The Server Administrator event monitor uses these status change events to add descriptive messages to the operating system event log or the Server Administrator Alert log Each event message that Server Administrator adds to the Alert log consists of a unique identifier called the event ID for a specific event source category and a descriptive message The event message includes the severity
144. ovided lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt 1354 Power supply detected a Error A power supply has been 42 failure Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type of power supply gt lt type lt Additional power supply status information gt If in configuration error state Configuration error type lt type of configuration error gt Event Message Reference disconnected or has failed The sensor location chassis location previous state and additional power supply status information are provided Table 2 9 Power Supply Messages continued Event Description Severity ID 1355 Power supply sensor Error detected a non recoverable value Sensor location lt Location in chassis gt Chassis location lt Name of chassis gt Previous state was lt State gt Power Supply type lt type of power supply gt lt Additional power supply status information gt If in configuration error State Configuration error type lt type of configuration error gt Cause A power supply sensor in the specified system detected an error from which it cannot recover The sensor location chassis location previous state and additional power supply status information are provided
145. parity data during read and write operations. If an error involves only a single bit, it may be possible for the error correction algorithm to correct the error and maintain parity data. An error involving multiple bits, however, usually indicates data loss. In some cases, if the multi-bit error occurs during a read operation, the data on the disk may be correct/valid. If the multi-bit error occurs during a write operation, data loss has occurred.
Action: Replace the dual in-line memory module (DIMM). The DIMM is a part of the controller battery pack. See your hardware documentation for information on replacing the DIMM. You may need to restore data from backup.
Clear Alert Number: None
Related Alert Number: None
LRA Number: 2061
SNMP Trap Numbers: 754

Table 4-4. Storage Management Messages (continued)

Event ID: 2290
Description: Single-bit ECC error
Severity: Warning / Non-critical
Cause: An error involving a single bit has been encountered during a read or write operation. The error correction algorithm has corrected this error.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: 2060
SNMP Trap Numbers: 753

Event ID: 2291
Description: An EMM has been discovered
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
SNMP Trap Numbers: 851
Clear Alert Number: None
Related Alert Number: None
LRA
146. plane board, system board, CPU, or drive carrier in the specified system returned to a valid range after crossing a failure threshold. The sensor location, chassis location, previous state, and temperature sensor value are provided.
Description (continued):
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
  Previous state was: <State>
  If sensor type is not discrete: Temperature sensor value (in degrees Celsius): <Reading>
  If sensor type is discrete: Discrete temperature state: <State>

Event ID: 1053
Description: Temperature sensor detected a warning value
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
  Previous state was: <State>
  If sensor type is not discrete: Temperature sensor value (in degrees Celsius): <Reading>
  If sensor type is discrete: Discrete temperature state: <State>
Severity: Warning
Cause: A temperature sensor on the backplane board, system board, CPU, or drive carrier in the specified system exceeded its warning threshold. The sensor location, chassis location, previous state, and temperature sensor value are provided.

Table 2-2. Temperature Sensor Messages (continued)

Event ID: 1054
Description: Temperature sensor detected a failure value
  Sensor location: <Location in chassis>
  Chassis
Severity: Error
Cause: A temperature sensor on the backplane board, system board, CPU, or drive carrier in the specified system
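Events 1052, 1053, and 1054 differ only in which threshold a reading has crossed. The following Python sketch illustrates that mapping. The threshold values, the function names, and the rule that only upper thresholds are checked are assumptions made for illustration; they are not taken from Server Administrator. As a simplification, the sketch also reports 1052 on any return to the normal range, whereas the table describes 1052 as a return after crossing a failure threshold.

```python
# Event IDs come from Table 2-2; thresholds and function names are hypothetical.
EVENT_NORMAL = 1052   # Temperature sensor returned to a normal value
EVENT_WARNING = 1053  # Temperature sensor detected a warning value
EVENT_FAILURE = 1054  # Temperature sensor detected a failure value

EVENT_FOR_STATE = {
    "normal": EVENT_NORMAL,
    "warning": EVENT_WARNING,
    "failure": EVENT_FAILURE,
}


def classify(reading_c, warning_c, failure_c):
    """Return 'normal', 'warning', or 'failure' for a reading in degrees Celsius."""
    if reading_c > failure_c:
        return "failure"
    if reading_c > warning_c:
        return "warning"
    return "normal"


def event_for_reading(previous_state, reading_c, warning_c=40.0, failure_c=50.0):
    """Return (new_state, event_id); event_id is None when the state did not change."""
    state = classify(reading_c, warning_c, failure_c)
    if state == previous_state:
        return state, None
    return state, EVENT_FOR_STATE[state]


if __name__ == "__main__":
    state = "normal"
    for reading in (35.0, 42.0, 55.0, 30.0):
        state, event_id = event_for_reading(state, reading)
        print(reading, state, event_id)
```

Reporting only on state changes mirrors the way each message carries a "Previous state was" field; a real monitor would probably also add hysteresis so that a reading hovering at a threshold does not generate a stream of events.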
147. present, and SEL Data Record (SDR) not present.

Table 2-2. Temperature Sensor Messages

Event ID: 1050
Description: Temperature sensor has failed
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
  Previous state was: <State>
  If sensor type is not discrete: Temperature sensor value (in degrees Celsius): <Reading>
  If sensor type is discrete: Discrete temperature state: <State>
Severity: Information
Cause: A temperature sensor on the backplane board, system board, or the carrier in the specified system failed. The sensor location, chassis location, previous state, and temperature sensor value are provided.

Event ID: 1051
Description: Temperature sensor value unknown
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
  If sensor type is not discrete: Temperature sensor value (in degrees Celsius): <Reading>
  If sensor type is discrete: Discrete temperature state: <State>
Severity: Information
Cause: A temperature sensor on the backplane board, system board, or drive carrier in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and a nominal temperature sensor value are provided.

Table 2-2. Temperature Sensor Messages (continued)

Event ID: 1052
Description: Temperature sensor returned to a normal value
Severity: Information
Cause: A temperature sensor on the back
148. provided.

Event ID: 1553
Description: Log size is near or at capacity
  Log type: <Log type>
Severity: Warning
Cause: The size of a hardware log on the specified system is near or at the capacity of the hardware log. The log type information is provided.

Event ID: 1554
Description: Log size is full
  Log type: <Log type>
Severity: Error
Cause: The size of a hardware log on the specified system is full. The log type information is provided.

Event ID: 1555
Description: Log sensor has failed
  Log type: <Log type>
Severity: Error
Cause: A hardware log sensor in the specified system failed. The hardware log status cannot be monitored. The log type information is provided.

Processor Sensor Messages
Processor sensors monitor how well a processor is functioning. Processor messages listed in Table 2-14 provide status and warning information for processors in a particular chassis.

Table 2-14. Processor Sensor Messages

Event ID: 1600
Description: Processor sensor has failed
  Sensor Location: <Location in chassis>
  Chassis Location: <Name of chassis>
  Previous state was: <State>
  Processor sensor status: <status>
Severity: Information
Cause: A processor sensor in the specified system is not functioning. The sensor location, chassis location, previous state, and processor sensor status are provided.

Event ID: 1601
Description: Processor sensor value unknown
  Sensor Location: <Location in chassis>
Severity: Information
Cause: A processor sensor in the specified system could not obtain a reading. The sensor location, chassis l
149. rap number and description text 2060 2075 2087 Updated the alert description and changed the SNMP trap number to 1201 Storage Management 2 3 Comments Product Versions Storage Management 2 3 to which Changes Server Administrator 3 2 Appl ERJ Dell OpenManage 5 3 New Alerts 2369 Modified Alerts 2095 Added SNMP traps 751 and 851 2294 Removed SNMP traps 752 802 852 902 952 1002 1052 1102 1152 and 1202 Added SNMP trap 851 2295 Removed SNMP traps 754 804 904 954 1004 1054 1104 1154 and 1204 Remaining SNMP trap is 854 Obsolete Alerts 2317 2363 Documentation Documentation updated to Changes indicate related alerts and Local Response Agent LRA alerts 2095 2305 Changed documentation for cause Changed documentation for cause and corrective action Changed SNMP trap number to 903 82 Storage Management Message Reference Table 4 3 Alert Message Change History continued Alert Message Change History 2312 2367 Changed documentation for corrective action in the Storage Management online help Changed documentation for cause and corrective action Storage Management 2 2 Comments Product Versions Storage Management 2 2 to which Changes Server Administrator 3 2 Apply Dell OpenManage 5 2 Reduction of Enhancements to Storage In previous versions of Storage unnecessary alert Management avoid numerous Management an unexpected system ge
150. ras power off or power cycle the system. Alternatively, the user had indicated protective measures to be initiated in the event of a thermal shutdown.

Table 2-1. Miscellaneous Messages (continued)

Event ID: 1008
Description: Systems Management Data Manager Started
Severity: Information
Cause: Systems Management Data Manager services were started.

Event ID: 1009
Description: Systems Management Data Manager Stopped
Severity: Information
Cause: Systems Management Data Manager services were stopped.

Event ID: 1011
Description: RCI table is corrupt
Severity: Warning
Cause: This message is generated when the BIOS Remote Configuration Interface (RCI) table is corrupted or cannot be read by the systems management software.

Event ID: 1012
Description: IPMI Status
  Interface: <the IPMI interface being used>
  <additional information if available and applicable>
Severity: Information
Cause: This message is generated to indicate the Intelligent Platform Management Interface (IPMI) status of the system. Additional information, when available, includes Baseboard Management Controller (BMC) not present, BMC not responding, System Event Log (SEL) not

Temperature Sensor Messages
Temperature sensors listed in Table 2-2 help protect critical components by alerting the systems management console when temperatures become too high inside a chassis. The temperature sensor messages use additional variables: sensor location, chassis location, previous state, and temperature sensor value or state.
151. replacement error 129 Bad block sense error 129 Bad block table is 80 full 188 Bad block table is full Unable to log block 1 188 Bad PHY 1 185 Battery charge in progress 155 Battery charge process interrupted 155 battery messages 75 BIOS Generated System Events 70 bios generated system messages 70 BMC Watchdog Events 64 BMC watchdog messages 64 c cable interconnect messages 75 Change write policy 116 Chassis intrusion detected 35 63 Chassis intrusion in progress 34 63 chassis intrusion messages 33 Chassis intrusion returned to normal 34 chassis intrusion sensor 9 Chassis intrusion sensor detected a non recoverable value 35 63 Chassis intrusion sensor has failed 33 Chassis intrusion sensor value unknown 33 62 Chassis Management Controller Messages 36 Communication regained 133 Communication timeout 125 Communication with the enclosure has been lost 182 Controller alarm disabled 128 Controller alarm enabled 128 Controller alarm has been tested 132 Controller battery is reconditioning 107 Controller battery low 129 Controller battery recondition is completed 107 Controller configuration has been reset 132 Controller event log 1 199 201 Controller log file entry 1 171 Controller rebuild rate has changed 128 cooling device messages 23 current sensor 9 Current sensor detected a failure value 31 Curre
152. presence of AC power for an AC power cord.
• Hardware Log Sensor: Monitors the size of a hardware log.
• Processor Sensor: Monitors the processor status in the system.
• Pluggable Device Sensor: Monitors the addition, removal, or configuration errors for some pluggable devices, such as memory cards.
• Battery Sensor: Monitors the status of one or more batteries in the system.

Sample Event Message Text
The following example shows the format of the event messages logged by Server Administrator.

EventID: 1000
Source: Server Administrator
Category: Instrumentation Service
Type: Information
Date and Time: Mon Oct 21 10:38:00 2002
Computer: <computer name>
Description: Server Administrator starting
Data: Bytes in Hex

Viewing Alerts and Event Messages
An event log is used to record information about important events. Server Administrator generates alerts that are added to the operating system event log and to the Server Administrator Alert log. To view these alerts in Server Administrator:
1 Select the System object in the tree view.
2 Select the Logs tab.
3 Select the Alert subtab.
You can also view the event log using your operating system's event viewer. Each operating system's event viewer accesses the applicable operating system event log.
The location of the event log file depends on the operating system you are using. In t
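The sample event message in this section is a sequence of labeled fields, so it can be split into a record with a small parser. The Python sketch below shows one way to do that. The field labels are copied from the sample; the `label: value` layout, the function name `parse_event`, and the idea of parsing the text at all are assumptions for illustration, not a Server Administrator interface.

```python
import re

# Field labels as they appear in the sample event message.
FIELDS = [
    "EventID", "Source", "Category", "Type",
    "Date and Time", "Computer", "Description", "Data",
]

# Hypothetical rendering of the sample message as a single string.
SAMPLE = (
    "EventID: 1000 Source: Server Administrator "
    "Category: Instrumentation Service Type: Information "
    "Date and Time: Mon Oct 21 10:38:00 2002 Computer: <computer name> "
    "Description: Server Administrator starting Data: Bytes in Hex"
)

# One alternation over the known labels, each followed by a colon.
LABEL_RE = re.compile(r"\b(%s):\s*" % "|".join(re.escape(f) for f in FIELDS))


def parse_event(text):
    """Split an event message into a dict keyed by field label."""
    matches = list(LABEL_RE.finditer(text))
    record = {}
    for i, match in enumerate(matches):
        # A value runs from the end of its label to the start of the next label.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        record[match.group(1)] = text[match.end():end].strip()
    return record


if __name__ == "__main__":
    for label, value in parse_event(SAMPLE).items():
        print(label, "=", value)
```

Matching only the known labels keeps the colons inside the timestamp from being mistaken for field separators.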
153. RAID SCSI driver version is older than the minimum required level. See readme.txt for the validated driver version.
Severity: Non-critical
Cause: driver does not meet the minimum requirements. Storage Management may not be able to display the storage or perform storage management functions until you have updated the system to meet the minimum requirements.
Action: See the Readme file for the validated driver version. Update the system to meet the minimum requirements and then reinstall Storage Management.
Clear Alert Number: None
Related Alert Number: None
LRA Number: 2050
SNMP Trap Numbers: 103

Table 4-4. Storage Management Messages (continued)

Event ID: 2169
Description: The controller battery needs to be replaced
Severity: Critical / Failure / Error
Cause: The controller battery cannot recharge. The battery may be old or it may have been already recharged the maximum number of times. In addition, the battery charger may not be working.
Action: Replace the battery pack.
Clear Alert Number: None
Related Alert Number: 2118
LRA Number: 2101
SNMP Trap Numbers: 1154

Event ID: 2170
Description: The controller battery charge level is normal
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 1151

Table 4-4. Storage Management Messages (continued)
154. Event ID: 2369
Description: Virtual Disk Redundancy has been degraded
Severity: OK / Normal
Cause: A physical disk in a RAID 6 virtual disk has either failed or been removed.
Action: Replace the missing or failed physical disk.
Clear Alert Number: 2121
Related Alert Number: 2048, 2049, 2050, 2076, 2346
LRA Number: None
SNMP Trap Numbers: 1201

Event ID: 2371
Description: Attempted import of Unsupported Virtual Disk type RAID 1
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 751

Event ID: 2372
Description: Attempted import of Virtual Disk exceeding the limit supported on the controller
Severity: OK / Normal
Cause: This alert is provided for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Alert Number: None
SNMP Trap Numbers: 751

Table 4-4. Storage Management Messages (continued)

Event ID: 2373
Description: Attempted import of unsupported Virtual Disk type RAID 1
Severity: OK / Normal
Cause: This alert is provided for informational purposes. User is attempting to import a foreign virtual disk with unsupported RAID level on the controller.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Alert Number: None
SNMP Trap Numbers: 751

Event ID: 2374
Description: Attempted import of Virtual Disk
Severity: OK / Normal
SNMP Trap Numbers: 751
Cause: This alert is provided for informational purpos
155. s 59 Fan sensor has failed 23 58 fan sensor messages 59 Fan sensor returned to a normal value 24 Fan sensor value unknown 23 58 Firmware version mismatch 123 G Global hot spare assigned 104 Global hot spare unassigned 105 hardware log sensor 9 Hardware Log Sensor Events 66 hardware log sensor messages 66 Hot spare SMART polling failed 178 Intrusion Events 69 intrusion messages 69 L Log backup created 17 Log monitoring has been disabled 49 69 Log size is near or at capacity 49 Log size returned to a normal level 49 Log status is unknown 49 69 Log was cleared 17 Maximum temperature probe warning threshold value changed 131 Memory device ECC Correctable error count crossed a warning threshold 44 Memory device ECC Correctable error count sensor crossed a failure threshold 44 memory device messages 44 Memory device monitoring has been disabled 44 Memory ECC Events 64 memory ecc messages 64 Memory Events 65 memory modules messages 65 memory prefailure sensor 9 messages AC power cord 47 67 battery 75 battery sensor 55 bios generated system 70 BMC watchdog 64 cable interconnect 75 chassis intrusion 33 cooling device 23 current sensor 29 drives 67 entity presence 76 fan enclosure 45 fan sensor 59 hardware log sensor 66 Index 229 intrusion 69 memory device 44 memory ecc 64 memory modules 65 miscellaneous 17 pluggab
156. s returned to normal state.

Fan Sensor Events
The cooling device sensors monitor how well a fan is functioning. These messages provide status, warning, and failure messages for fans for a particular chassis.

Table 3-3. Fan Sensor Events

Event Message: <Sensor Name/Location> Fan sensor detected a failure <Reading>
  where <Sensor Name/Location> is the entity that this sensor is monitoring. For example, BMC Back Fan or BMC Front Fan.
  <Reading> is specified in RPM. For example, 100 RPM.
Severity: Critical
Cause: The speed of the specified <Sensor Name/Location> fan is not sufficient to provide enough cooling to the system.

Event Message: <Sensor Name/Location> Fan sensor returned to normal state <Reading>
Severity: Information
Cause: The fan specified by <Sensor Name/Location> has returned to its normal operating speed.

Table 3-3. Fan Sensor Events (continued)

Event Message: <Sensor Name/Location> Fan sensor detected a warning <Reading>
Severity: Warning
Cause: The speed of the specified <Sensor Name/Location> fan may not be sufficient to provide enough cooling to the system.

Event Message: <Sensor Name/Location> Fan Redundancy sensor redundancy degraded
Severity: Information
Cause: The fan specified by <Sensor Name/Location> may have failed and hence, the redundancy has been degraded.

Event Message: <Sensor Name/Location
157. s switched on
Severity: Normal
Cause: informational purposes
Action: None
Clear Alert Status: Alert 2323 is a clear alert for alerts 2313 and 2322.
Related Alert Number: None
LRA Number: None

Event ID: 2324
Description: The AC power supply cable has been removed
Severity: Critical / Failure / Error
Cause: The power cable may be pulled out or removed. The power cable may also have overheated and become warped and nonfunctional.
Action: Replace the power cable.
Clear Alert Number: 2325
Related Alert Number: None
LRA Number: 209
SNMP Trap Numbers: 1004

Event ID: 2325
Description: The power supply cable has been inserted
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Status: Alert 2325 is a clear alert for alerts 2324 and 2312.
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 1001

Table 4-4. Storage Management Messages (continued)

Event ID: 2326
Description: A foreign configuration has been detected
Severity: OK / Normal
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 751
Cause: This alert is for informational purposes. The controller has physical disks that were moved from another controller. These physical disks contain virtual disks that were created on the other controller. See the Import Foreign Configuration and Clear Foreign Configuration section in the Dell OpenManage Server Administrator Storage Management User's Guide
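The Clear Alert Status entries describe a simple relationship: when a clear alert arrives, the alerts it clears are no longer outstanding. The Python sketch below models that bookkeeping. The two relationships in `CLEARS` are quoted from alerts 2323 and 2325; the function name `apply_alert`, the set-based tracking, and the choice to keep every other alert outstanding (including informational ones such as 2326) are assumptions for illustration only.

```python
# Clear-alert relationships quoted from the alert descriptions:
# alert 2323 clears 2313 and 2322; alert 2325 clears 2324 and 2312.
CLEARS = {
    2323: {2313, 2322},
    2325: {2324, 2312},
}


def apply_alert(active, alert_id):
    """Return the set of outstanding alerts after alert_id is received."""
    cleared = CLEARS.get(alert_id)
    if cleared is not None:
        # A clear alert removes the alerts it clears and is not itself retained.
        return active - cleared
    return active | {alert_id}


if __name__ == "__main__":
    active = set()
    for alert_id in (2324, 2326, 2325):
        active = apply_alert(active, alert_id)
    print(sorted(active))  # prints [2326]
```

In the example, the cable-removed alert 2324 is raised and then cleared by 2325, leaving only 2326 outstanding.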
158. s can cause the excessive temperature. For example, a fan may have failed, the thermostat may be set too high, or the room temperature may be too hot or cold. Verify that the fans in the server or enclosure are working. If the physical disk is in an enclosure, you should check the thermostat settings and examine whether the enclosure is located near a heat source.
Clear Alert Number: None
Related Alert Number: None
LRA Number: 2070
SNMP Trap Numbers: 903

Table 4-4. Storage Management Messages (continued)

Event ID: 2109 (cont'd)
Make sure the enclosure has enough ventilation and that the room temperature is not too hot. See the physical disk enclosure documentation for more diagnostic information.
Action 2: If you cannot identify why the disk has reached an unacceptable temperature, then replace the disk. If the physical disk is a member of a non-redundant virtual disk, then back up the data before replacing the disk.
NOTICE: Removing a physical disk that is included in a non-redundant virtual disk will cause the virtual disk to fail and may cause data loss.

Table 4-4. Storage Management Messages (continued)

Event ID: 2110
Description: SMART
Severity: Warning
Cause: A d
159. s displayed with the alert in the Alert Log. This text can vary depending on the situation.
Action: None
LRA Number: None
SNMP Trap Numbers: 1151, 1201

Table 4-4. Storage Management Messages (continued)

Event ID: 2267
Description: The controller reconstruct rate has changed
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 751

Event ID: 2268
Description: %1 Storage Management has lost communication with the controller. An immediate reboot is strongly recommended to avoid further problems. If the reboot does not restore communication, then contact technical support for more information.
Severity: Critical / Failure / Error
Cause: Storage Management has lost communication with a controller. This may occur if the controller driver or firmware is experiencing a problem. The %1 indicates a substitution variable. The text for this substitution variable is displayed with the alert in the Alert Log and can vary depending on the situation.
Action: Reboot the system. If the problem is not resolved, contact technical support. See your system documentation for information about contacting technical support by using telephone, fax, and Internet services.
Clear Alert Number: None
Related Alert Number: None
LRA Number: 2091
SNMP Trap Numbers: 104
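A substitution variable is a numbered placeholder in the alert text that is filled in when the alert is written to the Alert Log. The Python sketch below shows the general idea, assuming placeholders are written as `%1`, `%2`, and so on. The function name `expand` and the sample value `"Controller 0:"` are hypothetical, and this is not how Storage Management itself performs the substitution.

```python
import re


def expand(template, values):
    """Replace %1, %2, ... in template with values[0], values[1], ..."""
    def replace(match):
        index = int(match.group(1)) - 1  # placeholders are numbered from 1
        if 0 <= index < len(values):
            return str(values[index])
        return match.group(0)  # leave placeholders without a value untouched
    return re.sub(r"%(\d+)", replace, template)


if __name__ == "__main__":
    template = "%1 Storage Management has lost communication with the controller."
    # "Controller 0:" is an invented value used only to show the mechanism.
    print(expand(template, ["Controller 0:"]))
```

Leaving unmatched placeholders in place, rather than raising an error, keeps the message readable when fewer values are supplied than the template expects.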
160. Event Message: channel chk
Cause: s event is generated when a critical interrupt is generated in the I/O Channel.

Event Message: System Event PCI Parity Err
Severity: Critical
Cause: This event is generated when a parity error is detected on the PCI bus.

Event Message: System Event Chipset Err
Severity: Critical
Cause: This event is generated when a chip error is detected.

Event Message: System Event PCI System Err
Severity: Information
Cause: This event indicates historical data and is generated when the system has crashed and recovered.

Event Message: System Event PCI Fatal Err
Severity: Critical
Cause: This error is generated when a fatal error is detected on the PCI bus.

Event Message: System Event PCIE Fatal Err
Severity: Critical
Cause: This error is generated when a fatal error is detected on the PCIE bus.

Event Message: POST Err
Severity: Critical
Cause: This event is generated when an error occurs during system boot. See the system documentation for more information on the error code.

Event Message: Memory Spared redundancy lost
Severity: Critical
Cause: This event is generated when memory spare is no longer redundant.

Event Message: Memory Mirrored redundancy lost
Severity: Critical
Cause: This event is generated when memory mirroring is no longer redundant.

Event Message: Memory RAID redundancy lost
Severity: Critical
Cause: This event is generated when memory RAID is no longer redundant.

Event Message: Err Reg Pointer OEM Diagnostic data event was asserted
Severity: Information
Cause: This event is generated when an OEM event occurs.

Table 3-12. BIOS Generated System Events (continued)

Event Message: System Board PFault
Severity: Critical
161. Cause: A chassis intrusion sensor in the specified system detected that a cover was opened while the system was operating but has since been replaced. The sensor location, chassis location, previous state, and chassis intrusion state are provided.

Description: Chassis intrusion in progress
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
  Previous state was: <State>
  Chassis intrusion state: <Intrusion state>
Severity: Warning
Cause: A chassis intrusion sensor in the specified system detected that a system cover is currently being opened and the system is operating. The sensor location, chassis location, previous state, and chassis intrusion state are provided.

Table 2-6. Chassis Intrusion Messages (continued)

Event ID: 1254
Description: Chassis intrusion detected
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
  Previous state was: <State>
  Chassis intrusion state: <Intrusion state>
Severity: Error
Cause: A chassis intrusion sensor in the specified system detected that the system cover was opened while the system was operating. The sensor location, chassis location, previous state, and chassis intrusion state are provided.

Event ID: 1255
Description: Chassis intrusion sensor detected a non-recoverable value
  Sensor location: <Location in chassis>
  Chassis
Severity: Error
Cause: A chassis intrusion sensor
162. extended medium
Severity: Non-critical
Cause: A portion of a physical disk is damaged.
Action: See the Dell OpenManage Server Administrator Storage Management online help for more information.
Clear Alert Number: None
Related Alert Number: None
LRA Number: 2060
SNMP Trap Numbers: 753

Event ID: 2151
Description: Asset tag changed
Severity: OK / Normal
Cause: This alert is for informational purposes. A user has changed the enclosure asset tag.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 851

Table 4-4. Storage Management Messages (continued)

Event ID: 2152
Description: Asset name changed
Severity: OK / Normal
Cause: This alert is for informational purposes. A user has changed the enclosure asset name.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 851

Event ID: 2153
Description: Service tag changed
Severity: OK / Normal
Cause: An enclosure service tag was changed. In most circumstances, this service tag should only be changed by Dell support or your service provider.
Action: Ensure that the tag was changed under authorized circumstances.
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 851

Event ID: 2154
Description: Maximum temperature probe warning threshold
Severity: OK / Normal
Clear Alert Number: None
SNMP Trap Numbers: 1051
Cause: This alert is for informational purposes. A user has changed the value for the maximum temperature
163. sure documentation for further diagnostic information.
Clear Alert Number: 2052
Related Alert Number: 2054, 2057, 2056, 2076, 2079, 2081, 2083, 2129, 2202, 2204, 2270, 2292, 2299, 2369
LRA Number: 2070
SNMP Trap Numbers: 903

Table 4-4. Storage Management Messages (continued)

Event ID: 2050
Description: Physical disk offline
Severity: Warning / Non-critical
Cause: A physical disk in the disk group is offline. A user may have manually put the physical disk offline.
Action: Perform a rescan. You can also select the offline disk and perform a Make Online operation.
Clear Alert Number: 2158
Related Alert Number: 2099, 2196
LRA Number: 2070
SNMP Trap Numbers: 903

Event ID: 2051
Description: Physical disk degraded
Severity: Warning / Non-critical
Cause: A physical disk has reported an error condition and may be degraded. The physical disk may have reported the error condition in response to a consistency check or other operation.
Action: Replace the degraded physical disk. You can identify which disk is degraded by locating the disk that has a red X for its status. Perform a rescan after replacing the disk.
Clear Alert Number: None
Related Alert Number: 2070
LRA Number: None
SNMP Trap Numbers: 903

Event ID: 2052
Description: Physical disk inserted
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
SNMP Trap Numbers: 901
Clear Alert Number: None
Related Alert Number: 2065, 2305, 236
164. system. The sensor location and chassis location are provided.

Table 2-11. Fan Enclosure Messages (continued)

Event ID: 1454
Description: Fan enclosure removed from system for an extended amount of time
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
Severity: Error
Cause: A fan enclosure has been removed from the specified system for a user-definable length of time. The sensor location and chassis location are provided.

Event ID: 1455
Description: Fan enclosure sensor detected a non-recoverable value
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
Severity: Error
Cause: A fan enclosure sensor in the specified system detected an error from which it cannot recover. The sensor location and chassis location are provided.

AC Power Cord Messages
AC power cord messages listed in Table 2-12 provide status and warning information for power cords that are part of an AC power switch, if your system supports AC switching.

Table 2-12. AC Power Cord Messages

Event ID: 1500
Description: AC power cord sensor has failed
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
Severity: Information
Cause: An AC power cord sensor in the specified system failed. The AC power cord status cannot be monitored. The sensor location and chassis location information are provided.

Event ID: 1501
Description: AC power cord is not being monitored
Severity: Information
165. t <type
  <Additional power supply status information>
  If in configuration error state: Configuration error type: <type of configuration error>
Cause: A power supply sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and additional power supply status information are provided.

Description: Power supply returned to normal
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
  Previous state was: <State>
  Power Supply type: <type of power supply>
  <Additional power supply status information>
  If in configuration error state: Configuration error type: <type of configuration error>
Severity: Information
Cause: A power supply has been reconnected or replaced. The sensor location, chassis location, previous state, and additional power supply status information are provided.

Table 2-9. Power Supply Messages (continued)

Event ID: 1353
Description: Power supply detected a warning
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
  Previous state was: <State>
  Power Supply type: <type of power supply>
Severity: Warning
Cause: A power supply sensor reading in the specified system exceeded a user-definable warning threshold. The sensor location, chassis location, previous state, and additional power supply status information are pr
166. Event ID: 2259
Description: An enclosure blink operation has initiated
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: 2260
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 851

Event ID: 2260
Description: An enclosure blink has ceased
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 851

Event ID: 2261
Description: A global rescan has initiated
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Related Alert Information: None
SNMP Trap Numbers: 101

Event ID: 2262
Description: SMART thermal shutdown is enabled
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 101

Table 4-4. Storage Management Messages (continued)

Event ID: 2263
Description: SMART thermal shutdown is disabled
Severity: OK / Normal
Cause: This alert is for informational purposes.
Action: None
Clear Alert Number: None
Related Alert Number: None
LRA Number: None
SNMP Trap Numbers: 101

Event ID: 2264
Description: A device is missing
Severity: Warning / Non-critical
Cause: The controller cannot communicate with a device. The device may be removed. There may also be a bad or loose cable.
Clear Alert Number: None
Related Alert Number: None
SNMP Trap Numbers: 753, 803, 853, 903, 953, 1003
Action: Check if the device is in and not removed. If it is in, check the
167. t sensor value (in Watts): <Reading>
  If sensor type is discrete: Discrete current state: <State>

Event ID: 1204
Description: Current sensor detected a failure value
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
  Previous state was: <State>
  If sensor type is not discrete: Current sensor value (in Amps): <Reading> OR Current sensor value (in Watts): <Reading>
  If sensor type is discrete: Discrete current state: <State>
Severity: Error
Cause: A current sensor in the specified system exceeded its failure threshold. The sensor location, chassis location, previous state, and current sensor value are provided.

Table 2-5. Current Sensor Messages (continued)

Event ID: 1205
Description: Current sensor detected a non-recoverable value
  Sensor location: <Location in chassis>
  Chassis location: <Name of chassis>
  Previous state was: <State>
  If sensor type is not discrete: Current sensor value (in Amps): <Reading> OR Current sensor value (in Watts): <Reading>
  If sensor type is discrete: Discrete current state: <State>
Severity: Error
Cause: A current sensor in the specified system detected an error from which it cannot recover. The sensor location, chassis location, previous state, and current sensor value are provided.

Chassis Intrusion Messages
Chassis intrusion messages listed in Table 2-6 are a securi
168. ta before replacing the disk.
NOTICE: Removing a physical disk that is included in a non-redundant virtual disk will cause the virtual disk to fail and may cause data loss.

Table 4-4. Storage Management Messages (continued)

Event ID: 2108
Description: Smart warning
Severity: Warning / Non-critical
Cause: A disk has received a SMART alert (predictive failure). The disk is likely to fail in the near future.
Action: Replace the disk that has received the SMART alert. If the physical disk is a member of a non-redundant virtual disk, then back up the data before replacing the disk.
NOTICE: Removing a physical disk that is included in a non-redundant virtual disk will cause the virtual disk to fail and may cause data loss.
Clear Alert Number: None
Related Alert Number: None
LRA Number: 3979
SNMP Trap Numbers: 903

Table 4-4. Storage Management Messages (continued)

Event ID: 2109
Description: SMART warning temperature
Severity: Warning / Non-critical
Cause: A disk has reached an unacceptable temperature and received a SMART alert (predictive failure). The disk is likely to fail in the near future.
Action 1: Determine why the physical disk has reached an unacceptable temperature. A variety of factor
169. te>. If sensor type is not discrete: Current sensor value in Amps: <Reading> OR Current sensor value in Watts: <Reading>. If sensor type is discrete: Discrete current state: <State>.
Cause (continued): location, previous state, and a nominal current sensor value are provided.
Event ID 1202. Severity: Information.
Description: Current sensor returned to a normal value. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. If sensor type is not discrete: Current sensor value in Amps: <Reading> OR Current sensor value in Watts: <Reading>. If sensor type is discrete: Discrete current state: <State>.
Cause: A current sensor in the specified system returned to a valid range after crossing a failure threshold. The sensor location, chassis location, previous state, and current sensor value are provided.
Table 2-5. Current Sensor Messages (continued)
Event ID 1203. Severity: Warning.
Cause: A current sensor in the specified system exceeded its warning threshold. The sensor location, chassis location, previous state, and current sensor value are provided.
Description: Current sensor detected a warning value. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. If sensor type is not discrete: Current sensor value in Amps: <Reading> OR Curren
170. tem's next scheduled maintenance. Clear the memory error on multibit ECC error. The memory device status and location are provided.
Fan Enclosure Messages
Some systems are equipped with a protective enclosure for fans. Fan enclosure messages listed in Table 2-11 monitor whether foreign objects are present in an enclosure and how long a fan enclosure is missing from a chassis.
Table 2-11. Fan Enclosure Messages
Event ID 1450. Severity: Information.
Description: Fan enclosure sensor has failed. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>.
Cause: The fan enclosure sensor in the specified system failed. The sensor location and chassis location are provided.
Event ID 1451. Severity: Information.
Description: Fan enclosure sensor value unknown. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>.
Cause: The fan enclosure sensor in the specified system could not obtain a reading. The sensor location and chassis location are provided.
Event ID 1452. Severity: Information.
Description: Fan enclosure inserted into system. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>.
Cause: A fan enclosure has been inserted into the specified system. The sensor location and chassis location are provided.
Event ID 1453. Severity: Warning.
Description: Fan enclosure removed from system. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>.
Cause: A fan enclosure has been removed from the specified
171. the system that generated the message, for example: Memory device bank location: Bank_1
Specifies the location of the memory module in the chassis, for example: Memory device location: DIMM_A
Number of devices required for full redundancy: <Number>
Specifies the number of power supply or cooling devices required to achieve full redundancy, for example: Number of devices required for full redundancy: 4
Possible memory module event cause: <list of causes>
Specifies a list of possible causes for the memory module event, for example: Possible memory module event cause: Single bit warning error rate exceeded, Single bit error logging disabled
Table 1-2. Event Description Reference (continued)
Power Supply type: <type of power supply>
Specifies the type of power supply, for example: Power Supply type: VRM
Previous redundancy state was: <State>
Specifies the status of the previous redundancy message, for example: Previous redundancy state was: Lost
Previous state was: <State>
Specifies the previous state of the sensor, for example: Previous state was: OK (Normal)
Processor sensor status: <status>
Specifies the status of the processor sensor, for example: Processor sensor status: Configuration error
Redundancy unit: <Redundancy location in chassis>
Specifies the
172. the operating system (OS) application log. Sends an SNMP trap if the operating system's SNMP service is installed and enabled.
NOTE: Dell OpenManage Server Administrator Storage Management does not log alerts regarding the data I/O path. These alerts are logged by the respective RAID drivers in the system alert log. See the Storage Management Online Help and the Dell OpenManage Server Administrator Storage Management User's Guide for updated information.
Alert Message Format with Substitution Variables
When you view an alert in the Server Administrator alert log, the alert identifies the specific components, such as the controller name or the virtual disk name, to which the alert applies. In an actual operating environment, a storage system can have many combinations of controllers and disks, as well as user-defined names for virtual disks and other components. Because each environment is unique in its storage configuration and user-defined names, an accurate alert message requires that the Storage Management Service be able to insert the environment-specific names of storage components into an alert message. This environment-specific information is inserted after the alert message text, as shown for alert 2127 in Table 4-1. For other alerts, the alert message text is constructed from information passed directly from the controller (or another storage component) to the Alert Log. In these cas
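The substitution described in this excerpt, where environment-specific component names are placed after the alert message text, can be illustrated with a minimal sketch. The function name `format_alert`, the separator characters, and the component names used here are hypothetical; the exact layout that Storage Management uses is defined in Table 4-1, which is not reproduced in this excerpt.

```python
def format_alert(message_text: str, components: dict[str, str]) -> str:
    """Append environment-specific component names after the alert text.

    Hypothetical helper for illustration only: the separators and the
    ordering are assumptions, not the product's documented format.
    """
    if not components:
        # Nothing to substitute: the alert text is shown unchanged.
        return message_text
    details = ", ".join(f"{kind} {name}" for kind, name in components.items())
    return f"{message_text}: {details}"


# Example with made-up component names for the "Virtual disk degraded" alert.
example = format_alert(
    "Virtual disk degraded",
    {"Controller": "0", "Virtual Disk": "3"},
)
print(example)
```

Keeping the fixed alert text separate from the appended names mirrors the idea in the text: the message wording is the same in every environment, and only the trailing component names vary.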
173. tion. Related Alert Information. SNMP Trap Numbers.
Event ID 2136. Description: Virtual disk initialization. Severity: OK / Normal.
Cause: This alert is for informational purposes. Virtual disk initialization is in progress.
Action: None.
Related Alert Information: Clear Alert Number: 2088. Related Alert Number: None. LRA Number: None.
SNMP Trap Number: 1201.
Event ID 2137. Description: Communication timeout. Severity: Warning / Non-critical.
Cause: The controller is unable to communicate with an enclosure. There are several reasons why communication may be lost. For example, there may be a bad or loose cable. An unusual amount of I/O may also interrupt communication with the enclosure. In addition, communication loss may be caused by software, hardware, or firmware problems, bad or failed power supplies, and enclosure shutdown. When viewed in the Alert Log, the description for this event displays several variables. These variables are: Controller and enclosure names, type of communication problem, return code, and SCSI status.
Related Alert Information: Clear Alert Number: 2162. Related Alert Number: None.
SNMP Trap Number: 853.
Table 4-4. Storage Management Messages (continued)
Event ID 2137 (contd.).
Action: Check for problems with the cables. See the online help for more information on checking the cables. You should also check to see if the enclosure has degraded or failed components. To do so, select th
174. ty measure. Chassis intrusion means that someone is opening the cover to a system's chassis. Alerts are sent to prevent unauthorized removal of parts from a chassis.
Table 2-6. Chassis Intrusion Messages
Event ID 1250. Severity: Information.
Description: Chassis intrusion sensor has failed. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. Chassis intrusion state: <Intrusion state>.
Cause: A chassis intrusion sensor in the specified system failed. The sensor location, chassis location, previous state, and chassis intrusion state are provided.
Event ID 1251. Severity: Information.
Description: Chassis intrusion sensor value unknown. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. Chassis intrusion state: <Intrusion state>.
Cause: A chassis intrusion sensor in the specified system could not obtain a reading. The sensor location, chassis location, previous state, and chassis intrusion state are provided.
Table 2-6. Chassis Intrusion Messages (continued)
Event ID 1252. Severity: Information.
Description: Chassis intrusion returned to normal. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. Chassis intrusion state: <Intrusion state>.
Event ID 1253.
Description: Chassi
175. ual disk to fail. If the virtual disk is redundant, then more physical disks have failed than can be rebuilt using mirrored or parity information.
Action: Create a new virtual disk and restore from a backup.
Related Alert Number (continued): 2346.
Table 4-4. Storage Management Messages (continued)
Event ID 2057. Description: Virtual disk degraded. Severity: Warning / Non-critical.
Cause 1: This alert message occurs when a physical disk included in a redundant virtual disk fails. Because the virtual disk is redundant (uses mirrored or parity information) and only one physical disk has failed, the virtual disk can be rebuilt.
Action 1: Configure a hot spare for the virtual disk if one is not already configured. Rebuild the virtual disk. When using an Expandable RAID Controller (PERC) PERC 3/SC, 3/DCL, 3/DC, 3/QC, 4/SC, 4/DC, 4e/DC, 4/Di, CERC ATA100/4ch, PERC 5/E, PERC 5/i, or a Serial Attached SCSI (SAS) 5/iR controller, rebuild the virtual disk by first configuring a hot spare for the disk, and then initiating a write operation to the disk. The write operation will initiate a rebuild of the disk.
Cause 2: A physical disk in the disk group has been removed.
Action 2: If a physical disk was removed
Related Alert Information: Clear Alert Number: None. Related Alert Number: 2048, 2049, 2050, 2076, 2079, 2081, 2123, 2129, 3346. LRA Number: 2080.
SNMP Trap Number: 1203.
176. unctured by the controller, 174
A consistency check on a virtual disk has been paused (suspended), 114
A consistency check on a virtual disk has been resumed, 115
A controller hot plug has been detected, 198
A controller rescan has been initiated, 162
A dedicated hot spare failed, 152
A dedicated hot spare has been automatically unassigned, 152
A dedicated hot spare has been removed, 152
A device has been inserted, 183
A device has been removed, 183
A device is in an unknown state, 171
A device is missing, 170
A disk media error has been corrected, 177
A disk media error was corrected during recovery, 179
A foreign configuration has been cleared, 163
A foreign configuration has been detected, 196
A foreign configuration has been imported, 163
A global hot spare failed, 151
A global hot spare has been removed, 151
A global rescan has initiated, 169
A Learn cycle start is pending while the battery charges, 179
A mirrored virtual disk has been unmirrored, 116
A physical disk is incompatible, 189
A physical disk is marked as missing, 207
A physical disk that was marked as missing has been replaced, 207
A power supply in the enclosure has a DC failure, 191
A power supply in the enclosure has an AC failure, 190
A previously scheduled system BIOS update has been canceled, 17
A redundant path has been restored, 179
A redundant path is broken, 1
177. upported.
Action: Replace the physical disk with a physical disk that is supported.
Related Alert Information: Related Alert Number: None. LRA Number: 2070.
Event ID 2360. Description: A user has discarded data from the controller cache. Severity: OK / Normal.
Cause: This alert is for informational purposes.
Action: None.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: None.
SNMP Trap Number: 751.
Event ID 2361. Description: Physical disk(s) that are part of a virtual disk have been removed while the system was shut down. This removal was discovered during system startup. Severity: OK / Normal.
Cause: This alert is for informational purposes.
Action: None.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: None.
SNMP Trap Number: 751.
Table 4-4. Storage Management Messages (continued)
Event ID 2362. Description: Physical disk(s) have been removed from a virtual disk. The virtual disk will be in Failed state during the next system reboot. Severity: OK / Normal.
Cause: This alert is for informational purposes.
Action: None.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: None.
SNMP Trap Number: 751.
Event ID 2364. Description: All virtual disks are missing from the controller. This situation was discovered during system start-up. Severity: OK / Normal.
Cause: This alert is for informational purposes.
Action: None.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: None.
SNMP Trap Number: 751.
Event ID 2366. Description: Dedicated. Severity: OK.
Cause: This alert is for Cle
178. us state, and temperature sensor value or state.
Table 3-1. Temperature Sensor Events
Event Message: <Sensor Name/Location> temperature sensor detected a failure <Reading>, where <Sensor Name/Location> is the entity that this sensor is monitoring (for example, PROC Temp or Planar Temp). Reading is specified in degrees Celsius (for example, 100 C).
Severity: Critical.
Cause: Temperature of the backplane board, system board, or the carrier in the specified system <Sensor Name/Location> exceeded the critical threshold.
Event Message: <Sensor Name/Location> temperature sensor detected a warning <Reading>.
Severity: Warning.
Cause: Temperature of the backplane board, system board, or the carrier in the specified system <Sensor Name/Location> exceeded the non-critical threshold.
Table 3-1. Temperature Sensor Events (continued)
Event Message: <Sensor Name/Location> temperature sensor returned to warning state <Reading>.
Severity: Warning.
Cause: Temperature of the backplane board, system board, or the carrier in the specified system <Sensor Name/Location> returned from critical state to non-critical state.
Event Message: <Sensor Name/Location> temperature sensor returned to normal state <Reading>.
Severity: Information.
Cause: Temperature of the backplane board, system board, or the carrier in the specified system <Sensor Name/Locatio
179. Description (continued): has been scheduled for the next reboot.
Cause (continued): user has chosen to update the flash basic input/output system (BIOS).
Event ID 1003. Severity: Information.
Description: A previously scheduled system BIOS update has been canceled.
Cause: The user decides to cancel the flash BIOS update, or an error occurs during the flash.
Table 2-1. Miscellaneous Messages (continued)
Event ID 1004. Severity: Error.
Description: Thermal shutdown protection has been initiated.
Cause: This message is generated when a system is configured for thermal shutdown due to an error event. If a temperature sensor reading exceeds the error threshold for which the system is configured, the operating system shuts down and the system powers off. This event may also be initiated on certain systems when a fan enclosure is removed from the system for an extended period of time.
Event ID 1005. Severity: Warning.
Description: SMBIOS data is absent.
Cause: The system does not contain the required systems management BIOS version 2.2 or higher, or the BIOS is corrupted.
Event ID 1006. Severity: Error.
Description: Automatic System Recovery (ASR) action was performed. Action performed was: <Action>. Date and time of action: <Date and time>.
Cause: This message is generated when an automatic system recovery action is performed due to a hung operating system. The action performed and the time of action are provided.
Event ID 1007. Severity: Information.
Cause: User requested a host system control action to reboot
Description: User initiated host system control action. Action requested i
180. value are provided. If sensor type is not discrete: Voltage sensor value in Volts: <Reading>. If sensor type is discrete: Discrete voltage state: <State>.
Current Sensor Messages
Current sensors listed in Table 2-5 measure the amount of current (in amperes) that is traversing critical components. Current sensor messages provide status and warning information for current sensors in a particular chassis.
Table 2-5. Current Sensor Messages
Event ID 1200. Severity: Information.
Description: Current sensor has failed. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <State>. If sensor type is not discrete: Current sensor value in Amps: <Reading> OR Current sensor value in Watts: <Reading>. If sensor type is discrete: Discrete current state: <State>.
Cause: A current sensor in the specified system failed. The sensor location, chassis location, previous state, and current sensor value are provided.
Table 2-5. Current Sensor Messages (continued)
Event ID 1201. Severity: Information.
Cause: A current sensor in the specified system could not obtain a reading. The sensor location, chassis
Description: Current sensor value unknown. Sensor location: <Location in chassis>. Chassis location: <Name of chassis>. Previous state was: <Sta
181. Cause (continued): ve failure is corrected. Event message (continued): was deasserted.
Event Message: Drive <Drive> hot spare was asserted. Severity: Warning. Cause: This event is generated when the drive is placed in a hot spare.
Event Message: Drive <Drive> hot spare was deasserted. Severity: Informational. Cause: This event is generated when the drive is taken out of hot spare.
Event Message: Drive <Drive> consistency check in progress was asserted. Severity: Warning. Cause: This event is generated when the drive is placed in consistency check.
Event Message: Drive <Drive> consistency check in progress was deasserted. Severity: Informational. Cause: This event is generated when the consistency check of the drive is completed.
Table 3-10. Drive Events (continued)
Event Message: Drive <Drive> in critical array was asserted. Severity: Critical. Cause: This event is generated when the drive is placed in critical array.
Event Message: Drive <Drive> in critical array was deasserted. Severity: Informational. Cause: This event is generated when the drive is removed from critical array.
Event Message: Drive <Drive> in failed array was asserted. Severity: Critical. Cause: This event is generated when the drive is placed in the fail array.
Event Message: Drive <Drive> in failed array was deasserted. Severity: Informational. Cause: This event is generated when the drive is removed from the fail array.
Event Message: Drive <Drive> rebuild in progress was asserted. Severity: Informational. Cause: This event is generated when the drive is rebuilding.
Event Message: Drive <Drive>. Severity: Warning. Cause: This event is generated when the re
182. ve incompatible physical drive.
Action: None.
Related Alert Information: Clear Alert Number: None. LRA Alert Number: None.
Table 4-4. Storage Management Messages (continued)
Event ID 2379. Description: An overflow of the foreign configuration has occurred. You can import the foreign configuration in multiple attempts. Severity: OK / Normal.
Cause: This alert is provided for informational purposes.
Action: None.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Alert Number: None.
SNMP Trap Number: 751.
Event ID 2380. Description: Foreign configuration has been partially imported. Some configuration failed to import. Severity: OK / Normal.
Cause: This alert is provided for informational purposes.
Action: None.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Alert Number: None.
SNMP Trap Number: 751.
Event ID 2381. Description: Controller preserved cache is recovered. Severity: Informational.
Cause: This alert is provided for informational purposes.
Action: None.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Alert Number: None.
SNMP Trap Number: 751.
Index
Symbols
1 Storage Management has lost communication with this RAID controller and attached storage. An immediate reboot is strongly recommended to avoid further problems. If the reboot does not restore communication, there may be a hardware failure, 172
Numerics
0001, 17
1000, 17
1001, 17
1002, 17
1003, 17
1004, 18
183. vents, 57
Temperature sensor has failed, 20, 57
temperature sensor messages, 19, 57
Temperature sensor returned to a normal value, 21, 58
Temperature sensor value unknown, 20, 57
The AC power supply cable has been removed, 195
The background initialization (BGI) rate has changed, 161
The battery charge cycle is complete, 209
The BGI completed with uncorrectable errors, 203
The Check Consistency found inconsistent parity data. Data redundancy may be lost, 204
The Check Consistency logging of inconsistent parity data is disabled, 204
The Check Consistency made corrections and completed, 203
The Check Consistency rate has changed, 162
The Clear operation has cancelled, 167
The controller alarm is silenced, 161
The controller battery charge level is below a normal threshold, 176
The controller battery charge level is normal, 137
The controller battery charge level is operating within normal limits, 176
The controller battery has been removed, 140
The controller battery has been replaced, 140
The controller battery is charging, 165
The controller battery is degraded, 165
The controller battery is executing a Learn cycle, 165
The controller battery Learn cycle has been postponed, 142
The controller battery Learn cycle has completed, 141
The controller battery Learn cycle has started, 141
The controller battery Learn cycle has ti
184. wer cord sensor in the specified system failed. The AC power cord status cannot be monitored. The sensor location and chassis location information are provided.
Hardware Log Sensor Messages
Hardware logs provide hardware status messages to systems management software. On certain systems, the hardware log is implemented as a circular queue. When the log becomes full, the oldest status messages are overwritten when new status messages are logged. On some systems, the log is not circular. On these systems, when the log becomes full, subsequent hardware status messages are lost. Hardware log sensor messages listed in Table 2-13 provide status and warning information about the noncircular logs that may fill up, resulting in lost status messages.
Table 2-13. Hardware Log Sensor Messages
Event ID 1550. Severity: Information.
Description: Log monitoring has been disabled. Log type: <Log type>.
Cause: A hardware log sensor in the specified system is disabled. The log type information is provided.
Event ID 1551. Severity: Information.
Description: Log status is unknown. Log type: <Log type>.
Cause: A hardware log sensor in the specified system could not obtain a reading. The log type information is provided.
Event ID 1552. Severity: Information.
Description: Log size is no longer near or at capacity. Log type: <Log type>.
Cause: The hardware log on the specified system is no longer near or at its capacity, usually as the result of clearing the log. The log type information is
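The difference between the two log behaviors described in this excerpt can be sketched as follows. This is only an illustration of the general idea (overwrite the oldest entry versus drop the newest one); the class names `CircularLog` and `NonCircularLog` are hypothetical and do not correspond to any Server Administrator interface.

```python
from collections import deque


class CircularLog:
    """Hypothetical circular log: when full, the oldest message is overwritten."""

    def __init__(self, capacity: int) -> None:
        # A deque with maxlen silently discards the oldest item when full.
        self.entries = deque(maxlen=capacity)

    def append(self, message: str) -> bool:
        self.entries.append(message)
        return True  # the new message is always stored


class NonCircularLog:
    """Hypothetical noncircular log: when full, new messages are lost."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self.entries: list[str] = []

    def append(self, message: str) -> bool:
        if len(self.entries) >= self.capacity:
            return False  # log is full, so this message is dropped
        self.entries.append(message)
        return True


circular = CircularLog(capacity=3)
noncircular = NonCircularLog(capacity=3)
for i in range(5):
    circular.append(f"m{i}")
    noncircular.append(f"m{i}")

print(list(circular.entries))  # newest three messages are kept
print(noncircular.entries)     # oldest three messages are kept
```

The sketch shows why only the noncircular case needs capacity warnings: once such a log fills up, every later status message is lost until the log is cleared.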
185. y and replace the failed component. To identify the failed component, select the enclosure in the tree view and click the Health subtab. Any failed component will be identified with a red X on the enclosure's Health subtab. Alternatively, you can select the Storage object and click the Health subtab.
Event ID 2122 (contd.): The controller status displayed on the Health subtab indicates whether a controller has a failed or degraded component. See the enclosure documentation for information on replacing enclosure components and for other diagnostic information.
Table 4-4. Storage Management Messages (continued)
Event ID 2123. Description: Redundancy lost. Severity: Warning / Non-critical.
Cause: A virtual disk or an enclosure has lost data redundancy. In the case of a virtual disk, one or more physical disks included in the virtual disk have failed. Due to the failed physical disk or disks, the virtual disk is no longer maintaining redundant (mirrored or parity) data. The failure of an additional physical disk will result in lost data. In the case of an enclosure, more than one enclosure component has failed. For example, the enclosure may have suffered the loss of all fans or all power supplies.
Action: Identify and replace the failed components. To identify the failed component, sele
186. y pack.
Event ID 2279. Description: The controller battery charge level is operating within normal limits. Severity: OK / Normal.
Cause: This alert is provided for informational purposes. This alert indicates that the battery is recharging during the battery Learn cycle.
Action: None.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: None.
SNMP Trap Number: 1151.
Table 4-4. Storage Management Messages (continued)
Event ID 2280. Description: A disk media error has been corrected. Severity: OK / Normal.
Cause: A disk media error was detected while the controller was completing a background task. A bad disk block was identified. The disk block has been remapped.
Action: Consider replacing the disk. If you receive this alert frequently, be sure to replace the disk. You should also routinely back up your data.
Related Alert Information: Clear Alert Number: None. Related Alert Number: None. LRA Number: None.
SNMP Trap Number: 1201.
Event ID 2281. Description: Virtual disk has inconsistent data. Severity: OK / Normal.
Cause: This alert is for informational purposes.
Action: None.
Related Alert Information: Clear Alert Number: None. Related Alert Number: 2127. LRA Number: None.
SNMP Trap Number: 1201.
Table 4-4. Storage Management Messages (continued)
Event ID 2282. Description: Hot spare. Severity: Critical.
Cause: The controller