Home
Sun Fire T1000 Server Service Manual
Contents
1. 3 30 Sun Fire T1000 Server Service Manual January 2007 Oo OF 0 gt CPU 0 gt I Cache Tag RAM 0 gt FSR Read Write 0 gt Address Bitwalk 0 gt Testing Memory Channel 0 Rank 0 Stack 0 0 gt Testing Memory Channel 3 Rank 0 Stack 0 0 gt Testing Memory Channel 0 Rank 0 Stack 1 0 gt Testing Memory Channel 3 Rank 0 Stack 1 0 gt Test Slave Threads Basic 0 gt Set Mailbox 0 gt Setup Final DMMU Entries 0 gt Post Image Region Scrub 0 gt Run POST from Memory 0 gt Verifying checksum on copied image 0 gt The Memory s CHECKSUM value is ccle 0 gt The Memory s Content Size value is 7b192 0 gt Success Checksum on Memory Validated 0 gt L2 Cache Ram Test 0 gt Enable L2 Cache 0 gt L2 Scrub Data 0 gt L2 Enable 0 gt CPU 0 0 0 gt Test slave strand registers 0 gt Extended CPU Tests 0 gt Scrub Icache 0 gt Scrub Dcache 0 gt D Cache Tags 0 gt I Cache RAM Test 0 gt FPU Registers and Data Path 0 gt FPU Move Registers 0 gt FPU Branch Instructions Chapter 3 Server Diagnostics 3 31 0 gt Enable Icache 0 gt Enable Dcache 0 gt Scrub Memory Or Or Or OO 0 gt Scrub Memory 0 0 gt Scrub 00000000 00600000 gt 00000001 00000000 on Memory Channel 0 3 Rank 0 Stack 0 0 0 gt Scrub 00000001 00000000 gt 00000002
2. Reset the system so that POST runs There are several ways to initiate a reset The following example uses the powercycle command For other methods refer to the Sun Fire T1000 Server Administration Guide sc gt powercycle Are you sure you want to powercycle the system y n y Powering host off at MON JAN 10 02 52 02 2000 Waiting for host to Power Off hit any key to abort SC Alert SC Request to Power Off Host SC Alert Host system has shut down Powering host on at MON JAN 10 02 52 13 2000 SC Alert SC Request to Power On Host 3 28 Sun Fire T1000 Server Service Manual January 2007 4 Switch to the system console to view the POST output sc gt console Example of POST output SC Alert Host system has resetl Note Some output omitted 0 0 gt 0 0 gt ERIE Integrated POST 4 x 0 build_17 2005 08 30 11 25 export common source firmware_re ontario fireball_fio build_17 post Niagara erie integrated firmware_re 0 0 gt Copyright 2005 Sun Microsystems Inc All rights reserved SUN PROPRIETARY CONFIDENTIAL Use is subject to license terms 0 0 gt VBSC selecting POST IO Testing 0 0 gt VBSC enabling threads 1 0 0 gt VBSC setting verbosity level 3 0 0 gt Start Selftest 0 0 gt Init CPU 0 0 gt Master CPU Tests Basic 0 0 gt CPU 0 0 0 gt DMMU Registers Access 0 0 gt IMMU Registers Access 0 0 gt Init mmu regs 0 0 gt D Cache RAM 0 0 gt DMMU TLB DATA RA
3. Earlier versions of firmware have max as the default setting for the POST diag_level variable To set the default to min use the ALOM CMT command setsc diag_level min 3 4 2 Changing POST Parameters 1 Access the ALOM CMT sc gt prompt At the console issue the key sequence 2 Use the ALOM CMT sc gt prompt to change the POST parameters Refer to TABLE 3 5 for a list of ALOM CMT POST parameters and their values The setkeyswitch parameter sets the virtual keyswitch so it does not use the setsc command For example to change the POST parameters using the setkeyswitch command enter the following sc gt setkeyswitch diag 3 26 Sun Fire T1000 Server Service Manual January 2007 3 4 3 3 4 3 1 To change the POST parameters using the setsc command you must first set the setkeyswitch parameter to normal then you can change the POST parameters using the setsc command sc gt setkeyswitch normal sc gt setsc value Example sc gt setkeyswitch normal sc gt setsc diag mode service Reasons to Run POST You can use POST for basic hardware verification and diagnosis and for troubleshooting as described in the following sections Verifying Hardware Functionality POST tests critical hardware components to verify functionality before the system boots and accesses software If POST detects an error the faulty component is disabled automatically preventing
4. 1 Perform the procedures described in Chapter 4 2 Locate the DIMM that you want to remove Use FIGURE 5 11 and TABLE 5 1 to identify the DIMM that you want to remove 5 14 Sun Fire T1000 Server Service Manual January 2007 Front Back gt J0501 J0701 J0601 J0801 Channel 0 E ES SR DIMM DIMM 1 E RE O j Rank 0 IMM 0 EES OOO Channel 3 CY IMM 1 eee EN Rank 1 ET Banko NA enr E DIMM 0 ES DIMM 1 Tl J1301 J1101 J1201 J1001 FIGURE 5 11 DIMM Locations TABLE 5 1 maps the DIMM names that are displayed in faults to the socket numbers that identify the location of the DIMM on the motherboard The Channel Rank DIMM locations for example CHO RO D0 are silkscreened on the board and on a label near the board TABLE 5 1 DIMM Names and Socket Numbers Socket Number DIMM Name Used in Messages J0501 CH0 RO0 D0 J0601 CH0 RO D1 J0701 CH0 R1 D0 J0801 CH0 R1 D1 J1001 CH3 R0 D0 J1101 CH3 R0 D1 J1201 CH3 R1 D0 J1301 CH3 R1 D1 DIMM names in messages are displayed with the full name such as MB CMP0 CH0 R1 D1 but this table lists the DIMM name with the preceding MB CMP0 omitted for clarity Note the DIMM location so that you can install the replacement DIMM in the same socket Push down on the ejector levers on each side of the DIMM until the DIMM is released Chapter 5 Replacing Field Replaceable Units 5 1
5. To switch from the sc gt prompt to the console type console 3 3153 Service Related ALOM CMT Commands TABLE 3 4 describes the typical ALOM CMT commands for servicing the server For descriptions of all ALOM CMT commands issue the help command or refer to the Advanced Lights Out Management ALOM CMT Guide TABLE 3 4 Service Related ALOM CMT Commands ALOM CMT Command Description help command break y c clearfault UUID console f consolehistory b lines e lines v g lines boot run bootmode normal reset_nvram bootscript string powercycle f Displays a list of all ALOM CMT commands with syntax and descriptions Specifying a command name as an option displays help for that command Takes the host server from the OS to either kmdb or OpenBoot PROM equivalent to a Stop A depending on the mode Solaris software was booted The y option skips the confirmation question The c option executes a console command after completion of the break command Manually clears host detected faults The UUID is the unique fault ID of the fault to be cleared Connects you to the host system The option forces the console to have read and write capabilities Displays the contents of the system s console buffer The following options enable you to specify how the output is displayed e g lines specifies the number of lines to display before pausing e e lines displays n lines fr
6. If the fault is a PSH detected fault identify the faulty FRU from the fault message and replace the faulty FRU After the FRU is replaced perform the procedure to clear PSH detected faults POST performs basic tests of the server components and reports faulty FRUs When POST detects a faulty FRU it logs the fault and if possible takes the FRU offline POST detected FRUs display the following text in the fault message FRU_name deemed faulty and disabled In this case replace the FRU and run the procedure to clear POST detected faults The majority of hardware faults are detected by the server s diagnostics In rare cases a problem might require additional troubleshooting If you are unable to determine the cause of the problem contact technical support Section 3 5 Using the Solaris Predictive Self Healing Feature on page 3 39 Chapter 5 Section Replacing Field Replaceable Units on page 5 1 Section 3 5 2 Clearing PSH Detected Faults on page 3 44 Section 3 4 Running POST on page 3 22 Chapter 5 Section Replacing Field Replaceable Units on page 5 1 Section 3 4 6 Clearing POST Detected Faults on page 3 38 Section 2 2 Obtaining the Chassis Serial Number on page 2 3 ol 3 6 Memory Configuration and Fault Handling A variety of features play a role in how the memory subsystem is configured and how memory faults are handled Understanding the unde
7. Replacing the Motherboard and Chassis Removing the Motherboard and Chassis The motherboard and chassis are replaced as a unit Therefore you must remove all FRUs and associated cables from your chassis and install them in the new chassis Perform the procedures described in Chapter 4 Remove the PCI Express card See Section 5 1 Replacing the Optional PCI Express Card on page 5 2 Remove the fan tray assembly and cable See Section 5 2 Replacing the Fan Tray Assembly on page 5 4 Remove the power supply and cable See Section 5 3 Replacing the Power Supply on page 5 5 Remove the hard drive and cable See Section 5 4 Replacing a Hard Drive on page 5 7 Remove all DIMMs from the motherboard assembly See Section 5 5 Replacing DIMMs on page 5 14 Remove the socketed system configuration SEEPROM from the motherboard and place it on an antistatic mat The system configuration SEEPROM contains the persistent storage for the host ID and Ethernet MAC addresses of the system as well as the ALOM configuration including the IP addresses and ALOM user accounts if configured This information will be lost unless the system configuration SEEPROM is removed and installed in the replacement motherboard The PROM does not hold the fault data and this data is no longer accessible once the motherboard and chassis assembly is replaced The location of this SEEPROM is shown in Append
8. about 3 39 clearing faults 3 44 memory faults and 3 8 Sun URL 3 40 PSH detected faults 3 16 PSH see also Predictive Self Healing PSH 3 39 R removing clock battery 5 22 DIMMs 5 14 5 20 fan tray assembly 5 4 hard drive 5 8 5 10 motherboard and chassis 5 20 PCI Express card 5 2 power supply 5 5 top cover 4 5 removing the server from the rack 4 3 required tools 4 2 reset command 3 15 resetsc command 3 15 S safety information 1 1 safety symbols 1 1 Service Required LED 3 12 3 39 setkeyswitch parameter 3 15 3 26 setlocator command 3 15 showcomponent command 3 47 showenvironment command 3 15 3 17 showfaults command 3 4 description and examples 3 16 syntax 3 15 troubleshooting with 3 5 showfru command 3 15 3 19 showkeyswitch command 3 15 showlocator command 3 15 showlogs command 3 15 showplatform command 3 15 Solaris log files 3 4 Solaris OS collecting diagnostic information from 3 45 Solaris Predictive Self Healing PSH detected faults 3 4 SunVTS 3 2 3 4 exercising the system with 3 50 running 3 51 tests 3 53 user interfaces 3 50 support obtaining 3 5 syslogd daemon 3 46 system console switching to 3 14 system temperatures displaying 3 17 T top cover installing 6 1 removing 4 5 troubleshooting actions 3 4 DIMMs 3 8 U UltraSPARC T1 multicore processor 3 40 Universal Unique Identifier UUID 3 39 V virtual keyswitch 3 26 voltage and curr
9. POST displays the fault with the device name of the faulty DIMMS logs the fault and disables the faulty DIMMs by placing them in the ASR blacklist For a given memory fault POST disables half of the physical memory in the system When this offlining process occurs in normal operation you must replace the faulty DIMMs based on the fault message and enable the disabled DIMMs with the ALOM CMT enablecomponent command In other than normal operation POST can be configured to run various levels of testing see TABLE 3 5 and TABLE 3 6 and can thoroughly test the memory subsystem based on the purpose of the test However with thorough testing enabled diag_level max POST finds faults and offlines memory devices with errors that could be correctable with PSH Thus not all memory devices detected and offlined by POST need to be replaced See Section 3 4 5 Correctable Errors Detected by POST on page 3 35 Chapter 3 Server Diagnostics 3 7 3 1 1 3 m Solaris Predictive Self Healing PSH technology A feature of the Solaris OS uses the fault manager daemon fmd to watch for various kinds of faults When a fault occurs the fault is assigned a unique fault ID UUID and logged PSH reports the fault and provides a recommended proactive replacement for the DIMMs associated with the fault Troubleshooting Memory Faults If you suspect that the server has a memory problem follow the flow chart see TABLE 3 1 Run the ALOM CMT showfau
10. TABLE 3 6 This parameter overrides all other commands diag The system runs POST based on predetermined settings stby The system cannot power on locked The system can power on and run POST but no flash updates can be made diag_mode off POST does not run normal Runs POST according to diag_level value service Runs POST with preset values for diag_level and diag_verbosity diag_level min If diag_mode normal runs minimum set of tests max If diag_mode normal runs all the minimum tests plus extensive CPU and memory tests diag_trigger none Does not run POST on reset diag_verbosity user_reset power_on_reset error_reset all_reset none Runs POST upon user initiated resets Only runs POST for the first power on This option is the default Runs POST if fatal errors are detected Runs POST after any reset No POST output is displayed Chapter 3 Server Diagnostics 3 23 3 24 TABLE 3 5 ALOM CMT Parameters Used for POST Configuration Continued Parameter Values Description min POST output displays functional tests with a banner and pinwheel normal POST output displays all test and informational messages max POST displays all test informational and some debugging messages Sun Fire T1000 Server Service Manual January 2007 diag_mode service user_reset power_on_ reset error_reset diag_trigger System Boot OpenBoot PROM Service Mode Forces a Sun prescribed
11. displays a UUID continue to Step 8 For example sc gt showfaults v ID Time FRU Fault 0 SEP 09 11 09 26 MB CMP0 CH0O RO DO Host detected fault MSGID SUN4V 8000 DX UUID 92e9fbe 735e c218 cf 87 9e1720a28004 m If the fault resulted in the FRU being disabled such as the following sc gt showfaults v ID Time FRU Fault 1 OCT 13 12 47 27 MB CMP0 CHO RO DO MB CMP0 CH0 RO DO0 deemed faulty and disabled then run the enablecomponent command to enable the FRU sc gt enablecomponent MB CMP0 CH0 R0 D0 8 Perform the following steps to verify the repair a Set the virtual keyswitch to diag so that POST will run in Service mode sc gt setkeyswitch diag Chapter 5 Replacing Field Replaceable Units 5 17 5 18 b Issue the poweron command sc gt poweron c Switch to the system console to view the POST output sc gt console Watch the POST output for possible fault messages The following output is a sign that POST did not detect any faults 0 gt POST Passed all devices 0 gt 0 gt DEMON Diagnostics Engineering MONitor 0 gt Select one of the following functions 0 gt POST Return to OBP 0 gt INFO 0 gt POST Passed all devices 0 gt Master set ACK for vbsc runpost command and spin OOO 1 Note Depending on the configuration of ALOM CMT POST variables and whether POST detected faults or not the system mi
12. init test test test test test interrupt test Config MB bridges 0 gt Config port B bus 2 dev 0 func 0 0 gt Config port B bus 3 dev 8 func 0 PCI id test tag 5714 BRIDGE tag PCIX BRIDGE read passed for MB IOB_PCIEb BRIDGE Last read read passed for MB IOB_PCIEb BRIDGE GBE Last read read passed for MB IOB_PCIEb BRIDGE HBA Last read 0 gt POST Return to VBSC 0 gt POST Passed all devices 0 gt Quick jbus loopback Test 262144 bytes at 00000000 00600000 0 gt Master set ACK for vbsc runpost command and spin 5 Perform further investigation if needed a If no faults were detected the system will boot m If POST detects a faulty device the fault is displayed and the fault information is passed to ALOM CMT for fault handling Faulty FRUs are identified in fault messages using the FRU name For a list of FRU names see Appendix A Chapter 3 Server Diagnostics 3 33 3 34 a Interpret the POST messages POST error messages use the following syntax c s gt ERROR TEST failing test c is gt H W under test FRU c s gt Repair Instructions under test above c s gt MSG test error message c s gt END_ERROR Replace items in order listed by H W In this syntax c the core number and s the strand number Warning and informational messages use the following syntax INFO or WARNING message The following example shows a POST error messa
13. Caution The system supplies 3 3 Vdc standby power to the circuit boards even when the system is powered off if the AC power cord is plugged in Press the cover release button FIGURE 4 3 While pressing the release button grasp the rear of the cover and slide the cover toward the rear of the server about one half inch 1 2 cm 3 Lift the cover off the chassis Chapter 4 Preparing for Servicing 4 5 Cover release button Top cover FIGURE 4 3 Location of Top Cover Release Button 4 6 Sun Fire T1000 Server Service Manual January 2007 CHAPTER 5 Replacing Field Replaceable Units This chapter describes how to remove and replace customer replaceable field replaceable units FRUs in the server The following topics are covered Section 5 1 Replacing the Optional PCI Express Card on page 5 2 Section 5 2 Replacing the Fan Tray Assembly on page 5 4 Section 5 3 Replacing the Power Supply on page 5 5 Section 5 4 Replacing a Hard Drive on page 5 7 Section 5 5 Replacing DIMMs on page 5 14 Section 5 6 Replacing the Motherboard and Chassis on page 5 20 Section 5 7 Replacing the Clock Battery on page 5 22 For a list of FRUs see Appendix A Note Never attempt to run the system with the cover removed The cover must be in place for proper air flow The cover interlock switch immediately shuts the system down when the cover is removed 5 1 5 1 Repl
14. Host detected fault MSGID SUN4V 8000 DX Note The Service Required LED is also turns on for PSH diagnosed faults Using the fmdump Command to Identify Faults The fmdump command displays the list of faults detected by the Solaris PSH facility and identifies the faulty FRU for a particular EVENT_ID UUID Do not use fmdump to verify a FRU replacement has cleared a fault because the output of fmdump is the same after the FRU has been replaced Use the fmadm faulty command to verify the fault has cleared Note Faults detected by the Solaris PSH facility are also reported through ALOM CMT alerts In addition to the PSH mdump command the ALOM CMT showfaults command provides information about faults and displays fault UUIDs See Section 3 3 2 Running the showfaults Command on page 3 16 1 Check the event log using the fmdump command with v for verbose output fmdump v TIME UUID SUNW MSG ID Sep 14 10 09 46 2234 92e9fbe 735e c218 cf 87 9e1720a28004 SUN4V 8000 DX 95 fault memory dimm FRU mem component MB CMP0 CH0 R0 D0 J0601 rsrc mem component MB CMP0 CH0 R0 D0 J0601 In this example a fault is displayed indicating the following details m Date and time of the fault Sep 14 10 09 a Universal Unique Identifier UUID that is unique for every fault 92e9fbe 735e c218 cf87 9e1720a28004 m Sun message identifier SUN4V 8000 Dx that can be used to obtain additional fault inf
15. LEDs indicate that there is activity on the associated nets Indicates that the server is linked to the associated nets Indicates that there is activity on the SC Network Management port Indicates that the server is linked to the SC network management port 3 10 Sun Fire T1000 Server Service Manual January 2007 cae Power Supply LEDs The power supply LEDs TABLE 3 3 are located on the back of the power supply TABLE 3 3 Power Supply LEDs Name Color Description Fault Amber On Power supply has detected a failure e Off Normal operation DC OK Green On Normal operation DC output voltage is within normal limits e Off Power is off AC OK Green On Normal operation Input power is within normal limits Off No input voltage or input voltage is below limits 3 3 Using ALOM CMT for Diagnosis and Repair Verification The Sun Advanced Lights Out Management ALOM CMT is a system controller in the server that enables you to remotely manage and administer your server ALOM CMT enables you to remotely run diagnostics such as power on self test POST that would otherwise require physical proximity to the server s serial port You can also configure ALOM CMT to send email alerts of hardware failures hardware warnings and other events related to the server or to ALOM CMT The ALOM CMT circuitry runs independently of the server using the server s standby power Therefore ALO
16. MB page 5 14 e 1GB e 2GB 3 Fan tray Section 5 2 A single assembly containing 4 fans FTO assembly Replacing the Fan Tray Assembly on page 5 4 4 Hard drives Section 5 4 One of the following configurations HDDO Replacing a Hard e One SATA disk drive 3 5 inch HDD1 Drive on page 5 7 form factor e Two SAS disk drives 2 5 inch form factor 5 Power supply Section 5 3 The power supply provides 3 3 Vde PSO unit PS Replacing the standby power at 3 3 Amps and 12 Power Supply on Vdc at 25 Amps page 5 5 6 PCI Express Section 5 1 Optional add on express card PCIEO card slot Replacing the Optional PCI Express Card on page 5 2 7 Clock battery Section 5 7 The battery is located on the MB BAT Replacing the motherboard Clock Battery on page 5 22 8 SEEPROM Remove and The socketed SEEPROM contains the MB SCC replace the socketed SEEPROM MAC address and system configuration information Appendix A Field Replaceable Units A 3 A 4 Sun Fire T1000 Server Service Manual January 2007 Index A AC OK LED 3 4 Advanced ECC technology 3 7 Advanced Lights Out Management ALOM CMT connecting to 3 13 diagnosis and repair of server 3 11 POST and 3 23 prompt 3 13 service related commands 3 13 airflow blocked 3 5 ALOM CMT see Advanced Lights Out Management ALOM CMT antistatic mat 1 2 antistatic wrist strap 1 2 ASR blacklist 3 47 3 48 asrkeys 3 47 Automatic System Recovery AS
17. Tables TABLE 3 1 TABLE 3 2 TABLE 3 3 TABLE 3 4 TABLE 3 5 TABLE 3 6 TABLE 3 7 TABLE 3 8 TABLE 5 1 TABLE A 1 Diagnostic Flowchart Actions 3 4 Front and Rear Panel LEDs 3 10 Power Supply LEDs 3 11 Service Related ALOM CMT Commands 3 14 ALOM CMT Parameters Used for POST Configuration 3 23 ALOM CMT Parameters and POST Modes 3 26 ASR Commands 3 47 Useful SunVTS Tests to Run on This Server 3 53 DIMM Names and Socket Numbers 5 15 Server FRU List A 3 xi xii Sun Fire T1000 Server Service Manual January 2007 Preface The Sun Fire T1000 Server Service Manual provides information to aid in troubleshooting problems with and replacing components within the Sun Fire T1000 server This manual is written for technicians service personnel and system administrators who service and repair computer systems The person qualified to use this manual Can open a system chassis identify and replace internal components Understands the Solaris Operating System and the command line interface Has superuser privileges for the system being serviced Understands typical hardware troubleshooting tasks How This Book Is Organized This guide is organized into the following chapters Chapter 1 describes the safety precautions of the server Chapter 2 describes the main features of the server Chapter 3 describes the diagnostics that are available for monitoring and troubleshooting the server Chapter 4 d
18. both mounting brackets and pull the server chassis out until the brackets lock in the open position FIGURE 4 1 Chapter 4 Preparing for Servicing 4 3 FIGURE 4 1 Unlocking a Mounting Bracket 6 Press the gray release tab on both mounting brackets to release the right and left mounting brackets then pull the server chassis out of the rails FIGURE 4 2 The mounting brackets slide approximately 4 in 10 cm farther before disengaging FIGURE 4 2 Location of the Mounting Bracket Release Buttons 7 Set the chassis on a sturdy work surface 4 4 Sun Fire T1000 Server Service Manual January 2007 4 1 4 4 1 5 A Performing Electrostatic Discharge ESD Prevention Measures Prepare an antistatic surface to set parts on during removal and installation Place ESD sensitive components such as the printed circuit boards on an antistatic mat The following items can be used as an antistatic mat m Antistatic bag used to wrap a Sun replacement part m Sun ESD mat part number 250 1088 a Disposable ESD mat shipped with some replacement parts or optional system components Use an antistatic wrist strap Removing the Top Cover Access to all field replaceable units FRUs requires the removal of the top cover Note Never run the system with the top cover removed The top cover must be in place for proper air flow The cover interlock switch immediately shuts the system down when the cover is removed
19. drive drive 0 Install the replacement drive in the lower drive slot in the drive bracket Push the drive firmly toward the front of the drive bracket until the hard drive is completely seated Plug the DRIVE 0 connector on the drive cable into the data power connector on the lower drive Make sure the connector is correctly oriented before plugging it into the data power connector on the drive To replace the upper drive drive 1 Install the replacement drive in the upper drive slot in the drive bracket Push the drive firmly toward the front of the drive bracket until the hard drive is completely seated Plug the DRIVE 1 connector on the drive cable into the data power connector on the upper drive Ensure that the connector is correctly oriented before plugging it into the data power connector on the drive 3 Slide the drive assembly into the chassis until it mates with the front of the chassis FIGURE 5 10 Sun Fire T1000 Server Service Manual January 2007 10 11 12 Fasteners FIGURE 5 10 Installing the Dual Drive Assembly Push the fasteners down to lock the drive assembly into place in the chassis FIGURE 5 10 Redress the cable through the midwall in the chassis Route the drive data cables underneath the power supply cable Plug the power connector on the dual drive cable to the power connector on the motherboard FIGURE 5 8 Plug the data connector marked J5003 on the cable
20. edges Using an Antistatic Wrist Strap Wear an antistatic wrist strap and use an antistatic mat when handling components such as drive assemblies boards or cards When servicing or removing server components attach an antistatic strap to your wrist and then to a metal area on the chassis Do this after you disconnect the power cords from the server Following this practice equalizes the electrical potentials between you and the server Using an Antistatic Mat Place ESD sensitive components such as the motherboard memory and other PCB cards on an antistatic mat 1 2 Sun Fire T1000 Server Service Manual January 2007 CHAPTER 2 Server Overview This chapter provides an overview of the server Topics include m Section 2 1 Server Overview on page 2 1 m Section 2 2 Obtaining the Chassis Serial Number on page 2 3 21 Server Overview The server is a high performance entry level server that is highly scalable and very reliable FIGURE 2 1 FIGURE 2 1 Server 2 1 FIGURE 2 2 shows the major components in the server and FIGURE 2 3 and FIGURE 2 4 show the front and rear panels of the server PCI E slot opening Chassis assembly PCI E riser board UltraSPARC T1 Motherboard mullticore processor Fan tray assembly Power supply Hard drive FIGURE 2 2 Server Components Locator LED button E Service Required LED Power OK LED and Power On Off button FIGURE 2 3 S
21. faulty hardware from potentially harming software In normal operation diag_level min POST runs in mimimum mode by default to test devices required to power on the server Replace any devices POST detects as faulty in minimum mode Run POST in maximum mode diag_level max for all power on or error generated resets and to validate hardware upgrades or repairs With maximum testing enabled POST finds faults and offlines memory devices with errors that could be correctable by PSH Check the POST generated errors with the showfaults v command to verify if memory devices detected by POST can be corrected by PSH or need to be replaced See Section 3 4 5 Correctable Errors Detected by POST on page 3 35 Chapter 3 Server Diagnostics 3 27 3 4 3 2 3 4 4 Diagnosing the System Hardware You can use POST as an initial diagnostic tool for the system hardware In this case configure POST to run in maximum mode diag_mode service setkeyswitch diag diag_level max for thorough test coverage and verbose output Running POST in Maximum Mode This procedure describes how to run POST when you want maximum testing as in the case when you are troubleshooting a server or verifying a hardware upgrade or repair Switch from the system console prompt to the sc gt prompt by issuing the escape sequence ok sc gt Set the virtual keyswitch to diag so that POST will run in Service mode sc gt setkeyswitch diag
22. intervention In most cases ALOM CMT detects the repair and extinguishes the Service Required LED If ALOM CMT does not perform these actions you must perform these tasks manually using the clearfault or enablecomponent commands ALOM CMT can detect the removal of a FRU in many cases even if the FRU is removed while ALOM CMT is powered off This enables ALOM CMT to know that a fault diagnosed to a specific FRU has been repaired The ALOM CMT clearfault command enables you to manually clear certain types of faults without a FRU replacement or if ALOM CMT was unable to automatically detect the FRU replacement Note ALOM CMT does not automatically detect hard drive replacement 3 12 Sun Fire T1000 Server Service Manual January 2007 3 3 1 3 3 1 1 Many environmental faults can automatically recover A temperature that is exceeding a threshold might return to normal limits An unplugged power supply can be plugged in and so on Recovery of environmental faults is automatically detected Recovery events are reported using one of two forms m fru at location is OK m sensor at location is within normal range Environmental faults can be repaired through the removal of the faulty FRU FRU removal is automatically detected by the environmental monitoring and all faults associated with the removed FRU are cleared The message for that case and the alert sent for all FRU removals is fru at location has been removed There is n
23. level of diagnostic execution Overrides user defined settings as if parameters were diag_level max diag_verbosity max diag_trigger all resets User defined settings are not modified Normal Mode Diagnostic execution is enabled User defined settings control test coverage and verbosity via diag_level diag_verbosity diag_trigger FIGURE 3 5 Flowchart of ALOM CMT Variables for POST Configuration Chapter 3 Server Diagnostics 3 25 TABLE 3 6 shows combinations of ALOM CMT variables and associated POST modes TABLE 3 6 ALOM CMT Parameters and POST Modes Parameter Normal Diagnostic No POST Diagnostic Keyswitch Mode Execution Service Mode Diagnostic Preset Default Settings Values diag_mode normal off service normal setkeyswitch normal normal normal diag diag_level min n a max max diag_trigger power on reset none all resets all resets error reset diag_verbosity normal n a max Description of POST This is the default POST POST does not POST runs the execution configuration This run resulting in full spectrum of configuration tests the quick system tests with the system thoroughly and initialization but maximum output suppresses some of the this is not a displayed detailed POST output suggested configuration max POST runs the full spectrum of tests with the maximum output displayed The setkeyswitch parameter when set to diag overrides all the other ALOM CMT POST variables
24. named messages 1 Over a period of time the messages are further rotated to messages 2 and messages 3 and then deleted 1 Log in as superuser 2 Issue the following command more var adm messages 3 If you want to view all logged messages issue the following command more var adm messages 37 Managing Components With Automatic System Recovery Commands The Automatic System Recovery ASR feature enables the server to automatically configure failed components out of operation until they can be replaced In the server the following components are managed by the ASR feature m UltraSPARC T1 processor strands a Memory DIMMS a I O bus 3 46 Sun Fire T1000 Server Service Manual January 2007 Idal The database that contains the list of disabled components is called the ASR blacklist asr db In most cases POST automatically disables a faulty component After the cause of the fault is repaired FRU replacement loose connector reseated and so on you must remove the component from the ASR blacklist The ASR commands TABLE 3 7 enable you to view and manually add or remove components from the ASR blacklist These commands are run from the ALOM CMT sc gt prompt TABLE 3 7 ASR Commands Command Description showcomponent Displays system components and their current state enablecomponent asrkey Removes a component from the asr db blacklist where asrkey is the component to enable di
25. of the chassis engage the release lever to secure the card to the chassis FIGURE 5 1 4 Perform the procedures described in Chapter 6 5 2 Replacing the Fan Tray Assembly 5 2 1 Removing the Fan Tray Assembly 1 Perform the procedures described in Chapter 4 2 Disconnect the fan power cable from the motherboard 3 Push in on the clasps on both sides of the fan assembly FIGURE 5 3 Fan tray assembly FIGURE 5 3 Removing the Fan Tray Assembly 4 Remove the fan assembly from the sheet metal mounting brackets 5 4 Sun Fire T1000 Server Service Manual January 2007 022 Installing the Fan Tray Assembly 1 Unpack the replacement fan tray assembly and place it on an antistatic mat 2 Align the fan tray assembly with the sheet metal mounting brackets and slide it into place until the clasps on each side lock it into place 3 Reconnect the fan power cable to the motherboard 4 Perform the procedures described in Chapter 6 5 3 Replacing the Power Supply 5 3 1 Removing the Power Supply 1 Perform the procedures described in Chapter 4 2 Disconnect the power cable from the motherboard and pull the cable through the midwall 3 Pull the fastener up on the front of the power supply and remove the power supply from the chassis FIGURE 5 4 Chapter 5 Replacing Field Replaceable Units 5 5 Fastener Power supply FIGURE 5 4 Removing the Power Supply 5 3 2 Installing the Power Supply 1 Unpac
26. que ce soit de la part de Fujitsu Limited ou de Sun Microsystems Inc ou des soci t s affili es Ce document et le produit et les technologies qu il d crit peuvent inclure des droits de propri t intellectuelle de parties tierces prot g s par copyright et ou c d s sous licence par des fournisseurs Fujitsu Limited et ou Sun Microsystems Inc y compris des logiciels et des technologies relatives aux polices de caract res Par limites du GPL ou du LGPL une copie du code source r gi par le GPL ou LGPL comme applicable est sur demande vers la fin utilsateur disponible veuillez contacter Fujitsu Limted ou Sun Microsystems Inc Cette distribution peut comprendre des composants d velopp s par des tierces parties Des parties de ce produit pourront tre d riv es des syst mes Berkeley BSD licenci s par l Universit de Californie UNIX est une marque d pos e aux Etats Unis et dans d autres pays et licenci e exclusivement par X Open Company Ltd Sun Sun Microsystems le logo Sun Java Netra Solaris Sun StorEdge SPARC Enterprise docs sun com OpenBoot SunVTS Sun Fire SunSolve CoolThreads J2EE et Sun sont des marques de fabrique ou des marques d pos es de Sun Microsystems Inc aux Etats Unis et dans d autres pays Fujitsu et le logo Fujitsu sont des marques d pos es de Fujitsu Limited Toutes les marques SPARC sont utilis es sous licence et sont des marques de fabrique ou des marques d pos es de SP
27. registered trademarks of SPARC International Inc in the U S and other countries Products bearing SPARC trademarks are based upon architecture developed by Sun Microsystems Inc SPARC64 is a trademark of SPARC International Inc used under license by Fujitsu Microelectronics Inc and Fujitsu Limited The OPEN LOOK and Sun Graphical User Interface was developed by Sun Microsystems Inc for its users and licensees Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry Sun holds a non exclusive license from Xerox to the Xerox Graphical User Interface which license also covers Sun s licensees who implement OPEN LOOK GUIs and otherwise comply with Sun s written license agreements United States Government Rights Commercial use U S Government users are subject to the standard government user license agreements of Sun Microsystems Inc and Fujitsu Limited and the applicable provisions of the FAR and its supplements Disclaimer The only warranties granted by Fujitsu Limited Sun Microsystems Inc or any affiliate of either of them in connection with this document or any product or technology described herein are those expressly set forth in the license agreement pursuant to which the product or technology is provided EXCEPT AS EXPRESSLY SET FORTH IN SUCH AGREEMENT FUJITSU LIMITED SUN MICROSYSTEMS INC AND THEIR AFFILIATES MAKE NO REPRE
28. to the J5003 data connector on the motherboard the connector farthest from the power supply Refer to FIGURE 5 8 for the location of the J5003 data connector Plug the data connector marked J5002 on the cable to the J5002 data connector on the motherboard the connector closest to the power supply Refer to FIGURE 5 8 for the location of the J5002 data connector Perform the procedures described in Chapter 6 Use the Solaris format utility to label the 2 5 inch SAS hard drives Refer to the Labeling Unlabeled Hard Drives document for those instructions Perform the necessary administrative tasks to reconfigure the hard drive The procedures that you perform at this point depend on how your data is configured You might need to partition the drive create file systems load data from backups or have the data updated from a RAID configuration Chapter 5 Replacing Field Replaceable Units 5 13 5 5 Replacing DIMMs 5 5 1 Removing DIMMs Note Not all DIMMs detected as faulty and offlined by POST must be replaced In service maximum mode POST detects memory devices with errors that might be corrected with Solaris PSH See Section 3 4 5 Correctable Errors Detected by POST on page 3 35 static discharges that can cause the component to fail To avoid this problem follow Caution This procedure requires that you handle components that are sensitive to the antistatic practices as described in Chapter 4
29. 00000000 on Memory Channel 0 3 Rank 0 Stack 1 0 0 gt IMMU Functional 0 gt DMMU Functional 0 gt Extended Memory Tests 0 gt Print Mem Config 0 gt Caches Icache is ON Dcache is ON 0 gt Bank 0 4096MB 00000000 00000000 gt 00000001 00000000 0 gt Bank 1 4096MB 00000001 00000000 gt 00000002 00000000 0 gt Block Mem Test 0 gt Test 6291456 bytes at 00000000 00600000 Memory Channel 0 3 Rank 0 Stack 0 0 gt Test 6291456 bytes at 00000001 00000000 Memory Channel 0 3 Rank 0 Stack 1 0 gt T0 Bridge Tests 0 gt T0 Bridge Quick Read 0 gt fire 1 JBUSID 00000080 0 000000 0 gt c000002 e03dda23 0 gt fire 1 JBUSCSR 00000080 0 410000 0 gt 00000f 5 13cb7000 ONE Or Or Or OT OF TION TO O ON TO TOO OO Os OT OT O 7 ON 20 2 3 32 Sun Fire T1000 Server Service Manual January 2007 a T O er Eee oo 0 gt 10 Bridge unit 1 0 gt 10 Bridge unit 1 0 gt 10 Bridge unit 1 0 gt 10 Bridge unit 1 0 gt 10 Bridge unit 1 0 gt 10 Bridge unit 1 0 gt 10 Bridge unit 1 0 gt 10 Bridge unit 1 0 gt 10 Bridge unit 1 0 gt 10 Bridge unit 1 0 0 gt INFO 10 count VID 1166 DID 103 0 0 gt INFO 10 count VID 14e4 DID 1648 0 0 gt INFO 10 count VID 1000 DID 50 0 0 gt Quick JBI Loopback Block Mem Test T G G 0 gt INFO jbus perf test int msi ilu tlu lpu link train port init init init init
30. 1 Removing the Optional PCI Express Card 5 2 5 1 2 Installing the Optional PCI Express Card 5 3 Replacing the Fan Tray Assembly 5 4 5 2 1 Removing the Fan Tray Assembly 5 4 5 2 2 Installing the Fan Tray Assembly 5 5 Replacing the Power Supply 5 5 5 3 1 Removing the Power Supply 5 5 5 3 2 Installing the Power Supply 5 6 Replacing a Hard Drive 5 7 Contents vii 5 4 1 Replacing a Hard Drive in a Single Drive Assembly 5 8 5 4 1 1 Removing the Hard Drive in a Single Drive Assembly 5 8 5 4 1 2 Installing the Hard Drive in a Single Drive Assembly 5 9 5 4 2 Replacing a Hard Drive in a Dual Drive Assembly 5 10 5 4 2 1 Removing a Hard Drive in a Dual Drive Assembly 5 10 5 422 Installing the Hard Drive in a Dual Drive Assembly 5 12 5 5 Replacing DIMMs 5 14 5 5 1 Removing DIMMs 5 14 5 5 2 Installing DIMMs 5 16 5 6 Replacing the Motherboard and Chassis 5 20 5 6 1 Removing the Motherboard and Chassis 5 20 5 6 2 Installing the Motherboard and Chassis 5 20 5 7 Replacing the Clock Battery 5 22 5 7 1 Removing the Clock Battery on the Motherboard 5 22 5 7 2 Installing the Clock Battery on the Motherboard 5 22 6 Finishing Up Servicing 6 1 6 1 Final Service Procedures 6 1 6 11 Replacing the Top Cover 6 1 6 12 Reinstalling the Server Chassis in the Rack 6 1 6 13 Applying Power to the Server 6 2 A Field Replaceable Units A 1 Index Index 1 viii Sun Fire T1000 Server Service Manual January 2007 FIGURE 2 1 FIGURE 2 2 FI
31. 2048 MB SPD Manufacture Location SPD Vendor Infineon formerly Siemens SPD Vendor Part No 72T256220HR3 7A SPD Vendor Serial No d03f623 FRU_PROM at MB CMP0 CH0O R1 D0 SEEPROM SPD Timestamp MON OCT 03 12 00 00 2005 SPD Description DDR2 SDRAM 2048 MB SPD Manufacture Location SPD Vendor Infineon formerly Siemens SPD Vendor Part No 72T256220HR3 7A SPD Vendor Serial No d03fc26 FRU_PROM at MB CMP0 CH0O R1 D1 SEEPROM SPD Timestamp MON OCT 03 12 00 00 2005 SPD Description DDR2 SDRAM 2048 MB SPD Manufacture Location SPD Vendor Infineon formerly Siemens SPD Vendor Part No 72T256220HR3 7A SPD Vendor Serial No d03eb26 FRU_PROM at MB CMP0 CH3 R0 D0 SEEPROM SPD Timestamp MON OCT 03 12 00 00 2005 SPD Description DDR2 SDRAM 2048 MB SPD Manufacture Location SPD Vendor Infineon formerly Siemens SPD Vendor Part No 72T256220HR3 7A SPD Vendor Serial No d03e620 3 20 Sun Fire T1000 Server Service Manual January 2007 FRU_PROM at MB CMP0 CH3 R0 D1 SEEPROM SPI SPI SPI SPI SPI SPI D Timestamp MON OCT 03 12 00 00 2005 D Description DDR2 SDRAM 2048 MB D Manufacture Location D Vendor Infineon formerly Siemens D Vendor Part No 72T256220HR3 7A D Vendor Serial No d040920 FRU_PROM at MB CMP0 CH3 R1 D0 SEEPROM SPI SPI SPI SPI SPI SPI D Timestamp MON OCT 03 12 00 00 2005 D Description DDR2 SDRAM 2048 MB D Manufact
32. 5 5 5 2 Grasp the top corners of the DIMM and remove it from the motherboard Place the DIMM on an antistatic mat Installing DIMMs Use the following guidelines and FIGURE 5 11 and TABLE 5 1 to plan the memory configuration of your server m Eight slots hold industry standard DDR 2 memory DIMMs m The server accepts the following DIMM sizes 512 MB a 1 GB a 2GB a All DIMMs installed must be the same size a DIMMs must be added four at a time m Rank 0 memory must be fully populated for the server to function Unpack the replacement DIMMs and place them on an antistatic mat Ensure that the socket ejector tabs are in the open position Line up the replacement DIMM with the connector Push the DIMM into the socket until the ejector tabs lock the DIMM in place Perform the procedures described in Chapter 6 Note You must replace the top cover as instructed in the Chapter 6 chapter before proceeding with these instructions The top cover must be in place for ALOM CMT to detect that a DIMM has been replaced Gain access to the ALOM sc gt prompt Refer to the Advanced Lights Out Management ALOM CMT Guide for instructions Run the showfaults v command to determine how to clear the fault The method you use to clear a fault depends on how the fault is identified by the showfaults command 5 16 Sun Fire T1000 Server Service Manual January 2007 m If the fault is a host detected fault
33. ARC International Inc aux Etats Unis et dans d autres pays Les produits portant les marques SPARC sont bas s sur une architecture d velopp e par Sun Microsystems Inc SPARC64 est une marques d pos e de SPARC International Inc utilis e sous le permis par Fujitsu Microelectronics Inc et Fujitsu Limited L interface d utilisation graphique OPEN LOOK et Sun a t d velopp e par Sun Microsystems Inc pour ses utilisateurs et licenci s Sun reconna t les efforts de pionniers de Xerox pour la recherche et le d veloppement du concept des interfaces d utilisation visuelle ou graphique pour l industrie de l informatique Sun d tient une license non exclusive de Xerox sur l interface d utilisation graphique Xerox cette licence couvrant galement les licenci s de Sun qui mettent en place l interface d utilisation graphique OPEN LOOK et qui en outre se conforment aux licences crites de Sun Droits du gouvernement am ricain logiciel commercial Les utilisateurs du gouvernement am ricain sont soumis aux contrats de licence standard de Sun Microsystems Inc et de Fujitsu Limited ainsi qu aux clauses applicables stipul es dans le FAR et ses suppl ments Avis de non responsabilit les seules garanties octroy es par Fujitsu Limited Sun Microsystems Inc ou toute soci t affili e de l une ou l autre entit en rapport avec ce document ou tout produit ou toute technologie d crit e dans les pr sentes correspondent aux
34. CMP0 T_BCORE OK 51 10 5 0 85 90 95 MB IOB T_CORE OK 49 10 5 0 95 100 105 Chapter 3 Server Diagnostics 3 17 SYS LOCATE SYS SERVICE SYS ACT OFF OFF ON Fans Speeds Revolution Per Minute Sensor Status Speed Warn Low FTO FO OK 6762 2240 1920 FTO F1 OK 6762 2240 1920 FTO F2 OK 6762 2240 1920 FTO F3 OK 6653 2240 1920 Voltage sensors in Volts Sensor Status Voltage LowSoft LowWarn HighWarn HighSoft MB V_VCORE OK 1 30 20 24 L 36 1 39 MB V_VMEM OK 1 79 69 72 1 87 1 90 MB V_VTT OK 0 89 0 84 0 86 0 93 0 95 MB V_ 1V2 OK 1 18 09 11 1 28 1 30 MB V_ 1V5 OK 1 49 36 39 1 60 1 63 MB V_ 2V5 OK 2 51 2 27 2 32 2 67 2 72 MB V_ 3V3 OK 3 29 3 06 10 3 49 3 53 MB V_ 5V OK 5 02 55 4 65 5 35 5 45 MB V_ 12V OK 12 25 10 92 11 16 12 84 13 08 MB V_ 3V3STBY OK 3433 3 13 3 16 3 53 359 sensor Status Load Warn Shutdown MB I_VCORE OK 20 560 80 000 88 000 MB I_VMEM OK 8 160 60 000 66 000 3 18 Sun Fire T1000 Server Service Manual January 2007 PSO sc gt OK OFF OFF OFF OFF OFF 3 3 4 Note Some environmental information might not be available when the server is in Standby mode Running the showfru Command The showfru command displays information about the FRUs in the server Use this command to see information about an individual FRU or for all the FRUs Note By default the output of the showfru command for all FRUs is very long At the sc gt prompt enter the showfru co
35. Earlier versions of firmware have max as the default setting for the POST diag_level variable To set the default to min use the ALOM CMT command setsc diag level min For validating hardware upgrades or repairs configure POST to run in maximum mode diag_level max Note that with maximum testing enabled POST detects and offlines memory devices with errors that could be correctable by PSH Thus not all memory devices detected by POST need to be replaced See Section 3 4 5 Correctable Errors Detected by POST on page 3 35 Note Devices can be manually enabled or disabled using ASR commands see Section 3 7 Managing Components With Automatic System Recovery Commands on page 3 46 Controlling How POST Runs The server can be configured for normal extensive or no POST execution You can also control the level of tests that run the amount of POST output that is displayed and which reset events trigger POST by using ALOM CMT variables 3 22 Sun Fire T1000 Server Service Manual January 2007 TABLE 3 5 lists the ALOM CMT variables used to configure POST and FIGURE 3 5 shows how the variables work together Note Use the ALOM CMT setsc command to set all the parameters in TABLE 3 5 except setkeyswitch TABLE 3 5 ALOM CMT Parameters Used for POST Configuration Parameter Values Description setkeyswitch normal The system can power on and run POST based on the other parameter settings For details see
36. FIGURE 5 8 Data connector J5002 Data connector J5003 Power connector Z FIGURE 5 8 Location of Drive Power and Data Connectors on the Motherboard 5 10 Sun Fire T1000 Server Service Manual January 2007 3 Pull the fasteners up on the rear of the dual drive assembly and remove the dual drive assembly from the chassis FIGURE 5 9 Fasteners FIGURE 5 9 Removing the Dual Drive Assembly 4 Determine which of the two hard drives you want to remove The upper drive drive 1 is typically the data drive or mirror drive The lower drive drive 0 is typically the boot drive 5 Remove the drive from the drive bracket If you are removing the lower drive you must first remove the upper drive before you can remove the lower drive a b To remove the upper drive drive 1 Disconnect the drive cable from the data power connector on the upper drive Push the drive toward the back of the drive bracket and lift the drive away from the bracket To remove the lower drive drive 0 Disconnect the drive cable from the data power connector on the lower drive Push the drive toward the back of the drive bracket and lift the drive away from the bracket Chapter 5 Replacing Field Replaceable Units 5 11 5 4 2 2 5 12 Installing the Hard Drive in a Dual Drive Assembly 1 Unpack the replacement hard drive 2 Install the replacement drive in the drive bracket a b To replace the lower
37. FRUs are identified in fault messages using the FRU name For a list of FRU names see Appendix A 3 Check the Solaris The Solaris message buffer and log files record Section 3 6 Collecting log files for fault system events and provide information about Information From Solaris information faults OS Files and Commands e If system messages indicate a faulty device on page 3 45 replace the FRU To obtain more diagnostic information go to Chapter 5 Action No 4 4 Run SunVTS SunVTS is an application you can run to exercise Section 3 8 Exercising and diagnose FRUs To run SunVTS the server the System With SunVTS must be running the Solaris OS on page 3 49 If SunVTS reports a faulty device replace the FRU Chapter 5 e If SunVTS does not report a faulty device go to Action No 5 3 4 Sun Fire T1000 Server Service Manual January 2007 TABLE 3 1 Diagnostic Flowchart Actions Continued Action For more information see No Diagnostic Action Resulting Action these sections 5 Run POST POST performs basic tests of the server components Section 3 4 Running and reports faulty FRUs POST on page 3 22 Note diag_level min is the default ALOM CMT setting which tests devices required to boot TABLE 3 5 TABLE 3 6 the server Use diag_level max for troubleshooting and hardware replacement chapiees If POST indicates a faulty FRU while diag_level min replace the FRU e If POST indicates a faulty memory device whi
38. GURE 2 3 FIGURE 2 4 FIGURE 3 1 FIGURE 3 2 FIGURE 3 3 FIGURE 3 4 FIGURE 3 5 FIGURE 3 6 FIGURE 3 7 FIGURE 4 1 FIGURE 4 2 FIGURE 4 3 FIGURE 5 1 FIGURE 5 2 FIGURE 5 3 FIGURE 5 4 FIGURE 5 5 FIGURE 5 6 Figures Server 2 1 Server Components 2 2 Server Front Panel 2 2 Server Rear Panel 2 3 Diagnostic Flowchart 3 3 LEDs on the Server Front Panel 3 8 LEDs on the Server Rear Panel 3 9 ALOM CMT Fault Management 3 12 Flowchart of ALOM CMT Variables for POST Configuration 3 25 SunVTS GUI 3 52 SunVTS Test Selection Panel 3 53 Unlocking a Mounting Bracket 4 4 Location of the Mounting Bracket Release Buttons 4 4 Location of Top Cover Release Button 4 6 Releasing the PCI Express Card Release Lever 5 2 Removing and Installing the PCI Express Card 5 3 Removing the Fan Tray Assembly 5 4 Removing the Power Supply 5 6 Installing the Power Supply 5 7 Removing the Single Drive Assembly 5 8 FIGURE 5 7 FIGURE 5 8 FIGURE 5 9 FIGURE 5 10 FIGURE 5 11 FIGURE 5 12 FIGURE 5 13 FIGURE A 1 Installing the Single Drive Assembly 5 9 Location of Drive Power and Data Connectors on the Motherboard 5 10 Removing the Dual Drive Assembly 5 11 Installing the Dual Drive Assembly 5 13 DIMM Locations 5 15 Removing the Clock Battery From the Motherboard 5 22 Installing the Clock Battery on the Motherboard 5 23 Field Replaceable Units A 2 x Sun Fire T1000 Server Service Manual January 2007
39. M Access 0 0 gt DMMU TLB TAGS Access 0 0 gt DMMU CAM 0 0 gt IMMU TLB DATA RAM Access 0 0 gt IMMU TLB TAGS Access 0 0 gt IMMU CAM 0 0 gt Setup and Enable DMMU 0 0 gt Setup DMMU Miss Handler Chapter 3 Server Diagnostics 3 29 0 gt Niagara Version 2 0 0 gt Serial Number 00000098 00000820 fffff238 6b4c60e9 0 gt Init JBUS Config Regs 0 gt T0 Bridge unit 1 init test 0 gt sys 200 MHz CPU 1000 MHz mem 200 MHz 0 gt Integrated POST Testing 0 gt L2 Tests 0 gt Setup L2 Cache 0 gt L2 Cache Control 00000000 00300000 0 gt Scrub and Setup L2 Cache 0 gt L2 Directory clear 0 gt L2 Scrub VD amp UA 0 gt L2 Scrub Tags 0 gt Test Memory Basic 0 gt Probe and Setup Memory 0 gt INFO 4096MB at Memory Channel 0 3 Rank 0 Stack 0 0 gt INFO 4096MB at Memory Channel 0 3 Rank 0 Stack 1 0 gt INFO No memory detected at Memory Channel 0 3 Rank 1 Stack 0 0 gt INFO No memory detected at Memory Channel 0 3 Rank 1 Stack 1 20 gt 0 gt Data Bitwalk 0 gt L2 Scrub Data 0 gt L2 Enable 0 gt Testing Memory Channel 0 Rank 0 Stack 0 0 gt Testing Memory Channel 3 Rank 0 Stack 0 0 gt Testing Memory Channel 0 Rank 0 Stack 1 0 gt Testing Memory Channel 3 Rank 0 Stack 1 0 gt L2 Directory clear 0 gt L2 Scrub VD amp UA 0 gt L2 Scrub Tags OF OO gt OF TOO Tr Or VO TO TO O7 ON O0 Or OT OO Or 10 2 OO TO TOO O TO OO sO oO 0 gt L2 Disable
40. M CMT firmware and software continue to function when the server operating system goes offline or when the server is powered off Note Refer to the Advanced Lights Out Management ALOM CMT Guide for comprehensive ALOM CMT information Chapter 3 Server Diagnostics 3 11 Faults detected by ALOM CMT POST and the Solaris Predictive Self Healing PSH technology are forwarded to ALOM CMT for fault handling FIGURE 3 4 In the event of a system fault ALOM CMT ensures that the Service Required LED is lit FRU ID PROMs are updated the fault is logged and alerts are displayed Faulty FRUs are identified in fault messages using the FRU name For a list of FRU names see Appendix A Service Required LED FRU LEDs Environmentals st LS gt ALOM POST m fault manager FRUID PROMs Logs SolarisPSH Alerts i FIGURE 3 4 ALOM CMT Fault Management ALOM CMT sends alerts to all ALOM CMT users that are logged in sending the alert through email to a configured email address and writing the event to the ALOM CMT event log ALOM CMT can detect when a fault is no longer present and clears the fault in several ways m Fault recovery The system automatically detects that the fault condition is no longer present ALOM CMT extinguishes the Service Required LED and updates the FRU s PROM indicating that the fault is no longer present a Fault repair The fault has been repaired by human
41. OR at IOBD V_ 1V has exceeded low warning threshold 3 16 Sun Fire T1000 Server Service Manual January 2007 m Example showing a fault that was detected by POST These kinds of faults are identified by the message deemed faulty and disabled and by a FRU name sc gt showfaults v ID Time FRU Fault 1 OCT 13 12 47 27 MB CMP0O CH0 R1 D0 MB CMP0 CH0 R1 D0 deemed faulty and disabled a Example showing a fault that was detected by the PSH technology These kinds of faults are identified by the text Host detected fault and by a UUID sc gt showfaults v ID Time FRU Fault 0 SEP 09 11 09 26 MB CMP0 CH0 R1 D0 Host detected fault MSGID SUN4U 8000 2S UUID 7ee0e46b ea64 6565 e684 e996963F7b86 IRI Running the showenvironment Command The showenvironment command displays a snapshot of the server s environmental status This command displays system temperatures hard disk drive status power supply and fan status front panel LED status voltage and current sensors The output uses a format similar to the Solaris OS command prtdiag 1m At the sc gt prompt type the showenvironment command The output differs according to your system s model and configuration Example sc gt showenvironment System Temperatures Temperatures in Celsius Sensor Status Temp LowHard LowSoft LowWarn HighWarn HighSoft HighHard MB T_AMB OK 28 10 5 0 45 50 55 MB CMP0O T_TCORE OK 50 10 5 0 85 90 95 MB
42. R 3 46 B blacklist ASR 3 47 bootmode command 3 14 break command 3 14 C chipkill 3 7 clearasrdb command 3 47 clearfault command 3 14 3 45 clearing POST detected faults 3 38 clearing PSH detected faults 3 44 clock battery installing 5 22 removing 5 22 components disabled 3 47 3 48 components displaying the state of 3 47 connecting to ALOM CMT 3 13 console 3 14 console command 3 14 3 29 consolehistory command 3 14 D DDR 2 memory DIMMs 3 7 diag_level parameter 3 23 3 26 diag_mode parameter 3 23 3 26 diag_trigger parameter 3 23 3 26 diag_verbosity parameter 3 23 3 26 diagnostics low level 3 22 running remotely 3 11 SunVTS 3 49 DIMMs example POST error output 3 34 installing 5 16 names and socket numbers 5 15 removing 5 14 5 20 troubleshooting 3 8 disablecomponent command 3 47 3 48 disabled component 3 48 displaying FRU status 3 19 dmesg command 3 46 E electrostatic discharge ESD prevention 1 2 Index 1 enablecomponent command 3 39 3 47 3 49 environmental faults 3 4 3 5 3 13 3 16 event log checking the PSH 3 41 exercising the system with SunVTS 3 50 F fan status displaying 3 17 fan tray assembly installing 5 5 removing 5 4 fault manager daemon fmd 1M 3 39 fault message ID 3 16 fault records 3 45 faults 3 12 3 16 environmental 3 4 3 5 recovery 3 12 repair 3 12 types of 3 16 fmadm command 3 45 fmdump command 3 41 front pane
43. RMELLEMENT EXCLUES DANS LA MESURE AUTORISEE PAR LA LOI APPLICABLE Y COMPRIS NOTAMMENT TOUTE GARANTIE IMPLICITE RELATIVE A LA QUALITE MARCHANDE A L APTITUDE A UNE UTILISATION PARTICULIERE OU A L ABSENCE DE CONTREFACON Contents Preface xiii Safety Information 1 1 1 1 1 2 1 3 Safety Information 1 1 Safety Symbols 1 1 Electrostatic Discharge Safety 1 2 13 1 Using an Antistatic Wrist Strap 1 2 1 3 2 Using an Antistatic Mat 1 2 Server Overview 2 1 2 1 2 2 Server Overview 2 1 Obtaining the Chassis Serial Number 2 3 Server Diagnostics 3 1 3 1 3 2 Overview of Server Diagnostics 3 1 3 11 Memory Configuration and Fault Handling 3 6 3 1 1 1 Memory Configuration 3 7 3 1 1 2 Memory Fault Handling 3 7 3 1 1 3 Troubleshooting Memory Faults 3 8 Using LEDs to Identify the State of Devices 3 8 3 2 1 Front and Rear Panel LEDs 3 10 3 22 Power Supply LEDs 3 11 3 3 Using ALOM CMT for Diagnosis and Repair Verification 3 11 3 3 1 Running ALOM CMT Service Related Commands 3 13 3 3 1 1 Connecting to ALOM 3 13 3 3 1 2 Switching Between the System Console and ALOM 3 14 3 3 1 3 Service Related ALOM CMT Commands 3 14 3 3 2 Running the showfaults Command 3 16 3 3 3 Running the showenvironment Command 3 17 3 3 4 Running the showfru Command 3 19 3 4 Running POST 3 22 3 41 Controlling How POST Runs 3 22 3 4 2 Changing POST Parameters 3 26 3 4 3 Reasons to Run POST 3 27 3 4 3 1 Verifying Hardware Functionality 3 27 3 4 3 2 Diag
44. SENTATIONS OR WARRANTIES OF ANY KIND EXPRESS OR IMPLIED REGARDING SUCH PRODUCT OR TECHNOLOGY OR THIS DOCUMENT WHICH ARE ALL PROVIDED AS IS AND ALL EXPRESS OR IMPLIED CONDITIONS REPRESENTATIONS AND WARRANTIES INCLUDING WITHOUT LIMITATION ANY IMPLIED WARRANTY OF MERCHANTABILITY FITNESS FOR A PARTICULAR PURPOSE OR NON INFRINGEMENT ARE DISCLAIMED EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID Unless otherwise expressly set forth in such agreement to the extent allowed by applicable law in no event shall Fujitsu Limited Sun Microsystems Inc or any of their affiliates have any liability to any third party under any legal theory for any loss of revenues or profits loss of use or data or business interruptions or for any indirect special incidental or consequential damages even if advised of the possibility of such damages DOCUMENTATION IS PROVIDED AS IS AND ALL EXPRESS OR IMPLIED CONDITIONS REPRESENTATIONS AND WARRANTIES INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY FITNESS FOR A PARTICULAR PURPOSE OR NON INFRINGEMENT ARE DISCLAIMED EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID 4 Adobe PostScript Copyright 2007 Sun Microsystems Inc 4150 Network Circle Santa Clara California 95054 Etats Unis Tous droits r serv s Entr e et revue tecnical fournies par Fujitsu Limited sur des parties de ce mat riel Sun Microsystems Inc et Fujitsu Limited d tiennent et c
45. Server Service Manual January 2007 FIGURE 5 13 Installing the Clock Battery on the Motherboard 3 Perform the procedures described in Chapter 6 4 Use the ALOM setdate command to set the day and time Use the setdate command before you power on the host system For details about this command refer to the Advanced Lights Out Management ALOM CMT Guide Chapter 5 Replacing Field Replaceable Units 5 23 5 24 Sun Fire T1000 Server Service Manual January 2007 CHAPTER 6 Finishing Up Servicing This chapter describes how to finish up servicing the server The following topics are covered m Section 6 1 1 Replacing the Top Cover on page 6 1 m Section 6 1 2 Reinstalling the Server Chassis in the Rack on page 6 1 m Section 6 1 3 Applying Power to the Server on page 6 2 6 1 1 6 1 2 Final Service Procedures This section provides the finishing tasks in servicing your server Replacing the Top Cover Place the top cover on the chassis Set the cover down so that the cover hangs over the rear of the server by about an inch 2 5 cm Slide the cover forward until it latches into place Reinstalling the Server Chassis in the Rack Refer to the Sun Fire T1000 Server Installation Guide for installation instructions After you have reinstalled the server chassis in the rack reconnect all cables that you disconnected when you removed the chassis from the rack 6 1 6 1 3 Appl
46. Single Drive Assembly 5 8 Sun Fire T1000 Server Service Manual January 2007 5 4 1 2 Installing the Hard Drive in a Single Drive Assembly 1 Unpack the replacement single drive assembly 2 Slide the single drive assembly into the chassis until it mates with the front of the chassis FIGURE 5 7 FIGURE 5 7 Installing the Single Drive Assembly 3 Push the fasteners down to lock the drive assembly into place in the chassis 4 Redress the cable through the midwall in the chassis 5 Reconnect the data cable to the data power connector on the drive FIGURE 5 7 If you have a dual drive cable installed in your system connect the DRIVE 0 connector on the cable to the data power connector at the rear of the drive Do not connect the DRIVE 1 connector on the cable to the data power connector at the rear of the drive in a single drive assembly Chapter 5 Replacing Field Replaceable Units 5 9 6 Perform the procedures described in Chapter 6 7 Perform the necessary administrative tasks to reconfigure the hard drive The procedures that you perform at this point depend on how your data is configured You might need to partition the drive create file systems or load data from backups 5 4 2 Replacing a Hard Drive in a Dual Drive Assembly 5 4 2 1 Removing a Hard Drive in a Dual Drive Assembly 1 Perform the procedures described in Chapter 4 2 Disconnect the drive cable from the data and power connectors on the motherboard
47. acing the Optional PCI Express Card 9 11 Removing the Optional PCI Express Card Use this procedure to remove the optional low profile PCI Express PCI E card from the server 1 Perform the procedures described in Chapter 4 2 Remove any cables that are attached to the card 3 On the rear of the chassis pull the release lever that secures the PCI Express card to the chassis FIGURE 5 1 Release lever PCI E card FIGURE 5 1 Releasing the PCI Express Card Release Lever 5 2 Sun Fire T1000 Server Service Manual January 2007 4 Carefully pull the PCI Express card out of the connector on the PCI Express card riser board and the note slot FIGURE 5 2 Note slot LT E riser board FIGURE 5 2 Removing and Installing the PCI Express Card 5 Place the PCI Express card on an antistatic mat 5 1 2 Installing the Optional PCI Express Card Use this procedure to replace the PCI Express cards 1 Unpack the replacement PCI Express card and place it on an antistatic mat Note Only low profile PCI Express cards with low brackets fit into the chassis There are a variety of PCI Express cards on the market Read the product documentation for your device for additional installation requirements and instructions that are not covered here 2 Insert the PCI Express card into the connector on the PCI Express riser board and the note slot FIGURE 5 2 Chapter 5 Replacing Field Replaceable Units 5 3 3 On the rear
48. ates diagnosis has determined that a memory DIMM is faulty as a result of exceeding the threshold for correctable memory errors Memory pages associated with the correctable errors have been retired and no data has been lost However the system is at increased risk of incurring an uncorrectable error which will cause a service interruption until the memory DIMM module is replaced Use the command fmdump v u EVENT_ID with the EVENT_ID from the console message to locate the faulty DIMM For example fmdump v u 92e9fbe 735e c218 cf87 9e1720a28004 TIME UUID SUNW MSG ID Sep 14 10 09 46 2234 92e9fbe 735e c218 cf87 9e1720a28004 SUN4V 8000 Dx 95 fault memory dimm FRU mem component MB CMP0 CH0 R0 D0 J0601 rsrc mem component MB CMP0 CH0 R0O D0 J0601 In this example the DIMM location is Chapter 3 Server Diagnostics 3 43 MB CMP0 CHO RO D0 J0601 Refer to the Service Manual or the Service Label attached to the server chassis to find the physical location of the DIMM Once the DIMM has been replaced use the Service Manual for instructions on clearing the fault condition and validating the repair action NOTE The server Product Notes may contain updated service procedures The latest version of the Service Manual and Product Notes are available at the Sun Documentation Center 3 Follow the suggested actions to repair the fault 0 2 Clearing PSH Detected Faults When the Solaris PSH facility det
49. aults Last POST run THU MAR 09 16 52 44 2006 POST status Passed all devices No failures found in System 3 9 Using the Solaris Predictive Self Healing Feature The Solaris Predictive Self Healing PSH technology enables the server to diagnose problems while the Solaris OS is running and mitigate many problems before they negatively affect operations The Solaris OS uses the fault manager daemon fmd 1M which starts at boot time and runs in the background to monitor the system If a component generates an error the daemon handles the error by correlating the error with data from previous errors and other related information to diagnose the problem Once diagnosed the fault manager daemon assigns the problem a Universal Unique Identifier UUID that distinguishes the problem across any set of systems When possible the fault manager daemon initiates steps to self heal the failed component and take the component offline The daemon also logs the fault to the syslogd daemon and Chapter 3 Server Diagnostics 3 39 provides a fault notification with a message ID MSGID You can use the message ID to get additional information about the problem from Sun s knowledge article database The Predictive Self Healing technology covers the following server components a UltraSPARC T1 multicore processor Memory a I O bus The PSH console message provides the following information Type Severity Description Automated respo
50. bin xhost fest system where test system is the name of the server you plan to test Remotely log in to the server as superuser Use a command such as rlogin or telnet Start SunVTS software If you have installed SunVTS software in a location other than the default opt directory alter the path in the following command accordingly opt SUNWvts bin sunvts display display system 0 where display system is the name of the machine through which you are remotely logged in to the server The SunVTS GUI is displayed FIGURE 3 6 Chapter 3 Server Diagnostics 3 51 D SunVTS Diagnostic HME EER Green Pass Red Fail W Processor s NW Memory Cryptography SCSI Devices mpto OtherDevices Network USB Devices FIGURE 3 6 SunVTS GUI 5 Expand the test lists to see the individual tests The test selection area lists tests in categories such as Network as shown in FIGURE 3 7 To expand a category left click the Fy icon expand category icon to the left of the category name 3 52 Sun Fire T1000 Server Service Manual January 2007 Processor s W Memory W Cryptography SCSI Devices mptO Network I ipge1 netlbtest E E m ipge2 netlbtest W ipgeO nettest FIGURE 3 7 SunVTS Test Selection Panel Optional Select the tests you want to run Certain tests are enabled by default and you can choose to accept these Alternatively yo
51. ce Required LED If ALOM CMT does not perform these actions use the enablecomponent command to manually clear the fault and remove the component from the ASR blacklist This procedure describes how to do this After replacing a faulty FRU at the ALOM CMT prompt use the showfaults command to identify POST detected faults POST detected faults are distinguished from other kinds of faults by the text deemed faulty and disabled and no UUID number is reported Example sc gt showfaults v ID Time FRU Fault 1 APR 24 12 47 27 MB CMP0 CHO R1 D0 MB CMP0O CH0 R1 D0 deemed faulty and disabled a If no fault is reported you do not need to do anything else Do not perform the subsequent steps m Ifa fault is reported perform Step 2 through Step 4 3 38 Sun Fire T1000 Server Service Manual January 2007 2 Use the enablecomponent command to clear the fault and remove the component from the ASR blacklist Use the FRU name that was reported in the fault in the previous step Example sc gt enablecomponent MB CMP0 CH0 R1 D0 The fault is cleared and should not appear when you run the showfaults command Additionally if there are no other faults remaining the Service Required LED should be extinguished Power cycle the server You must reboot the server for the enablecomponent command to take effect At the ALOM CMT prompt use the showfaults command to verify that no faults are reported sc gt showf
52. command fmadm repair UUID Example fmadm repair 7ee0e46b ea64 6565 e684 e996963 7b86 3 6 3 6 1 Collecting Information From Solaris OS Files and Commands With the Solaris OS running on the server you have the full compliment of Solaris OS files and commands available for collecting information and for troubleshooting If POST ALOM or the Solaris PSH features do not indicate the source of a fault check the message buffer and log files for notifications for faults Hard drive faults are usually captured by the Solaris message files Use the dmesg command to view the most recent system message To view the system messages log file view the contents of the var adm messages file Checking the Message Buffer 1 Log in as superuser Chapter 3 Server Diagnostics 3 45 3 6 2 2 Issue the dmesg command dmesg The dmesg command displays the most recent messages generated by the system Viewing System Message Log Files The error logging daemon syslogd automatically records various system warnings errors and faults in message files These messages can alert you to system problems such as a device that is about to fail The var adm directory contains several message files The most recent messages are in the var adm messages file After a period of time usually every ten days a new messages file is automatically created The original contents of the messages file are rotated to a file
53. cts and offlines memory devices with errors that could be correctable by PSH Use the examples in this section to verify if the detected memory devices are correctable Note For servers powered on in maximum mode without the intention of validating a hardware upgrade or repair examine all faults detected by POST to verify if the errors can be corrected by Solaris PSH See Section 3 5 Using the Solaris Predictive Self Healing Feature on page 3 39 When using maximum mode if no faults are detected return POST to minimum mode sc gt setkeyswitch normal sc gt setsc diag mode normal sc gt setsc diag level min Chapter 3 Server Diagnostics 3 35 3 4 5 1 Correctable Errors for Single DIMMs If POST faults a single DIMM CODE EXAMPLE 3 1 that was not part of a hardware upgrade or repair it is likely that POST encountered a correctable error that can be handled by PSH CODE EXAMPLE 3 1 POST Fault for a Single DIMM sc gt showfaults v ID Time FRU Fault 1 OCT 13 12 47 27 MB CMP0 CH0O RO DO MB CMP0 CHO RO DO deemed faulty and disabled In this case reenable the DIMM and run POST in minimum mode as follows 1 Reenable the DIMM sc gt enablecomponent name of DIMM 2 Return POST to minimum mode sc gt setkeyswitch normal sc gt setsc diag mode normal sc gt setsc diag level min 3 Reset the system so that POST runs There are several ways to initiate a reset The followin
54. ects faults the faults are logged and displayed on the console After the fault condition is corrected for example by replacing a faulty FRU you must clear the fault Note If you are dealing with faulty DIMMs do not follow this procedure Instead perform the procedure in Section 5 5 2 Installing DIMMs on page 5 16 1 After replacing a faulty FRU power on the server 2 At the ALOM CMT prompt use the showfaults command to identify PSH detected faults PSH detected faults are distinguished from other kinds of faults by the text Host detected fault Example sc gt showfaults v ID Time FRU Fault 0 SEP 09 11 09 26 MB CMP0 CH0 R1 D0 Host detected fault MSGID SUN4U 8000 2S UUID 7ee0e46b ea64 6565 e684 e996963F7b86 a If no fault is reported you do not need to do anything else Do not perform the subsequent steps m Ifa fault is reported perform Step 2 through Step 4 3 44 Sun Fire T1000 Server Service Manual January 2007 3 Run the clearfault command with the UUID provided in the showfaults output sc gt clearfault 7ee0e46b ea64 6565 e684 e996963f7b86 Clearing fault from all indicted FRUs Fault cleared Clear the fault from all persistent fault records In some cases even though the fault is cleared some persistent fault information remains and results in erroneous fault messages at boot time To ensure that these messages are not displayed perform the following
55. ent sensor status displaying 3 17 Index 3 Index 4 Sun Fire T1000 Server Service Manual January 2007
56. erver Front Panel 2 2 Sun Fire T1000 Server Service Manual January 2007 Power supply LEDs Ethernet ports PCI E slot Locator LE ne y Service Required LED Power OK LED SC serial management port DB9 serial port SC network management port FIGURE 2 4 Server Rear Panel 2 2 Obtaining the Chassis Serial Number To obtain support for your system you need your chassis serial number On the server the chassis serial number is located on a sticker that is on the front of the server and another sticker at the rear of the server below the AC power connector You can also run the ALOM CMT showplatform command to obtain the chassis serial number Example sc gt showplatform SUNW Sun Fire T1000 Chassis Serial Number 0529AP000882 Domain Status SO OS Standby sc gt Chapter 2 Server Overview 2 3 2 4 Sun Fire T1000 Server Service Manual January 2007 CHAPTER 3 Server Diagnostics This chapter describes the diagnostics that are available for monitoring and troubleshooting the server This chapter does not provide detailed troubleshooting procedures but instead describes the server diagnostics facilities and how to use them This chapter is intended for technicians service personnel and system administrators who service and repair computer systems The following topics are covered m Section 3 1 Overview of Server Diagnostics on page 3 1 m Section 3 2 Using LEDs to Identif
57. escribes how to prepare for servicing the server Chapter 5 describes how to remove and replace the field replaceable units FRUs within the server Chapter 6 describes how to finish up the servicing of the server Appendix A lists the field replaceable components in the server xiii Using UNIX Commands This document might not contain information about basic UNIX commands and procedures such as shutting down the system booting the system and configuring devices Refer to the following for this information a Software documentation that you received with your system m Solaris Operating System documentation which is at http docs sun com Typographic Conventions Typeface Meaning Examples AaBbCc123 The names of commands files and directories on screen computer output AaBbCc123 What you type when contrasted with on screen computer output AaBbCc123 Book titles new words or terms words to be emphasized Replace command line variables with real names or values Edit your login file Use 1s a to list all files You have mail su Password Read Chapter 6 in the User s Guide These are called class options You must be superuser to do this To delete a file type rm filename The settings on your browser might differ from these settings Shell Prompts Shell C shell C shell superuser Bourne shell and Korn shell Bourne shell and Korn shell superuser x
58. etting up your equipment a Follow all Sun standard cautions warnings and instructions marked on the equipment and described in Important Safety Information for Sun Hardware Systems 816 7190 a Ensure that the voltage and frequency of your power source match the voltage and frequency inscribed on the equipment s electrical rating label m Follow the electrostatic discharge safety practices as described in this Section 1 3 Electrostatic Discharge Safety on page 1 2 12 Safety Symbols The following symbols might appear in this document Note their meanings gt E gt gt Caution There is a risk of personal injury and equipment damage To avoid personal injury and equipment damage follow the instructions Caution Hot surface Avoid contact Surfaces are hot and might cause personal injury if touched Caution Hazardous voltages are present To reduce the risk of electric shock and danger to personal health follow the instructions Ts QJ Low Electrostatic Discharge Safety Electrostatic discharge ESD sensitive devices such as the motherboard PCI cards hard drives and memory cards require special handling Caution The boards and hard drives contain electronic components that are extremely sensitive to static electricity Ordinary amounts of static electricity from clothing or the work environment can destroy components Do not touch the components along their connector
59. g example uses the powercycle command For other methods refer to the Sun Fire T1000 Server Administration Guide sc gt powercycle Are you sure you want to powercycle the system y n y Powering host off at MON JAN 10 02 52 02 2000 Waiting for host to Power Off hit any key to abort SC Alert SC Request to Power Off Host SC Alert Host system has shut down Powering host on at MON JAN 10 02 52 13 2000 SC Alert SC Request to Power On Host 4 Replace the DIMM if POST continues to fault the device in minimum mode 3 36 Sun Fire T1000 Server Service Manual January 2007 3 4 5 2 Determining When to Replace Detected Devices Note This section assumes faults are detected by POST in maximum mode If a detected device is part of a hardware upgrade or repair or if POST detects multiple DIMMs CODE EXAMPLE 3 2 replace the detected devices CODE EXAMPLE 3 2 POST Fault for Multiple DIMMs sc gt showfaults v ID Time FRU Fault 1 OCT 13 12 47 27 MB CMP0 CHO RO DO MB CMP0 CHO RO DO deemed faulty and disabled 2 OCT 13 12 47 27 MB CMP0O CHO RO D1 MB CMP0 CH0 RO D1 deemed faulty and disabled Note The previous example shows two DIMMs on the same channel rank which could be an uncorrectable error If the detected device is not a part of a hardware upgrade or repair use the following list to examine and repair the fault 1 If a detected device is not a DIMM or if more than a si
60. garanties express ment stipul es dans le contrat de licence r gissant le produit ou la technologie fourni e SAUF MENTION CONTRAIRE EXPRESS MENT STIPUL E DANS CE CONTRAT FUJITSU LIMITED SUN MICROSYSTEMS INC ET LES SOCI T S AFFILI ES REJETTENT TOUTE REPR SENTATION OU TOUTE GARANTIE QUELLE QU EN SOIT LA NATURE EXPRESSE OU IMPLICITE CONCERNANT CE PRODUIT CETTE TECHNOLOGIE OU CE DOCUMENT LESQUELS SONT FOURNIS EN L TAT EN OUTRE TOUTES LES CONDITIONS REPR SENTATIONS ET GARANTIES EXPRESSES OU TACITES Y COMPRIS NOTAMMENT TOUTE GARANTIE IMPLICITE RELATIVE LA QUALIT MARCHANDE L APTITUDE UNE UTILISATION PARTICULI RE OU L ABSENCE DE CONTREFA ON SONT EXCLUES DANS LA MESURE AUTORIS E PAR LA LOI APPLICABLE Sauf mention contraire express ment stipul e dans ce contrat dans la mesure autoris e par la loi applicable en aucun cas Fujitsu Limited Sun Microsystems Inc ou l une de leurs filiales ne sauraient tre tenues responsables envers une quelconque partie tierce sous quelque th orie juridique que ce soit de tout manque gagner ou de perte de profit de probl mes d utilisation ou de perte de donn es ou d interruptions d activit s ou de tout dommage indirect sp cial secondaire ou cons cutif m me si ces entit s ont t pr alablement inform es d une telle ventualit LA DOCUMENTATION EST FOURNIE EN L ETAT ET TOUTES AUTRES CONDITIONS DECLARATIONS ET GARANTIES EXPRESSES OU TACITES SONT FO
61. ge 0 gt L2 0 gt Data Bitwalk 0 gt L2 Scrub Data Enable OO Or OOO 0 gt Testing Memory Channel 0 Rank 0 Stack 0 gt Testing Memory Channel 3 Rank 0 Stack 0 gt Testing Memory Channel 0 Rank 1 Stack 0 0 gt ERROR TEST Data Bitwalk 0 0 gt H W under test MB CMP0 CH0O R1 D0 S0 0 0 gt Repair Instructions Replace items in under test above 0 0 gt MSG Pin 3 failed on MB CMP0 CH0 R1 D0 S0O J0701 0 0 gt END_ERROR oO J0701 order listed by H W 0 0 gt Testing Memory Channel 3 Rank 1 Stack 0 In this example POST is reporting a memory error at DIMM location MB CMP0 CHO R1 D0 J0701 Sun Fire T1000 Server Service Manual January 2007 3 4 5 b Run the showfaults command to obtain additional fault information The fault is captured by ALOM where the fault is logged the Service Required LED is lit and the faulty component is disabled Example ok sc gt showfaults v ID Time FRU Fault 1 APR 24 12 47 27 MB CMP0 CHO R1 D0 MB CMP0O CH0 R1 D0 deemed faulty and disabled In this example MB CMP0 CHO R1 D0 is disabled The system can boot using memory that was not disabled until the faulty component is replaced Note You can use ASR commands to display and control disabled components See Section 3 7 Managing Components With Automatic System Recovery Commands on page 3 46 Correctable Errors Detected by POST In maximum mode POST dete
62. ght boot or the system might remain at the ok prompt If the system is at the ok prompt type boot d Return the virtual keyswitch to normal mode sc gt setkeyswitch normal e Issue the Solaris OS fmadm faulty command fmadm faulty No memory or DIMM faults should be displayed If faults are reported refer to the diagnostics flow chart in FIGURE 3 1 for an approach to diagnose the fault Sun Fire T1000 Server Service Manual January 2007 10 11 12 13 Obtain the ALOM CMT sc gt prompt Run the showfaults command If the fault was detected by the host and the fault information persists the output will be similar to the following example sc gt showfaults v ID Time 0 SEP 09 11 09 26 MI MSGID SUN4V 8000 DX UUID FRU Fault B CMP0 CHO RO DO Host detected fault 92e9fbe 735e c218 cf 87 9e1720a28004 If the showfaults command does not report a fault with a UUID then you do not need to proceed with the following steps because the fault is cleared Run the clearfault command sc gt clearfault 92e9fbe 735e c218 cf 87 9e1720a28004 Switch to the system console sc gt console Issue the fmadm repair command with the UUID Use the same UUID that you used with the clearfault command fmadm repair 92e9fbe 735e c218 c 87 9e1720a28004 Chapter 5 Replacing Field Replaceable Units 5 19 5 6 5 6 1 5 6 2
63. ing of this document to you does not give you any rights or licenses express or implied with respect to the product or technology to which it pertains and this document does not contain or represent any commitment of any kind on the part of Fujitsu Limited or Sun Microsystems Inc or any affiliate of either of them This document and the product and technology described in this document may incorporate third party intellectual property copyrighted by and or licensed from suppliers to Fujitsu Limited and or Sun Microsystems Inc including software and font technology Per the terms of the GPL or LGPL a copy of the source code governed by the GPL or LGPL as applicable is available upon request by the End User Please contact Fujitsu Limited or Sun Microsystems Inc This distribution may include materials developed by third parties Parts of the product may be derived from Berkeley BSD systems licensed from the University of California UNIX is a registered trademark in the U S and in other countries exclusively licensed through X Open Company Ltd Sun Sun Microsystems the Sun logo Java Netra Solaris Sun StorEdge SPARC Enterprise docs sun com OpenBoot SunVTS Sun Fire SunSolve CoolThreads J2EE and Sun are trademarks or registered trademarks of Sun Microsystems Inc in the U S and other countries Fujitsu and the Fujitsu logo are registered trademarks of Fujitsu Limited All SPARC trademarks are used under license and are
64. ition to this service manual the following resources are available to help you keep your server running optimally Product Notes The Sun Fire T1000 Server Product Notes 819 3246 contain late breaking information about the system including required software patches updated hardware and compatibility information and solutions to know issues The product notes are available online at http www sun com documentation Preface xv m Release Notes The Solaris OS release notes contain important information about the Solaris OS The release notes are available online at http www sun com documentation a SunSolveSM Online Provides a collection of support resources Depending on the level of your service contract you have access to Sun patches the Sun System Handbook the SunSolve knowledge base the Sun Support Forum and additional documents bulletins and related links Access this site at http sunsolve sun com a Predictive Self Healing Knowledge Database You can access the knowledge article corresponding to a self healing message by taking the Sun Message Identifier GUNW MSG ID and entering it into the field on this page http www sun com msg Documentation Support and Training Sun Function URL Documentation http www sun com documentation Support http www sun com support Training http www sun com training xvi Third Party Web Sites Sun is not responsible for the availabi
65. iv Sun Fire T1000 Server Service Manual January 2007 Prompt machine name machine name Sun Fire T1000 Server Documentation You can view and print the following documents from the Sun documentation web site at http www sun com documentation Title Description Part Number Sun Fire T1000 Server Site Planning Guide Sun Fire T1000 Server Product Notes Sun Fire T1000 Server Getting Started Guide Sun Fire T1000 Server Overview Sun Fire T1000 Server Installation Guide Sun Fire T1000 Server Administration Guide Advanced Lights Out Management ALOM CMT Guide Sun Fire T1000 Server Safety and Compliance Guide Site planning information for the server Late breaking information about the server The latest notes are posted at http www sun com documentation Information about where to find documentation to get your system installed and running quickly Provides an overview of the features of this server Detailed rackmounting cabling power on and configuration information How to perform administrative tasks that are specific to this server How to use the Advanced Lights Out Manager ALOM CMT software on this server Provides safety and compliance information that is specific to this server 819 3749 819 3246 819 3244 819 3245 819 3247 819 3249 819 3250 version 1 1 819 6672 version 1 2 819 6674 Additional Service Related Information In add
66. ix A Installing the Motherboard and Chassis Replace the PCI Express card See Section 5 1 Replacing the Optional PCI Express Card on page 5 2 5 20 Sun Fire T1000 Server Service Manual January 2007 Replace the fan tray assembly and cable See Section 5 2 Replacing the Fan Tray Assembly on page 5 4 Replace the power supply and cable See Section 5 3 Replacing the Power Supply on page 5 5 Replace the hard drive and cable See Section 5 4 Replacing a Hard Drive on page 5 7 Replace the memory DIMMs See Section 5 5 Replacing DIMMs on page 5 14 Replace the socketed system configuration SEEPROM The location of this SEEPROM is shown in Appendix A Perform the procedures described in Chapter 6 Boot the system and run POST to verify that the system is fully operational See Section 3 4 Running POST on page 3 22 Chapter 5 Replacing Field Replaceable Units 5 21 57 Replacing the Clock Battery 5 71 Removing the Clock Battery on the Motherboard 1 Perform the procedures described in Chapter 4 2 Using a small flathead screwdriver carefully pry the battery from the motherboard FIGURE 5 12 FIGURE 5 12 Removing the Clock Battery From the Motherboard 5 7 2 Installing the Clock Battery on the Motherboard 1 Unpack the replacement battery 2 Press the new battery into the motherboard with the facing upward FIGURE 5 13 5 22 Sun Fire T1000
67. k Management Activity LED SC Network Management Link LED Front and White rear panels Front and Yellow rear panels Front and Green rear panels Front panel N A Rear panel Green Rear panel Yellow Rear panel Yellow Rear panel Green Enables you to identify a particular server Activate the LED using one of the following methods e Issuing the setlocator on or off command e Pressing the button to toggle the indicator on or off This LED provides the following indications e Off Normal operating state e Fast blink The server received a signal as a result of one of the preceding methods and is indicating here I am that it is operational If on indicates that service is required The ALOM CMT showfaults command will indicate any faults causing this indicator to light The LED provides the following indications e Off Indicates that the system is unavailable Either it has no power or ALOM CMT is not running e Steady on Indicates that the system is powered on and is running in its normal operating state No service actions are required e Standby blink Indicates the system is running at a minimum level in standby and is ready to be quickly returned to full function The service processor is running e Slow blink Indicates that a normal transitory activity is taking place Server diagnostics could be running or the system might be powering on Turns the server on and off These
68. k the replacement power supply 2 Slide the power supply into the chassis and engage the two alignment pins in the rear of the chassis that mate with the power supply 3 Push the fastener down on the front of the power supply to lock it into place in the chassis FIGURE 5 5 5 6 Sun Fire T1000 Server Service Manual January 2007 Power supply Fastener FIGURE 5 5 Installing the Power Supply 4 Redress the power cable through the midwall in the chassis and connect the cable to the motherboard 5 Perform the procedures described in Chapter 6 6 At the sc gt prompt issue the showenvironment command to verify the status of the power supply 5 4 Replacing a Hard Drive m To remove a hard drive from a single drive assembly go to Section 5 4 1 Replacing a Hard Drive in a Single Drive Assembly on page 5 8 m To remove a hard drive from a dual drive assembly go to Section 5 4 2 Replacing a Hard Drive in a Dual Drive Assembly on page 5 10 Chapter 5 Replacing Field Replaceable Units 5 7 5 4 1 Replacing a Hard Drive in a Single Drive Assembly 5 4 1 1 Removing the Hard Drive in a Single Drive Assembly 1 Perform the procedures described in Chapter 4 2 Disconnect the drive cable from the data power connector at the rear of the hard drive FIGURE 5 6 3 Pull the fasteners up on the rear of the single drive assembly and remove the assembly from the chassis FIGURE 5 6 FIGURE 5 6 Removing the
69. l LED status displaying 3 17 FRU ID PROMs 3 12 FRU status displaying 3 19 H hard drive installing 5 9 5 12 removing 5 8 5 10 status displaying 3 17 hardware components sanity check 3 27 help command 3 14 installing clock battery 5 22 DIMMs 5 16 fan tray assembly 5 5 hard drive 5 9 5 12 motherboard and chassis 5 20 PCI Express card 5 3 power supply 5 6 top cover 6 1 installing the server in the rack 6 1 L LEDs AC OK 3 4 Power OK 3 4 log files viewing 3 46 M memory configuration 3 7 fault handling 3 6 message ID 3 40 messages file 3 45 motherboard and chassis installing 5 20 removing 5 20 P PCI Express card installing 5 3 removing 5 2 POST detected faults 3 4 3 16 POST see also power on self test POST 3 22 Power OK LED 3 4 power supply installing 5 6 removing 5 5 power supply status displaying 3 17 powercycle command 3 14 3 28 3 36 powering down the system 4 2 powering on the system 6 2 poweroff command 3 15 poweron command 3 15 power on self test POST 3 5 about 3 22 ALOM CMT commands 3 23 configuration flow chart 3 25 error message example 3 34 error messages 3 34 example output 3 29 fault clearing 3 38 faulty components detected by 3 38 how to run 3 28 parameters changing 3 26 reasons to run 3 27 troubleshooting with 3 6 Predictive Self Healing PSH Index 2 Sun Fire T1000 Server Service Manual January 2007
70. le RS sea Errors Detected by POST diag_level max the detected errors might be 3 35 correctable by PSH after the server boots Lois e If POST does not indicate a faulty FRU go to Action No 9 6 Determine if the If the fault listed by the showfaults command Section 3 3 2 Running fault is an displays a temperature or voltage fault then the the showfaults environmental fault is an environmental fault Environmental Command on page 3 16 fault faults can be caused by faulty FRUs power supply or fan tray or by environmental conditions such as when computer room ambient temperature is too high or the server airflow is blocked When the environmental condition is corrected the fault will automatically clear You can also use the fault LEDs on the server to identify the faulty FRU fan tray or power supply Chapter 5 Section Replacing Field Replaceable Units on page 5 1 Section 3 2 Using LEDs to Identify the State of Devices on page 3 8 Chapter 3 Server Diagnostics 3 5 TABLE 3 1 Action No Diagnostic Flowchart Actions Continued Diagnostic Action Resulting Action For more information see these sections 7 Determine if the fault was detected by PSH Determine if the fault was detected by POST Contact technical support If the fault message displays the following text the fault was detected by the Solaris Predictive Self Healing software Host detected fault
71. les from the Reports menu This action opens a log window from which you can choose to view the following logs Information Detailed versions of all the status and error messages that appear in the test messages area m Test Error Detailed error messages from individual tests m VTS Kernel Error Error messages pertaining to SunVTS software itself You should look here if SunVTS software appears to be acting strangely especially when it starts up m Solaris OS Messages var adm messages A file containing messages generated by the operating system and various applications a Log Files var opt SUNWvts logs A directory containing the log files 3 54 Sun Fire T1000 Server Service Manual January 2007 CHAPTER 4 Preparing for Servicing This chapter describes how to prepare the server for servicing The following topics are covered m Section 4 1 Common Procedures for Parts Replacement on page 4 1 For a list of FRUs see Appendix A Note Never attempt to run the system with the cover removed The cover must be in place for proper air flow The cover interlock switch immediately shuts the system down when the cover is removed 4 1 Common Procedures for Parts Replacement Before you can remove and replace parts that are inside the server you must perform the following procedures m Section 4 1 2 Shutting the System Down on page 4 2 m Section 4 1 3 Removing the Server Fr
72. lity of third party web sites mentioned in this document Sun does not endorse and is not responsible or liable for any content advertising products or other materials that are available on or through such sites or resources Sun will not be responsible or liable for any actual or alleged damage or loss caused by or in connection with the use of or reliance on any such content goods or services that are available on or through such sites or resources Sun Fire T1000 Server Service Manual January 2007 Sun Welcomes Your Comments Sun is interested in improving its documentation and welcomes your comments and suggestions You can submit your comments by going to http www sun com hwdocs feedback Please include the title and part number of your document with your feedback Sun Fire T1000 Server Service Manual part number 819 3248 13 Preface xvii xviii Sun Fire T1000 Server Service Manual January 2007 CHAPTER 1 Safety Information This chapter provides important safety information for servicing the server The following topics are covered m Section 1 1 Safety Information on page 1 1 m Section 1 2 Safety Symbols on page 1 1 m Section 1 3 Electrostatic Discharge Safety on page 1 2 1 1 Safety Information This section describes safety information you need to know prior to removing or installing parts in the server For your protection observe the following safety precautions when s
73. lts command The showfaults command lists memory faults and lists the specific DIMMS that are associated with the fault Once you identify which DIMMs to replace see Chapter 5 for DIMM removal and replacement instructions It is important that you perform the instructions in that chapter to clear the faults and enable the replaced DIMMs 92 Using LEDs to Identify the State of Devices The server provides the following groups of LEDs m Front and rear panel LEDS FIGURE 3 2 FIGURE 3 3 and TABLE 3 2 m Power supply LEDs FIGURE 3 3 and TABLE 3 3 These LEDs provide a quick visual check of the state of the system Locator LED button ae Service Required LED Power OK LED and Power On Off button FIGURE 3 2 LEDs on the Server Front Panel 3 8 Sun Fire T1000 Server Service Manual January 2007 Activity LED Activity LED Fault LED Link LED Link LED DC OK LED Power OK LED AC OK LED Service Required LED Locator LED button FIGURE 3 3 LEDs on the Server Rear Panel Chapter 3 Server Diagnostics 3 9 CPAN Front and Rear Panel LEDs Two LEDs and one LED button are located in the upper left corner of the front panel TABLE 3 2 The LEDs are also provided on the rear panel TABLE 3 2 Front and Rear Panel LEDs LED Location Color Description Locator LED button Service Required LED Power OK LED Power On Off button Ethernet Link Activity LEDs Ethernet Link LEDs SC Networ
74. mmand S S sc gt showfru FRU_PROM at MB SEEPROM EGMENT SD Man an an an an an an an an Man 2 2 2 2 2 2 2 2 s R R UNIX_Timestamp32 R Description R Manufacture Location R Sun Part No R Sun Serial No R Vendor R Initial HW Rev Level R Shortname SpecPartNo Man Man an an an an an an Man 2 2 2 2 2 2 FRU_PROM at PS0 SEEPROM EGMENT SD R R UNIX_Timestamp32 R Description R Manufacture Location R Sun Part No R Sun Serial No R Vendor R Initial HW Rev Level R Initial HW Dash Level R Initial HW Dash Level TUE OCT 18 21 17 55 2005 ASSY Sun Fire T1000 Motherboard Sriracha Chonburi Thailand 5017302 002989 Celestica 03 01 T1000_MB 885 0505 04 SUN JUL 31 19 45 13 2005 PSU 300W AC_INPUT A207 Matamoros Tamps 3001799 GO0001 Tyco Electronics 02 OT Mexico Chapter 3 Server Diagnostics 3 19 ManR Shortname PS SpecPartNo 885 0407 02 FRU_PROM at MB CMP0 CH0 R0O D0 SEEPROM SPD Timestamp MON OCT 03 12 00 00 2005 SPD Description DDR2 SDRAM 2048 MB SPD Manufacture Location SPD Vendor Infineon formerly Siemens SPD Vendor Part No 72T256220HR3 7A SPD Vendor Serial No d03fe27 FRU_PROM at MB CMP0 CH0O R0O D1 SEEPROM SPD Timestamp MON OCT 03 12 00 00 2005 SPD Description DDR2 SDRAM
75. ngle DIMM is detected replace the detected devices 2 If a detected device is a single DIMM and the same DIMM is also detected by PSH replace the DIMM CODE EXAMPLE 3 3 CODE EXAMPLE 3 3 PSH and POST Faults on the Same DIMM sc gt showfaults v ID Time FRU Fault 0 SEP 09 11 09 26 MB CMP0 CHO0 RO DO Host detected fault MSGID SUN4V 8000 DX UUID 7ee0e46b ea64 6565 e684 e996963f7b86 1 OCT 13 12 47 27 MB CMP0 CHO0 RO DO MB CMP0 CHO RO DO deemed faulty and disabled Note The detected DIMM in the previous example must also be replaced because it exceeds the PSH page retire threshold Chapter 3 Server Diagnostics 3 37 3 4 6 3 If a device detected by POST is a single DIMM and the same DIMM is not detected by PSH follow the procedure in Section 3 4 5 1 Correctable Errors for Single DIMMs on page 3 36 After the detected devices are repaired or replaced return POST to the default minimum level sc gt setkeyswitch normal sc gt setsc diag mode normal sc gt setsc diag level min Clearing POST Detected Faults In most cases when POST detects a faulty component POST logs the fault and automatically takes the failed component out of operation by placing the component in the ASR blacklist see Section 3 7 Managing Components With Automatic System Recovery Commands on page 3 46 In most cases after the faulty FRU is replaced ALOM CMT detects the repair and extinguishes the Servi
76. nology continuously monitors the health of the CPU and memory and works with ALOM CMT to take a faulty component offline if needed The Predictive Self Healing technology enables systems to accurately predict component failures and mitigate many serious problems before they occur Log files and console messages Provide the standard Solaris OS log files and investigative commands that can be accessed and displayed on the device of your choice m SunVTS An application that exercises the system provides hardware validation and discloses possible faulty components with recommendations for repair The LEDs ALOM CMT Solaris OS PSH and many of the log files and console messages are integrated For example a fault detected by the Solaris PSH software displays the fault logs it passes information to ALOM CMT where it is logged and depending on the fault might illuminate of one or more LEDs The diagnostic flow chart in FIGURE 3 1 and TABLE 3 1 describes an approach for using the server diagnostics to identify a faulty field replaceable unit FRU The diagnostics you use and the order in which you use them depend on the nature of the problem you are troubleshooting so you might perform some actions and not others The flow chart assumes that you have already performed some troubleshooting such as verification of proper installation and visual inspection of cables and power and possibly performed a reset of the server refer to
77. nosing the System Hardware 3 28 3 4 4 Running POST in Maximum Mode 3 28 3 4 5 Correctable Errors Detected by POST 3 35 3 4 5 1 Correctable Errors for Single DIMMs 3 36 3 4 5 2 Determining When to Replace Detected Devices 3 37 3 4 6 Clearing POST Detected Faults 3 38 3 5 Using the Solaris Predictive Self Healing Feature 3 39 3 5 1 Identifying PSH Detected Faults 3 40 3 5 1 1 Using the fmdump Command to Identify Faults 3 41 3 5 2 Clearing PSH Detected Faults 3 44 3 6 Collecting Information From Solaris OS Files and Commands 3 45 3 6 1 Checking the Message Buffer 3 45 3 6 2 Viewing System Message Log Files 3 46 vi Sun Fire T1000 Server Service Manual January 2007 3 7 3 8 Managing Components With Automatic System Recovery Commands 3 3 7 1 Displaying System Components 3 47 3 7 2 Disabling Components 3 48 3 7 3 Enabling Disabled Components 3 49 Exercising the System With SunVTS 3 49 3 8 1 Checking Whether SunVTS Software Is Installed 3 49 3 8 2 Exercising the System Using SunVTS Software 3 50 3 8 3 Using SunVTS Software 3 51 Preparing for Servicing 4 1 4 1 Common Procedures for Parts Replacement 4 1 4 1 1 Required Tools 4 2 4 1 2 Shutting the System Down 4 2 4 1 3 Removing the Server From a Rack 4 3 4 1 4 Performing Electrostatic Discharge ESD Prevention Measures 4 5 4 1 5 Removing the Top Cover 4 5 Replacing Field Replaceable Units 5 1 5 1 5 2 5 3 5 4 Replacing the Optional PCI Express Card 5 2 5 1
78. nse Impact Suggested action for system administrator If the Solaris PSH facility detects a faulty component use the fmdump command to identify the fault Faulty FRUs are identified in fault messages using the FRU name For a list of FRU names see Appendix A Note Additional Predictive Self Healing information is available at http www sun com msg IO Identifying PSH Detected Faults When a PSH fault is detected a Solaris console message similar to the following is displayed UNW MSG ID SUN4V 8000 DX TYPE Fault VER 1 SEVERITY Minor VENT TIME Wed Sep 14 10 09 46 EDT 2005 LATFORM SUNW Sun Fire T200 CSN HOSTNAME wgs48 37 OURCE cpumem diagnosis REV 1 5 VENT ID 92e9fbe 735e c218 cf87 9e1720a28004 ESC The number of errors associated with this memory module has exceeded cceptable levels Refer to http sun com msg SUN4V 8000 DX for more nformation UTO RESPONSE Pages of memory associated with this memory module are being emoved from service as errors are reported MPACT Total system memory capacity will be reduced as pages are retired REC ACTION Schedule a repair procedure to replace the affected memory module Use fmdump v u lt EVENT_ID gt to identify the module HN ww w peo H K 3 40 Sun Fire T1000 Server Service Manual January 2007 3 5 1 1 The following is an example of the ALOM CMT alert for the same PSH diagnosed fault SC Alert
79. o ALOM CMT command to manually repair an environmental fault The Solaris Predictive Self Healing technology does not monitor the hard drive for faults As a result ALOM CMT does not recognize hard drive faults and will not light the fault LEDs on either the chassis or the hard drive itself Use the Solaris message files to view hard drive faults See Section 3 6 Collecting Information From Solaris OS Files and Commands on page 3 45 Running ALOM CMT Service Related Commands This section describes the ALOM CMT commands that are commonly used for service related activities Connecting to ALOM Before you can run ALOM CMT commands you must connect to the ALOM There are several ways to connect to the system controller a Connect an ASCII terminal directly to the serial management port m Use either the telnet or the ssh command to connect to ALOM CMT through an Ethernet connection on the network management port ALOM CMT can be configured for either the telnet or the ssh command but not both Note Refer to the Advanced Lights Out Management ALOM CMT Guide for instructions on configuring and connecting to ALOM Chapter 3 Server Diagnostics 3 13 3 3 1 2 Switching Between the System Console and ALOM m To switch from the console output to the ALOM CMT sc gt prompt type Hash Period Note that this command is user configureable Refer to the Advanced Lights Out Management ALOM CMT Guide for more information
80. om a Rack on page 4 3 m Section 4 1 4 Performing Electrostatic Discharge ESD Prevention Measures on page 4 5 m Section 4 1 5 Removing the Top Cover on page 4 5 The corresponding procedures that you perform when maintenance is complete are described in Chapter 6 4 1 4 1 1 4 1 2 Required Tools The server can be serviced with the following tools a Antistatic wrist strap a Antistatic mat m No 2 Phillips screwdriver Shutting the System Down Performing a graceful shutdown ensures that all of your data is saved and the system is ready for restart Log in as superuser or equivalent Depending on the nature of the problem you might want to view the system status or the log files or run diagnostics before you shut down the system Refer to the Sun Fire T1000 Server Administration Guide for log file information Notify affected users Refer to your Solaris system administration documentation for additional information Save any open files and quit all running programs Refer to your application documentation for specific information on these processes Shut down the OS At the Solaris OS prompt issue the uadmin command to halt the Solaris OS and to return to the ok prompt uadmin 2 0 WARNING proc_exit init exited syncing file systems done Program terminated ok This command is described in the Solaris system administration documentation 5 Switch from the sy
81. om the end of the buffer e b lines displays n lines from beginning of buffer e v displays entire buffer boot run specifies the log to display run is the default log Enables control of the firmware during system initialization with the following options e normal is the default boot mode e reset_nvram resets OpenBoot PROM parameters to their default values bootscript string enables the passing of a string to the boot command Performs a poweroff followed by poweron The f option forces an immediate poweroff otherwise the command attempts a graceful shutdown 3 14 Sun Fire T1000 Server Service Manual January 2007 TABLE 3 4 ALOM CMT Command Service Related ALOM CMT Commands Continued Description poweroff y f poweron c reset y c resetsc y setkeyswitch y normal stby diag locked setlocator on off showenvironment showfaults v showfru g lines s d FRU showkeyswitch showlocator showlogs b lines e lines v g lines p logtype r p showplat form v Powers off the host server The y option enables you to skip the confirmation question The option forces an immediate shutdown Powers on the host server Using the c option executes a console command after completion of the poweron command Generates a hardware reset on the host server The y option enables you to skip the confirmation question The c op
82. ontr lent toutes deux des droits de propri t intellectuelle relatifs aux produits et technologies d crits dans ce document De m me ces produits technologies et ce document sont prot g s par des lois sur le copyright des brevets d autres lois sur la propri t intellectuelle et des trait s internationaux Les droits de propri t intellectuelle de Sun Microsystems Inc et Fujitsu Limited concernant ces produits ces technologies et ce document comprennent sans que cette liste soit exhaustive un ou plusieurs des brevets d pos s aux Etats Unis et indiqu s l adresse http www sun com patents de m me qu un ou plusieurs brevets ou applications brevet es suppl mentaires aux Etats Unis et dans d autres pays Ce document le produit et les technologies aff rents sont exclusivement distribu s avec des licences qui en restreignent l utilisation la copie la distribution et la d compilation Aucune partie de ce produit de ces technologies ou de ce document ne peut tre reproduite sous quelque forme que ce soit par quelque moyen que ce soit sans l autorisation crite pr alable de Fujitsu Limited et de Sun Microsystems Inc et de leurs ventuels bailleurs de licence Ce document bien qu il vous ait t fourni ne vous conf re aucun droit et aucune licence expresses ou tacites concernant le produit ou la technologie auxquels il se rapporte Par ailleurs il ne contient ni ne repr sente aucun engagement de quelque type
83. ormation m Faulted FRU FRU mem component MB CMPO CHO0 R0 D0 J0601 that in this example MB is identified as the DIMM at RO DO J0601 Chapter 3 Server Diagnostics 3 41 Note fmdump displays the PSH event log Entries remain in the log after the fault has been repaired 2 Use the Sun message ID to obtain more information about this type of fault a In a browser go to the Predictive Self Healing Knowledge Article web site http www sun com msg 3 42 Sun Fire T1000 Server Service Manual January 2007 b Obtain the message ID from the console output or the ALOM CMT showfaults command c Enter the message ID in the SUNW MSG ID field and click Lookup In this example the message ID SUN4V 8000 DX returns the following information for corrective action Article for Message ID SUN4V 8000 DX Correctable memory errors exceeded acceptable levels Type Fault Severity Major Description The number of correctable memory errors reported against a memory DIMM has exceeded acceptable levels Automated Response Pages of memory associated with this memory DIMM are being removed from service as errors are reported Impact Total system memory capacity will be reduced as pages are retired Suggested Action for System Administrator Schedule a repair procedure to replace the affected memory DIMM the identity of which can be determined using the command fmdump v u EVENT_ID Details The Message ID SUN4V 8000 DX indic
84. re is providing service Note See TABLE 3 7 for the ALOM CMT ASR commands Chapter 3 Server Diagnostics 3 15 DA Running the showfaults Command The ALOM CMT showfaults command displays the following kinds of faults a Environmental faults temperature or voltage problems that might be caused by faulty FRUs a power supply or fan tray or by room temperature or blocked air flow to the server m POST detected faults faults on devices detected by the power on self test diagnostics m PSH detected faults faults detected by the Solaris Predictive Self Healing PSH technology Use the showfaults command for the following reasons a To see if any faults have been passed to or detected by ALOM a To obtain the fault message ID GUNW MSG ID for PSH detected faults m To verify that the replacement of a FRU has cleared the fault and not generated any additional faults At the sc gt prompt type the showfaults command The following showfaults command examples show the different kinds of output from the showfaults command m Example of the showfaults command when no faults are present sc gt showfaults Last POST run THU MAR 09 16 52 44 2006 POST status Passed all devices No failures found in System a Example of the showfaults command displaying an environmental fault sc gt showfaults v Last POST run TUE FEB 07 18 51 02 2006 POST status Passed all devices ID FRU Fault 0 IOBD VOLTAGE_SENS
85. reset the server so that the ASR command takes effect sc gt reset 3 48 Sun Fire T1000 Server Service Manual January 2007 a7 Enabling Disabled Components The enablecomponent command enables a disabled component by removing it from the ASR blacklist 1 At the sc gt prompt enter the enablecomponent command sc gt enablecomponent MB CMP0 CH3 R1 D1 SC Alert MB CMP0 CH3 R1 D1 reenabled 2 After receiving confirmation that the enablecomponent command is complete reset the server so that the ASR command takes effect sc gt reset 3 8 Exercising the System With SunVTS Sometimes a server exhibits a problem that cannot be isolated definitively to a particular hardware or software component In such cases it might be useful to run a diagnostic tool that stresses the system by continuously running a comprehensive battery of tests Sun provides the SunVTS software for this purpose This section describes the tasks necessary to use SunVTS software to exercise your server m Section 3 8 1 Checking Whether SunVTS Software Is Installed on page 3 49 m Section 3 8 2 Exercising the System Using SunVTS Software on page 3 50 3 8 1 Checking Whether SunVTS Software Is Installed This procedure assumes that the Solaris OS is running on the server and that you have access to the Solaris command line 1 Check for the presence of SunVTS packages using the pkginfo command pkginfo 1 SUNWvt
86. rlying features helps you identify and repair memory problems This section describes how the memory is configured and how the server deals with memory faults Sun Fire T1000 Server Service Manual January 2007 3 1 1 1 3 1 1 2 Memory Configuration In the server memory there are eight slots that hold DDR 2 memory DIMMs in the following DIMM sizes m 512 MB maximum of 4 GB 1 GB maximum of 8 GB 2GB maximum of 16 GB All DIMMS installed must be the same size and DIMMs must be added four at a time In addition Rank 0 memory must be fully populated for the server to function See Section 5 5 2 Installing DIMMs on page 5 16 for instructions about adding memory to the server Memory Fault Handling The server uses advanced ECC technology also called chipkill that corrects up to 4 bits in error on nibble boundaries as long as the bits are all in the same DRAM If a DRAM fails the DIMM continues to function The following server features independently manage memory faults a POST Based on ALOM CMT configuration variables POST runs when the server is powered on In normal operation the default configuration of POST diag_level min provides a check to ensure the server will boot Normal operation applies to any boot of the server not intended to test power on errors hardware upgrades or repairs Once the Solaris OS is running PSH provides run time diagnosis of faults When a memory fault is detected
87. s SUNWvtsr SUNWvtsts SUNWvtsmn a If SunVTS software is installed information about the packages is displayed Chapter 3 Server Diagnostics 3 49 3 8 2 a If SunVTS software is not installed you see an error message for each missing package ERROR information for SUNWvts was not found ERROR information for SUNWvtsr was not found The following table lists the SunVTS packages Package Description SUNWvts SunVTS framework SUNWvtsr SunVTS framework root SUNWvtsts SunVTS for tests SUNWvtsmn SunVTS man pages If SunVTS is not installed you can obtain the installation packages from the following places m Solaris Operating System DVDs m Sun Download Center http www sun com oem products vts The SunVTS 6 1 software and future compatible versions are supported on the server SunVTS installation instructions are described in the SunVTS User s Guide Exercising the System Using SunVTS Software Before you begin the Solaris OS must be running You also need to ensure that SunVTS validation test software is installed on your system See Section 3 8 1 Checking Whether SunVTS Software Is Installed on page 3 49 The SunVTS installation process requires that you specify one of two security schemes to use when running SunVTS The security scheme you choose must be properly configured in the Solaris OS for you to run SunVTS For details refer to the SunVTS User s Guide SunVTS software feat
88. sS R SUN microsystems Sun Fire T1000 Server Service Manual Sun Microsystems Inc www sun com Part No 819 3248 13 January 2007 Revision A Submit comments about this document at http www sun com hwdocs feedback Copyright 2007 Sun Microsystems Inc 4150 Network Circle Santa Clara California 95054 U S A All rights reserved Fujitsu Limited provided technical input and review on portions of this material Sun Microsystems Inc and Fujitsu Limited each own or control intellectual property rights relating to products and technology described in this document and such products technology and this document are protected by copyright laws patents and other intellectual property laws and international treaties The intellectual property rights of Sun Microsystems Inc and Fujitsu Limited in such products technology and this document include without limitation one or more of the United States patents listed at http www sun com patents and one or more additional patents or patent applications in the United States or other countries This document and the product and technology to which it pertains are distributed under licenses restricting their use copying distribution and decompilation No part of such product or technology or of this document may be reproduced in any form by any means without prior written authorization of Fujitsu Limited and Sun Microsystems Inc and their applicable licensors if any The furnish
89. sablecomponent asrkey Adds a component to the asr db blacklist where asrkey is the component to disable clearasrdb Removes all entries from the asr db blacklist The showcomponent command might not report all blacklisted DIMMS Note The components asrkeys vary from system to system depending on how many cores and memory are present Use the showcomponent command to see the asrkeys on a given system Note A reset or power cycle is required after disabling or enabling a component If the status of a component is changed with power on there is no effect to the system until the next reset or power cycle Displaying System Components The showcomponent command displays the system components asrkeys and reports their status At the sc gt prompt enter the showcomponent command Chapter 3 Server Diagnostics 3 47 Example with no disabled components sc gt showcomponent Keys ASR state clean Example showing a disabled component sc gt showcomponent Keys ASR state Disabled Devices MB CMP0 CH3 R1 D1 dimm8 deemed faulty 3 7 2 Disabling Components The disablecomponent command disables a component by adding it to the ASR blacklist 1 At the sc gt prompt enter the disablecomponent command sc gt disablecomponent MB CMP0 CH3 R1 D1 SC Alert MB CMP0 CH3 R1 D1 disabled 2 After receiving confirmation that the disablecomponent command is complete
90. splays a fault 6 Is the fault an environmental fault 7 Is the fault a PSH detected fault Identify the fault condition from the fault message Identify and replace the faulty FRU from the PSH message and perform the procedure to clear the PSH detected fault 8 The fault is a POST detected fault Identify and replace the faulty FRU from the POST message and perform the procedure to clear the POST detected faults m gt 9 Contact Support if the fault condition persists FIGURE 3 1 Diagnostic Flowchart Chapter 3 Server Diagnostics TABLE 3 1 Diagnostic Flowchart Actions Action For more information see No Diagnostic Action Resulting Action these sections 1 Check Power OK The Power OK LED is located on the front and rear Section 3 2 Using LEDs and AC OK LEDs of the chassis to Identify the State of on the server The AC OK LED is located on the rear of the server Devices on page 3 8 on each power supply If these LEDs are not on check the power source and power connections to the server 2 Run the ALOM The showfaults command displays the following Section 3 3 2 Running CMT kinds of faults the showfaults showfaults e Environmental faults Command on page 3 16 command to e Solaris Predictive Self Healing PSH detected check for faults faults POST detected faults Faulty
91. stem console prompt to the SC console prompt by issuing the Hash Period escape sequence ok sc gt 4 2 Sun Fire T1000 Server Service Manual January 2007 4 1 3 Using the SC console issue the poweroff command sc gt poweroff fy SC Alert SC Request to Power Off Host Immediately Note You can also use the Power On Off button on the front of the server to initiate a graceful system shutdown Refer to the Sun Fire T1000 Server Administration Guide for more information about the ALOM poweroff command Removing the Server From a Rack If the server is installed in a rack with the extendable slide rails outer and middle section that were supplied with the server use this procedure to remove the server chassis from the rack Optional Issue the following command from the ALOM sc gt prompt to locate the system that requires maintenance sc gt setlocator on Locator LED is on Once you have located the server press the Locator button to turn it off Check to see that no cables will be damaged or interfere when the server chassis is removed from the rack Disconnect the power cord from the power supply Note After you have disconnected the power cord from the power supply you must wait about five seconds before reconnecting the power cord to the power supply Disconnect all cables from the server and label them From the front of the server unlock
92. the Sun Fire T1000 Server Installation Guide and Sun Fire T1000 Server Administration Guide for details FIGURE 3 1 is a flow chart of the diagnostics available to troubleshoot faulty hardware TABLE 3 1 has more information about each diagnostic in this chapter Note POST is configured with ALOM CMT configuration variables TABLE 3 6 If diag_level is set to max diag_level max POST reports all detected FRUs including memory devices with errors correctable by Predictive Self Healing PSH Thus not all memory devices detected by POST need to be replaced See Section 3 4 5 Correctable Errors Detected by POST on page 3 35 Sun Fire T1000 Server Service Manual January 2007 1 Are the Power OK and AC OK LEDs off Faulty hardware suspected Yes 2 Are any faults reported by the ALOM showfaults command Identify faulty 3 Do FRU fromthe the Solaris logs fault message Yes indicate a faulty and replace FRU the FRU f Identify faulty FRU from the 4 Does Sun VTS Sun VTS report message and Yes any faulty replace the devices FRU Identify faulty 5 Does FRU from the POST report POST message any faulty and replace devices the FRU Check the power source and connections Numbers in this flow chart correspond to the Action numbers in Table 2 1 The showfaults command di
93. tion executes a console command after completion of the reset command Reboots the system controller The y option enables you to skip the confirmation question Sets the virtual keyswitch The y option enables you to skip the confirmation question when setting the keyswitch to stby Turns the Locator LED on the server on or off Displays the environmental status of the host server This information includes system temperatures power supply front panel LED hard drive fan voltage and current sensor status See Section 3 3 3 Running the showenvironment Command on page 3 17 Displays current system faults See Section 3 3 2 Running the showfaults Command on page 3 16 Displays information about the FRUs in the server e g lines specifies the number of lines to display before pausing the output to the screen e s displays static information about system FRUs defaults to all FRUs unless one is specified e d displays dynamic information about system FRUs defaults to all FRUs unless one is specified See Section 3 3 4 Running the showfru Command on page 3 19 Displays the status of the virtual keyswitch Displays the current state of the Locator LED as either on or off Displays the history of all events logged in the ALOM CMT event buffers in RAM or the persistent buffers Displays information about the host system s hardware configuration the system serial number and whether the hardwa
94. u can enable and disable individual tests or blocks of tests by clicking the checkbox next to the test name or test category name Tests are enabled when checked and disabled when not checked TABLE 3 8 lists tests that are especially useful to run on this server TABLE 3 8 Useful SunVTS Tests to Run on This Server SunVTS Tests FRUs Exercised by Tests cmttest cputest fputest DIMMS motherboard iutest 11dcachetest dtlbtest and 12sramtest indirectly mptest and systest disktest Disks cables disk backplane nettest netlbtest Network interface network cable CPU motherboard pmemtest vmemtest ramtest DIMMs motherboard serialtest I O serial port interface hsclbtest Motherboard system controller Host to system controller interface Optional Customize individual tests You can customize individual tests by right clicking on the name of the test For example in FIGURE 3 7 right clicking on the text string ce0 nettest brings up a menu that enables you to configure this Ethernet test Chapter 3 Server Diagnostics 3 53 8 Start testing Click the Start button that is located at the top left of the SunVTS window Status and error messages appear in the test messages area located across the bottom of the window You can stop testing at any time by clicking the Stop button During testing SunVTS software logs all status and error messages To view these messages click the Log button or select Log Fi
95. ure Location D Vendor Infineon formerly Siemens D Vendor Part No 72T256220HR3 7A D Vendor Serial No d03ec27 FRU_PROM at MB CMP0 CH3 R1 D1 SEEPROM SPI SPI SPI SPI SPI SPI SC gt D Timestamp MON OCT 03 12 00 00 2005 D Description DDR2 SDRAM 2048 MB D Manufacture Location D Vendor Infineon formerly Siemens D Vendor Part No 72T256220HR3 7A D Vendor Serial No d040924 Chapter 3 Server Diagnostics 3 21 3 4 3 4 1 Running POST Power on self test POST is a group of PROM based tests that run when the server is powered on or reset POST checks the basic integrity of the critical hardware components in the server CPU memory and I O buses If POST detects a faulty component the component is disabled automatically preventing faulty hardware from potentially harming any software If the system is capable of running without the disabled component the system will boot when POST is complete For example if one of the processor cores is deemed faulty by POST the core will be disabled and the system will boot and run using the remaining cores In normal operation the default configuration of POST diag_level min provides a sanity check to ensure the server will boot Normal operation applies to any power on of the server not intended to test power on errors hardware upgrades or repairs Once the Solaris OS is running PSH provides run time diagnosis of faults Note
96. ures both character based and graphics based interfaces This procedure assumes that you are using the graphical user interface GUI on a system running the Common Desktop Environment CDE For more information about the character based SunVTS TTY interface and specifically for instructions on accessing it by tip or telnet commands refer to the SunVTS User s Guide 3 50 Sun Fire T1000 Server Service Manual January 2007 3 8 3 SunVTS software can be run in several modes This procedure assumes that you are using the default mode This procedure also assumes that the server is headless that is it is not equipped with a monitor capable of displaying bitmap graphics In this case you access the SunVTS GUI by logging in remotely from a machine that has a graphics display Finally this procedure describes how to run SunVTS tests in general Individual tests may presume the presence of specific hardware or might require specific drivers cables or loopback connectors For information about test options and prerequisites refer to the following documentation m SunVTS Test Reference Manual SPARC m SunVTS Doc Supplement SPARC Using SunVTS Software Log in as superuser to a system with a graphics display The display system should be one with a frame buffer and monitor capable of displaying bitmap graphics such as those produced by the SunVTS GUI Enable the remote display On the display system type usr openwin
97. y the State of Devices on page 3 8 m Section 3 3 Using ALOM CMT for Diagnosis and Repair Verification on page 3 11 m Section 3 4 Running POST on page 3 22 m Section 3 5 Using the Solaris Predictive Self Healing Feature on page 3 39 m Section 3 6 Collecting Information From Solaris OS Files and Commands on page 3 45 m Section 3 7 Managing Components With Automatic System Recovery Commands on page 3 46 m Section 3 8 Exercising the System With SunVTS on page 3 49 3 1 Overview of Server Diagnostics There are a variety of diagnostic tools commands and indicators you can use to troubleshoot a server m LEDs Provide a quick visual notification of the status of the server and of some of the FRUs 3 1 3 2 a ALOM CMT firmware Is the system firmware that runs on the system controller In addition to providing the interface between the hardware and OS ALOM CMT also tracks and reports the health of key server components ALOM CMT works closely with POST and Solaris Predictive Self Healing technology to keep the system up and running even when there is a faulty component a Power on self test POST Performs diagnostics on system components upon system reset to ensure the integrity of those components POST is configureable and works with ALOM CMT to take faulty components offline if needed and blacklist them in the asr db m Solaris OS Predictive Self Healing PSH This tech
98. ying Power to the Server Note If you have just disconnected the power cord from the power supply you must wait about five seconds before reconnecting the power cord to the power supply Reconnect the power cord to the power supply Note As soon as the power cord is connected standby power is applied Depending on the configuration of the firmware the system might boot 6 2 Sun Fire T1000 Server Service Manual January 2007 APPENDIX A Field Replaceable Units FIGURE A 1 shows the locations of the field replaceable units FRUs in the server TABLE A 1 lists the FRUs Note that item number 4 in FIGURE A 1 is a 3 5 inch SATA drive used in the single drive configuration The 2 5 inch SAS drives used in the dual drive configuration look different but would be installed in the same location in the server A 1 FIGURE A 1 Field Replaceable Units Sun Fire T1000 Server Service Manual January 2007 A 2 TABLE A 1 Server FRU List Replacement Item No FRU Instructions Description Location 1 Motherboard Section 5 6 The motherboard and chassis are MB and chassis Replacing the replaced as a single assembly The assembly Motherboard and motherboard is provided in different Chassis on configurations to accommodate the page 5 20 different processor models 6 core and 8 core 2 DIMMs Section 5 5 Can be ordered in the following See TABLE 5 1 Replacing sizes and FIGURE 5 11 DIMMs on e 512
Download Pdf Manuals
Related Search
Related Contents
USER GUIDE INTR ODUCTION GETTING STAR TED SPLIT-TYPE AIR CONDITIONERS C2G 1.5ft, HDMI - micro HDMI Copyright © All rights reserved.
Failed to retrieve file