Home
Sun Blade T6300 Server Module Service Manual
Contents
1. button a tt H DIMM1 J6401 Channel o E DIMMO J6301 Four DIMMs installed D i D Channel 3 oc aa DIMM1 J7301 DIMM1 J6701 i Channel 1 a DIMMO J6601 E D EU Ei D a Four DIMMs Eight DIMMs installed installed DIMMO J6901 El Channel2 DIMM1 J7001 ei D RU 5 FIGURE 4 6 DIMM Installation Rules Use FIGURE 4 6 and TABLE 4 1 to map DIMM names that are displayed in messages to the socket numbers of the DIMMs Chapter 4 Replacing Cold Swappable Components 4 11 TABLE 4 1 DIMM Names and Socket Numbers Channel Number DIMM Name Used in Messages Socket No Channel 0 MB CMPO CHO RO DO J6301 MB CMPO CHO RO D1 J6401 Channel 1 MB CMP0 CH1 R0 D1 J6601 MB CMP0 CH1 R0 D1 J6701 Channel 2 MB CMP0 CH2 R0 DO J6901 MB CMP0 CH2 R0 D1 J7001 Channel 3 MB CMPO CH3 R0 DO J7201 MB CMP0 CH3 RO0 D1 J7301 Numbering key MB motherboard CMP CPU CH channel R rank D DIMM 3 Note the DIMM locations so that you can install the replacement DIMMs in the same sockets 4 Push down on the ejector levers on each side of the DIMM connector until the DIMM is released Si FIGURE 4 7 Removing DIMMs 7 5 Grasp the top corners of the faulty DIMM and remove it from the system 6 Place DIMMs on an antistatic mat 4 12 Sun Blade T6300 Server Module Service Manual April 2007 4 3 3 Replacing a DIMM Unpackage the replacement DIMMs and place them on
2. IO Bridge Quick Read Only of CSR and ID Os 0 SS Sa Se Sree pec SSS tae ae eS eh SS er mr Ser Sea du 0 0 gt fire 1 JBUSID 00000080 0f000000 0 0 gt fc000002 e03dda23 On ne y en H 2 30 Sun Blade T6300 Server Module Service Manual April 2007 0 gt Begin 0 gt Config 0 gt Config 0 gt Config 0 gt Config 0 gt Config 0 gt Config 0 gt End 0 gt Begin 0 gt VID 10b5 DI 0 0 gt INF VID 8086 DI 0 0 gt INFO VID 10b5 DI 0 gt End 0 gt Begin O o 6106 cc O cc xv o gt Or 010 S O OC OS OS 0 gt End 0 gt INFO 0 gt 0 gt POST o IO Bridge IO Bridge IO Bridge IO Bridge IO Bridge IO Bridge IO Bridge IO Bridge IO Bridge IO Bridge IO Bridge IO Bridge bu bu bu bu port A port A port port port bu port B bu IO Bridge B B B D 8532 Lin 0 100000 D 105e 100000 D 8532 Lin Lin IO Bridge unit 1 PCI id test Quick JBI Quick JBI Retur 0 gt fire 1 JBUSCSR 00000080 0f410000 Ouick Read unit unit uni unit unit lin uni lin unit lin unit lin uni unit PPPRPPPRPRPP EP PB unit Con s dev s dev s dev s dev WwW wU NUN NO RP OR o s dev 4 0 unit 1 Con dev sS kWidth 8 kWidth 4 kWidth 8 n to VBSC SC Alert Host System has Reset Chapter 2 Sun Blade T6300 Server Module Diagnostics k train k train k train k train func O O O O O func
3. 4 1 1 4 1 2 4 1 2 1 4 1 2 2 Safety Symbols The following symbols might appear in this manual note their meanings Caution There is a risk of personal injury and equipment damage To avoid personal injury and equipment damage follow the instructions Caution Hot surface Avoid contact Surfaces are hot and might cause personal injury if touched Caution Hazardous voltages are present To reduce the risk of electric shock and danger to personal health follow the instructions Electrostatic Discharge Safety Electrostatic discharge ESD sensitive devices such as the motherboard hard drives and memory cards require special handling Caution The boards and hard drives contain electronic components that are extremely sensitive to static electricity Ordinary amounts of static electricity from clothing or the work environment can destroy components Do not touch the components along their connector edges Using an Antistatic Wrist Strap Wear an antistatic wrist strap and use an antistatic mat when handling components such as drive assemblies boards or cards When servicing or removing server module components attach an antistatic strap to your wrist and then to a metal area on the chassis Do this after you disconnect the power cords from the server module Following this practice equalizes the electrical potentials between you and the server module Using an Antistatic Mat Place E
4. diag_level min If diag_mode normal runs minimum set of tests max If diag_mode normal runs all the minimum tests plus extensive CPU and memory tests diag_trigger none Does not run POST on reset diag_verbosity user_reset power_on_reset error_reset all_reset none min normal max Runs POST upon user initiated resets Only runs POST for the first poweron This is the default Runs POST if fatal errors are detected Runs POST after any reset No POST output is displayed POST output displays functional tests with a banner and pinwheel POST output displays all test and informational messages POST displays all test informational and some debugging messages Set all of these parameters using the ALOM CMT setsc command except for the setkeyswitch command Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 23 diag_mode user_reset power_on reset error_reset diag_trigger Service Mode Forces a Sun prescribed level of diagnostic execution Overrides user defined settings as if parameters were diag_level max diag_verbosity max diag_trigger all resets User defined settings are not modified System Boot OpenBoot PROM Normal Mode Diagnostic execution is enabled User defined settings control test coverage and verbosity via diag_level diag_verbosity diag_trigger FIGURE 2 5 Flowchart of ALOM CMT Variables for POST Configuration 2 24
5. 17 syntax 2 16 troubleshooting with 2 4 showfru command 2 16 2 20 showkeyswitch command 2 16 showlocator command 2 16 showlogs command 2 16 showplatformcommand 1 8 2 16 shutting down the system 4 3 Solaris log files 2 4 Solaris OS collecting diagnostic information from 2 39 Solaris Predictive Self Healing PSH detected faults 2 4 specifications A 1 standby power 4 23 state of server module 2 10 striping disks 1 8 striping support 1 8 SunSolve online 1 10 SunVTS 2 4 exercising the system with 2 44 user interfaces 2 45 SUNW MSG ID online 1 10 2 35 support obtaining 2 4 syslogd daemon 2 40 system configuration PROM 4 16 system console switching to 2 14 system controller card removing 4 16 replacing 4 17 system controller see service processor 4 15 system status LEDs interpreting 2 9 system temperatures displaying 2 18 T tools required 4 3 troubleshooting actions 2 4 DIMMs 2 8 U UltraSPARC T1 processor and PSH 2 35 overview 1 8 universal connector port features 1 5 Universal Unique Identifier UUID 2 34 2 36 unremoveblade command 2 16 Index 4 Sun Blade T6300 Server Module Service Manual April 2007 V virtual keyswitch 2 26 4 14 voltage and current sensor status displaying 2 18 Index 5 Index 6 Sun Blade T6300 Server Module Service Manual April 2007
6. 6H returns the following information for corrective action CPU errors exceeded acceptable levels Type Fault Severity Major Description The number of errors associated with this CPU has exceeded acceptable levels Automated Response The fault manager will attempt to remove the affected CPU from service Impact System performance may be affected Suggested Action for System Administrator Schedule a repair procedure to replace the affected CPU the identity of which can be determined using fmdump v u lt EVENT_ID gt Details The Message ID SUNAU 8000 6H indicates diagnosis has determined that a CPU is faulty The Solaris fault manager arranged an automated attempt to disable this CPU The recommended action for the system administrator is to contact Sun support so a Sun service technician can replace the affected component c Follow the suggested actions to repair the fault Clearing PSH Detected Faults When the Solaris PSH facility detects faults the faults are logged and displayed on the console After the fault condition is corrected for example by replacing a faulty FRU you must clear the fault Note If you are dealing with faulty DIMMs do not follow this procedure Instead perform the procedure in Section 4 3 3 Replacing a DIMM on page 4 13 1 After replacing a faulty FRU boot the system Chapter2 Sun Blade T6300 Server Module Diagnostics 2 37 299 2 At the ALOM CM
7. Access 0 gt End DMMU TLB TAGS Access 0 gt Begin DMMU CAM 0 gt End DMMU CAM 0 gt Begin Setup DMMU Miss Handler 0 gt End Setup DMMU Miss Handler 0 gt Niagara Version 2 0 0 gt Serial Number 00000098 00000820 fffff238 2e4df502 0 gt Begin Init JBUS Config Regs 0 gt End Init JBUS Config Regs 0 gt Begin IO Bridge unit 1 init test 0 gt End IO Bridge unit 1 init test 0 gt sys 200 MHz CPU 1000 MHz mem 200 MHz 0 gt Begin Integrated POST Testing 0 gt End Integrated POST Testing 0 gt L2 Tests 0 gt Begin Setup L2 Cache 0 gt L2 Cache Control 00000000 00300000 0 gt End Setup L2 Cache 0 gt Begin L2 Cache Tags Test 0 gt End L2 Cache Tags Test 0 gt Begin Scrub and Setup L2 Cache 0 gt L2 Directory clear 0 gt L2 Scrub VD amp UA 0 gt L2 Scrub Tags 0 gt End Scrub and Setup L2 Cache 0 gt Test Memory 0 gt Begin Probe and Setup Memory 0 gt INFO 4096MB at Memory Channel 1 2 Rank 0 Stack 0 Or Os O vv Oe O c ce o cc oe 0 OO Os O 40 5 Y Os Os 070 GY IOs OO O 0 O 010 O Oc o 2 28 Sun Blade T6300 Server Module Service Manual April 2007 NO NO G Or GO Oe 0106 06 Or GO 0 2 09 60 02 92 0 0 x 20c 60 O 02 0 bo 0 cc oc co 0 gt Begin Data Bitwalk 0 gt L2 Scrub Data 0 gt L2 Enable 0 gt Testing Memory Channel 0 gt Testing Memory Channel 0 gt L2 Di
8. FIGURE 2 1 and TABLE 2 1 describe an approach for using the server module diagnostics to identify a faulty field replaceable unit FRU The diagnostics you use and the order in which you use them depend on the nature of the problem you are troubleshooting so you might perform some actions and not others Use this flowchart to understand what diagnostics are available to troubleshoot faulty hardware and use TABLE 2 1 to find more information about each diagnostic in this chapter Sun Blade T6300 Server Module Service Manual April 2007 Faulty hardware suspected Identify faulty FRU from the fault message and replace the FRU Identify faulty FRU from the Sun VTS message and replace the FRU Identify faulty FRU from the POST message and replace the FRU FIGURE 2 1 Diagnostic Flowchart Yes Yes 1 Are the Power OK and AC OK LEDs off Yes 2 Are any faults reported by the ALOM showfaults command 3 Do the Solaris logs indicate a faulty FRU 4 Does Sun VTS report any faulty devices 5 Does POST report any faulty devices Check the power source and connections Numbers in this flowchart correspond to the Action numbers in Table 2 1 The showfaults command displays a fault 6 Is the fault an environmental fault 7 Is the fault
9. PSH facility are also reported through ALOM CMT alerts In addition to the PSH fmdump command the ALOM CMT showfaults command also provides information about faults and displays fault UUIDs See Section 2 3 2 Displaying System Faults on page 2 17 Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 35 2 36 1 Check the event log using the fmdump command with v for verbose output fmdump v TIME UUID SUNW MSG ID Apr 24 06 54 08 2005 lce22523 1c80 6062 e61d f3b39290ae2c SUN4V 8000 6H 100 fault cpu ultraSPARCT112cachedata FRU he component MB rsrc cpu cpuid 0 serial 22D1D6604A In this example a fault is displayed indicating the following details Date and time of the fault Apr 24 06 54 08 2005 Universal Unique Identifier UUID that is unique for every fault 1ce22523 lc80 6062 e61d f3b39290ae2c Sun message identifier SUNW4V 8000 6H that can be used to obtain additional fault information Faulted FRU FRU hc component MB that in this example is identified as MB indicating that the motherboard reguires replacement 2 Use the Sun message ID to obtain more information about this type of fault a In a browser go to the Predictive Self Healing Knowledge Article web site http www sun com msg Sun Blade T6300 Server Module Service Manual April 2007 2 5 2 b Enter the message ID in the SUNW MSG ID field and press Lookup In this example the message ID SUN4U 8000
10. Resulting Action For more information see these sections 7 Determine if the fault was detected by PSH Determine if the fault was detected by POST Contact Sun for support If the fault message displays the following text the fault was detected by the Solaris Predictive Self Healing software Host detected fault If the fault is a PSH detected fault identify the faulty FRU from the fault message and replace the faulty FRU After the FRU is replaced perform the procedure to clear PSH detected faults POST performs basic tests of the server module components and reports faulty FRUs When POST detects a faulty FRU it logs the fault and if possible takes the FRU offline POST detected FRUs display the following text in the fault message FRU name deemed faulty and disabled In this case replace the FRU and run the procedure to clear POST detected faults The majority of hardware faults are detected by the server module diagnostics In rare cases it is possible that a problem requires additional troubleshooting If you are unable to determine the cause of the problem contact Sun for support Section 2 5 Using the Solaris Predictive Self Healing Feature on page 2 34 Section 4 2 Common Procedures for Parts Replacement on page 4 3 Section 2 5 2 Clearing PSH Detected Faults on page 2 37 Section 2 5 3 Clearing the PSH Fault From the ALOM CMT Logs on page 2 38 Sect
11. SUNWvtsr was not found TABLE 2 9 lists some SunVTS packages TABLE2 9 Sample of installed SunVTS Packages Package Description SUNWvts SunVTS framework SUNWvtsr SunVTS Framework root SUNWvtsts SunVTS for tests SUNWvtsmn SunVTS man pages If SunVTS is not installed you can obtain the installation packages from the following resources m Solaris Operating System DVDs m Sun Download Center http www sun com oem products vts The SunVTS 6 3 software and future compatible versions are supported on the Sun Blade T6300 server module SunVTS installation instructions are described in the Sun VTS 6 3 User s Guide 820 0080 Exercising the System Using SunVTS Software Before you begin the Solaris OS must be running You must verify that SunVTS validation test software is installed on your system See Section 2 8 1 Checking SunVTS Software Installation on page 2 43 The SunVTS installation process reguires that you specify one of two security schemes to use when running SunVTS The security scheme you choose must be properly configured in the Solaris OS for you to run SunVTS 2 44 Sun Blade T6300 Server Module Service Manual April 2007 SunVTS software features both character based and graphics based interfaces For more information about the character based SunVTS TTY interface and specifically for instructions on accessing it by TIP or telnet commands refer to the Sun VTS 6 3 User s Guide Cha
12. Weight 20 15 in 12 9 in 1 7 in 17 lbs 512 mm 327 mm 44 mm 8 kg If the server module is placed in an enclosure ensure that there is adequate airflow from front to rear A 1 FIGURE A 1 Server Module Dimensions A 2 Sun Blade T6300 Server Module Service Manual April 2007 A 2 Motherboard Block Diagram yMHHBBBBB Memory ip a HE PCle x8Lanes ie ae bu j PCle x8Lanes 10 7 NON GB sec NEM 0 PCle x4 Lanes 2x Gbit Ethernet NEM 1 UltraSPARC Ti PCle x8Lanes EM 1 Fire i PCle x8Lanes EBus PCle PCle x4L i UART xilas Cle x4Lanes NS F gt Passive Midplane EE GO Server module front panel IGS and adapter Posix Serial SAS links PCle to mu fe s Bridge ALOM 10 100Mbps P SP Mgmt Ethernet ALOM Serial FIGURE A 2 Motherboard Block Diagram Appendix A Specifications A 3 A 4 Sun Blade T6300 Server Module Service Manual April 2007 Index A advanced ECC technology 2 7 Advanced Lights Out Management ALOM CMT configuration parameters 4 16 connecting to 2 14 diagnosis and repair of server 2 12 POST and 2 22 prompt 2 14 service related commands 2 14 airflow blocked 2 4 ALOM CMT see Advanced Lights Out Management ALOM CMT antistatic mat 4 2 antistatic wrist strap 4 2 architecture designation 1 5 ASR blacklist 2 41 2 42 asrkeys 2 41 asrkeys 2 41 Automatic System Recovery ASR 2 40 B battery service processor
13. an antistatic mat Ensure that the connector ejector tabs are in the open position Line up a replacement DIMM with the connector Align the DIMM notch with the key in the connector a Push each DIMM into a connector until the ejector tabs lock the DIMM in place a Perform the procedures described in Section 4 6 Finishing Component Replacement on page 4 22 Access the ALOM CMT sc gt prompt At the console issue the key sequence Run the showfaults v command to determine how to clear the fault The method that you use to clear a fault depends on how the fault is identified by the showfaults command Examples m If the fault is a host detected fault a message similar to the following is displayed sc gt showfaults v ID Time FRU Fault O SEP 09 11 09 26 MB CMPO CHO RO DO Host detected fault MSGID SUN4U 8000 2S UUID 7ee0e46b ea64 6565 e684 e996963 7b86 Go to Step 8 Chapter 4 Replacing Cold Swappable Components 4 13 m If the fault resulted in the FRU being disabled a message similar to the following is displayed sc gt showfaults v ID Time FRU Fault T OCT 13 12247327 MB CMPO CHO RO DO MB CMPO CHO RO DO deemed faulty and disabled Run the enablecomponent command to enable the FRU sc gt enablecomponent MB CMPO CHO RO DO 6 Perform the following steps to verify that there are no faults a Set the virtual keyswitch to diag so that POST w
14. g lines s d FRU showkeyswitch showlocator showlogs b lines e lines v g lines p logtype r p showplat form v Indicates if it is OK to perform a hot swap of a power supply This command does not perform any action but provides a warning if the power supply should not be removed because the other power supply is not enabled Pauses the service processor tasks and illuminates the white locator LED indicating that it is safe to remove the blade Turns off the locator LED and restores the service processor state Generates a hardware reset on the host server The y option enables you to skip the confirmation question The c option executes a console command after completion of the reset command Reboots the service processor The y option enables you to skip the confirmation question Sets the virtual keyswitch The y option enables you to skip the confirmation question when setting the keyswitch to stby Turns the Locator LED on the server on or off Displays the environmental status of the host server This information includes system temperatures power supply front panel LED hard drive fan voltage and current sensor status See Section 2 3 3 Displaying the Environmental Status on page 2 18 Displays current system faults See Section 2 3 2 Displaying System Faults on page 2 17 Displays information about the FRUs in the server The g lines option specifies the number
15. qualified by Sun DIMMs are installed in groups of four with four DIMMs of the same capacity FIGURE 2 2 a All DIMMS must use DDR 2 four data input output DRAMs m Each set of four DIMMS must have the exact same DRAM devices on the DIMM for example four DIMMs must have 256 Mbyte DRAMs or four DIMMS have 512 Mbyte DRAMs a All DIMMS must be 72 bit ECC If the DIMMs are not properly configured the system issues a message and the system does not boot See Section 5 2 Installing DIMMS on page 5 89 for DIMM installation instructions 2 6 Sun Blade T6300 Server Module Service Manual April 2007 DIMM locate button DIMM1 J6401 u crono 0 ti g DIMM1 J6701 Channel 1 DIMMO J6301 Four DIMMs installed D D IRIA DIMMO J6601 Four DIMMs installed DIMMO J6901 G ci 2 DIMM J7001 g g FIGURE 2 2 DIMM Installation Rules 2 1 1 4 aw TINNY J7201 DIMM1 J7301 di Fe Eight DIMMs installed i D D Memory Fault Handling iE mssi The Sun Blade T6300 server module uses advanced ECC technology also called chipkill that corrects up to 4 bits in error on nibble boundaries as long as they are all in the same DRAM If a DRAM fails the DIMM continues to function The following server module features manage memory faults independently m POST Runs when the server module is powered on based on ALOM CMT configurati
16. the cover off the chassis Chapter 4 Replacing Cold Swappable Components 4 7 4 3 4 3 1 4 3 1 1 4 3 1 2 Removing and Replacing DIMMS Memory Configuration The Sun Blade T6300 server module has eight slots that hold DDR 2 memory DIMMs in the following DIMM sizes m 1 Gbyte maximum of 8 Gbyte m 2 Gbyte maximum of 16 Gbyte m 4 Gbyte maximum of 32 Gbyte The Sun Blade T6300 server module performs best if all eight connectors are populated with eight DIMMs This configuration also enables the system to continue operating even when a DIMM fails or if an entire channel fails Capacity Restrictions Due to interleaving rules for the CPU the system will operate at the lowest capacity of all the DIMMs installed Therefore it is ideal to install eight identical DIMMs not four DIMMs of one capacity and four DIMMs of another capacity DIMM Installation Rules Caution The following DIMM rules must be followed The server module might not operate correctly if the DIMM rules are not followed Always use DIMMs that have been qualified by Sun DIMMs are installed in groups of four with four DIMMs of the same capacity FIGURE 4 6 a All DIMMS must use DDR 2 four data input output DRAMs m Each set of four DIMMS must have the exact same DRAM devices on the DIMM for example four DIMMs must have 256 Mbyte DRAMs or four DIMMS have 512 Mbyte DRAMs a All DIMMS must be 72 bit ECC If the DIMMs are not properl
17. 0 func func func func POST Passed all devices 00000ff5 13cb7000 jbus perf test jbus perf test int init test int init test port A port A port B port B interrupt test interrupt test tag tag tag tag tag tag fig MB bridges MB PCI SWI MB PCI SWI1 MB PCI SWI1 MB PCI SWI1 CHO CHO CHI CH1 MB PCI SWI MB PCIE IO fig MB bridges IO Bridge unit 1 PCI id test INFO 100000 count read passed for MB PCI SWITCHO Last read Loopback Block Mem Test 0 gt Ouick jbus loopback Test 262144 bytes at 00000000 00600000 Loopback Block Mem Test CH1 count read passed for MB NETO Last read count read passed for MB PCI SWITCH1 Last read 0 gt Master set ACK for vbsc runpost command and spin 2 31 5 Perform further investigation if needed When POST is finished running and if no faults were detected the system will boot If POST detects a faulty device the fault is displayed and the fault information is passed to ALOM CMT for fault handling Faulty FRUs are identified in fault messages using the FRU name For a list of FRU names see Appendix A a Interpret the POST messages POST error messages use the following syntax c s gt ERROR TEST failing test c 5 gt H W under test FRU c s gt Repair Instructions Replace items in order listed by H W under test above c s gt MSG test error message c s gt END_ERROR In this syntax c the core number s the strand num
18. 2 51 provides a fault notification with a message ID MSGID You can use message ID to get additional information about the problem from Sun s knowledge article database The Predictive Self Healing technology covers the following Sun Blade T6300 server module components m UltraSPARC T1 multicore processor CPU m Memory a I O bus The PSH console message provides the following information Type Severity Description Automated response Impact Suggested action for system administrator If the Solaris PSH facility has detected a faulty component use the fmdump command to identify the fault Faulty FRUs are identified in fault messages using the FRU name For a list of FRU names see Appendix A Note Additional Predictive Self Healing information is available at http www sun com msg Identifying Faults With the mdump Command The fmdump command displays the list of faults detected by the Solaris PSH facility Use this command for the following reasons m To see if any faults have been detected by the Solaris PSH facility m If you need to obtain the fault message ID SUNW MSG ID for detected faults m To verify that the replacement of a FRU has cleared the fault and not generated any additional faults If you already have a fault message ID go to Step 2 to obtain more information about the fault from the Sun Predictive Self Healing Knowledge Article web site Note Faults detected by the Solaris
19. 4 8 4 3 1 2 DIMM Installation Rules 4 8 43 2 Removing the DIMMs 4 9 Contents v vi 4 3 3 Replacing a DIMM 4 13 4 3 4 Removing the Service Processor 4 15 43 5 Replacing the Service Processor 4 17 44 Removing the Disk Backplane Cables 4 18 4 4 1 Replacing the Disk Backplane Cables 4 19 4 5 Removing the Battery on the Service Processor 4 20 4 5 1 Replacing the Battery on the Service Processor 4 21 4 6 Finishing Component Replacement 4 22 4 6 1 Replacing the Cover 4 22 4 6 2 Reinstalling the Server Module in the Chassis 4 22 A Specifications A 1 A1 Physical Specifications A 1 A2 Motherboard Block Diagram A 3 Index Index 1 Sun Blade T6300 Server Module Service Manual April 2007 FIGURE 1 1 FIGURE 1 2 FIGURE 1 3 FIGURE 1 4 FIGURE 1 5 FIGURE 1 6 FIGURE 2 1 FIGURE 2 2 FIGURE 2 3 FIGURE 2 4 FIGURE 2 5 FIGURE 3 1 FIGURE 3 2 FIGURE 4 1 FIGURE 4 2 FIGURE 4 3 FIGURE 4 4 FIGURE 4 5 FIGURE 4 6 FIGURE 4 7 Figures Sun Blade T6300 Server Module With the Sun Blade 6000 Chassis 1 2 Front and Rear Panels and Optional Cable Dongle 1 3 Optional Dongle Cable Connecting to the Universal Connector Port 1 4 Field Replaceable Units 1 6 PCI Express and Ethernet Connections Between Sun Blade 6000 Chassis and Sun Blade T6300 Server Module 1 7 Serial Number and MAC Address Location 1 9 Diagnostic Flowchart 2 3 DIMM Installation Rules 2 7 Front Panel and Hard Drive LEDs 2 9 ALOM CMT Fa
20. All information related to Solaris system administration commands and features Third Party Web Sites Sun is not responsible for the availability of third party web sites mentioned in this document Sun does not endorse and is not responsible or liable for any content advertising products or other materials that are available on or through such sites or resources Sun will not be responsible or liable for any actual or alleged damage or loss caused by or in connection with the use of or reliance on any such content goods or services that are available on or through such sites or resources xiv Sun Blade T6300 Server Module Service Manual April 2007 Documentation Support and Training Sun Function URL Documentation http www sun com documentation Support http www sun com support Training http www sun com training Sun Welcomes Your Comments Sun is interested in improving its documentation and welcomes your comments and suggestions You can submit your comments by going to http www sun com hwdocs feedback Please include the title and part number of your document with your feedback Sun Blade T6300 Server Module Service Manual part number 820 0276 Preface xv xvi Sun Blade T6300 Server Module Service Manual April 2007 CHAPTER 1 Sun Blade T6300 Server Module Product Description This chapter provides an overview of the features of the Sun Blade T6300 server module A serv
21. Blade T6300 Server Module Service Manual April 2007 At the sc gt prompt enter the showfru command In the following example the showfru command is used to get information about the motherboard MB sc gt showfru MB SEEPROM SEGMENT SD ManR ManR UNIX_ Timestamp32 WED FEB 14 18 24 28 2007 ManR Description ASSY Sun Fire T6300 CPU Board ManR Manufacture Location Sriracha Chonburi Thailand ManR Sun Part No 5016843 ManR Sun Serial No NC00OD ManR Vendor Celestica ManR Initial HW Dash Level 06 ManR Initial HW Rev Level 02 ManR Shortname T2000_MB SpecPartNo 885 0483 04 SEGMENT FL Configured_LevelR Configured_LevelR UNIX_Timestamp32 WED FEB 14 18 24 28 2007 Configured_LevelR Sun_Part_No 5410827 Configured_LevelR Configured_Serial_No N4001A Configured_LevelR HW_Dash_Level 03 Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 21 2 4 2 4 1 Running POST Power on self test POST is a group of PROM based tests that run when the server module is powered on or reset POST checks the basic integrity of the critical hardware components in the server module CPU memory and I O buses If POST detects a faulty component it is disabled automatically preventing faulty hardware from potentially harming any software If the system is capable of running without the disabled component the system will boot when POST is complete For example if one of the
22. Enabling a Disabled Component With the enablecomponent Command The enablecomponent command enables a disabled component by removing it from the ASR blacklist 1 At the sc gt prompt enter the enablecomponent command sc gt enablecomponent MB CMP0 CH3 R1 D1 SC Alert MB CMPO CH3 R1 D1 reenabled 2 After receiving confirmation that the enablecomponent command is complete reset the server module for so that the ASR command takes effect sc gt reset 2 8 Exercising the System With SunVTS Sometimes a system exhibits a problem that cannot be isolated definitively to a particular hardware or software component In such cases it might be useful to run a diagnostic tool that stresses the system by continuously running a comprehensive battery of tests Sun provides the SunVTS software for this purpose 2 8 1 Checking SunVTS Software Installation This procedure assumes that the Solaris OS is running on the Sun Blade T6300 server module and that you have access to the Solaris command line Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 43 2 8 2 1 Check for the presence of Sun VTS packages using the pkginfo command pkginfo 1 SUNWvts SUNWvtsr SUNWvtsts SUNWvtsmn a If SunVTS software is loaded information about the packages is displayed m If SunVTS software is not loaded you see an error message for each missing package ERROR information for SUNWvts was not found ERROR information for
23. FM5 FIN OK 4017 MP FM5 FOUT OK 4017 Voltage Sensors in Volts Sensor Status Voltage LowSoft LowWarn HighWarn HighSoft MB V_VCORE OK 1 30 1 02 1 08 1 40 1 58 MB V_VTTB OK 0 87 0 76 0 81 0 99 1 03 MB V_VTTT OK 0 87 0 76 0 81 0 99 1 03 MB V_VCCB OK 1 78 1 53 1 62 1 98 2 07 MB V_VCCT OK 1 76 1 53 1 62 1 98 2 07 MB V_ 1V1 OK 1 10 0 85 0 90 1 15 1 18 MB V_ 1V2 OK 1 18 1 02 1 08 1 32 1 38 MB V_ 1V5 OK 1 46 1 28 1 35 1 65 1 72 MB V_ 1V8 OK 1 79 1 53 1 62 1 98 2 07 MB V_ 3V3 OK 3533 2 80 3 00 3 63 3 80 MB V_ 3V3STBY OK 3 34 2 80 297 303 3 80 MB V_ 5V OK 4 91 4 25 4 50 5 50 5 75 MB V_ 12V OK 12 18 10 20 10 80 13 20 13 80 Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 19 SC BAT V_BAT OK 3 03 2 25 sm MP V_ 12V OK 12 40 10 20 10 80 13 20 13 80 Sensor Status Load Warn Shutdown MB I_CORE OK 10 760 80 000 88 000 MB I_MEMB OK 2 040 60 000 66 000 MB I_MEMT OK 1 740 60 000 66 000 MB I_12V OK 9 000 40 000 45 000 Supply Present On MP PSO PRESENT FAULTED MP PS1 PRESENT OK sc gt Note Some environmental information might not be available when the server module is in standby mode 2 2 4 Displaying FRU Information The showfru command displays information about the FRUs in the server module Use this command to see information about an individual FRU or for all the FRUs Note By default the output of the show ru command for all FRUs is very long 2 20 Sun
24. FRU name 1 6 replacing 4 20 blacklist ASR 2 41 bootmode command 2 15 break command 2 15 button Locator 4 4 Power 2 11 C cable kit 1 6 cables disk backplane removing 4 18 replacing 4 19 cfgadm command 3 2 3 5 chassis illustration 1 2 reinstalling server 4 22 serial number 1 8 chip multithreading CMT 1 8 chipkill 2 7 clearasrdb command 2 41 clearfault command 2 15 2 39 4 15 clearing POST detected faults 2 33 clearing PSH detected faults 2 37 common procedures for parts replacement 4 3 component replaceable 1 6 components disabled 2 41 2 42 components displaying the state of 2 41 connecting to ALOM CMT 2 14 connector locations 1 2 connector front panel 1 3 console 2 14 console command 2 15 2 27 4 14 consolehistory command 2 15 cooling 1 5 cores 1 8 cover removing 4 7 cover replacing 4 22 Index 1 D DDR 2 memory DIMMs 2 6 4 8 diag_level parameter 2 23 2 25 diag_mode parameter 2 22 2 25 diag_trigger parameter 2 23 2 25 diag_verbosity parameter 2 23 2 25 diagnostics about 2 1 flowchart 2 3 low level 2 22 running remotely 2 12 SunVTS 2 43 DIMMs 1 6 example POST error output 2 32 installation rules 2 6 4 8 interleaving 2 6 4 8 names and socket numbers 4 12 replacing 4 13 troubleshooting 2 8 disablecomponent command 2 41 2 42 disabled component 2 42 disabled DIMMs 4 14 disk configuration RAID 1 8 striping 1 8 disk drives see har
25. MT enables you to run remote diagnostics such as power on self test POST that would otherwise require physical proximity to the server module serial port You can also configure ALOM CMT to send email alerts of hardware failures hardware warnings and other events related to the server module or to ALOM The ALOM CMT circuitry runs independently of the server module using the server module standby power Therefore ALOM CMT firmware and software continue to function when the server module operating system goes offline or when the server module is powered off Note Refer to the Advanced Lights out Management ALOM CMT v1 3 Guide 819 7981 for comprehensive ALOM CMT information Faults detected by ALOM CMT POST and the Solaris Predictive Self healing PSH technology are forwarded to ALOM CMT for fault handling FIGURE 2 4 In the event of a system fault ALOM CMT ensures that the Service Action Required LED is lit FRU ID PROMs are updated the fault is logged and alerts are displayed faulty FRUs are identified in fault messages using the FRU name For a list of FRU names see Appendix A 2 12 Sun Blade T6300 Server Module Service Manual April 2007 Service Required LED FRU LEDs Environmentals mes 7 ALOM POST gt fault manager FRUID PROMS Logs Solaris PSH Alerts i FIGURE 2 4 ALOM CMT Fault Management ALOM CMT sends alerts to all ALOM CMT users that
26. Memory Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 29 0 gt Begin DMMU Registers Access 0 gt End DMMU Registers Access 0 gt Begin DMMU Registers Access 0 gt Begin DMMU Registers Access 0 gt End DMMU Registers Access 0 gt End DMMU Registers Access S0 gt CPU 2 0 2 3 4 5 6 rg 9 BOO TT TA 13 14 15 16 17 18 19 28 29 0 2 0 gt Test slave strand registers 0 gt Extended CPU Tests J J amp amp O UOUNMmDOUN JB s UO Uw 0 gt Begin I Cache RAM Test 0 gt End I Cache RAM Test 0 gt Begin I Cache RAM Test 0 gt End I Cache RAM Test 0 gt Begin I Cache RAM Test 0 gt End I Cache RAM Test 0 gt Begin I Cache RAM Test 0 gt End I Cache RAM Test 0 gt Begin I Cache RAM Test 0 gt End I Cache RAM Test 0 gt Scrub Memory 0 gt Begin Scrub Memory 0 gt Scrub 00000000 00600000 gt 00000001 00000000 on Memory Channel 1 2 Rank 0 Stack 0 0 0 gt End Scrub Memory 0 0 gt Extended Memory Tests 0 0 gt Begin Print Mem Config 0 0 gt Caches Icache is ON Dcache is ON 0 0 gt Bank 0 4096MB 00000000 00000000 gt 00000001 00000000 0 0 gt End Print Mem Config 0 0 gt Begin Block Mem Test 0 0 gt Test 4288675840 bytes at 00000000 00600000 Memory Channel 1 2 Rank 0 Stack 0 OO y a 0 0 gt End Block Mem Test 0 0 gt IO Bridge Tests 0 0 gt Begin IO Bridge Ouick Read 0 0 gt Aa a o CD WDD UN cd 0 0 gt
27. Qe SUN microsystems Sun Blade T6300 Server Module Service Manual Sun Microsystems Inc www sun com Part No 820 0276 10 April 2007 Revision A Submit comments about this document at http www sun com hwdocs feedback Copyright 2007 Sun Microsystems Inc 4150 Network Circle Santa Clara California 95054 U S A All rights reserved Sun Microsystems Inc has intellectual property rights relating to technology fiat is described in this document In particular and without limitation these intellectual property rights may include one or more of the U S patents listed at http www sun com patents and one or more additional patents or pending patent applications in the U S and in other countries This document and the product to which it pertains are distributed under licenses restricting their use copying distribution and decompilation No part of the product or of this document may be reproduced in any form by any means without prior written authorization of Sun and its licensors if any Third party software including font technology is copyrighted and licensed from Sun suppliers Parts of the one may be derived from Berkeley BSD systems licensed from the University of California UNIX is a registered trademark in the U S and in other countries exclusively licensed through X Open Company Ltd Sun Sun Microsystems the Sun logo AnswerBook2 docs sun com Java OpenBoot SunSolve SunVTS Sun Blade and So
28. RJ45 virtual console DB 9 serial male TTYA VGA 15 pin female USB 2 0 not supported two connectors FIGURE 4 1 Disconnecting the Cable Dongle Note The front panel of the server module is for temporary conections only You should disconnect the cable dongle from the universal connector port after accessing or transferring data 3 Open the ejector levers FIGURE 4 2 Chapter 4 Replacing Cold Swappable Components 4 5 FIGURE 4 2 Removing the Sun Blade T6300 Server Module From the Sun Blade 6000 Chassis 4 While pinching the release latches slowly pull the server module forward until the slide rails latch Caution Do not stack server modules higher than five units tall They might fall Caution Hold the server module firmly so that you do not drop it The server module weighs approximatley 14 to 17 pounds 6 4 8 0 kg A and cause damage or injury 4 6 Sun Blade T6300 Server Module Service Manual April 2007 FIGURE 4 3 Stack Five Server Modules or Fewer 5 Set the server module on an antistatic mat 6 Attach an antistatic wrist strap When servicing or removing server module components attach an antistatic strap to your wrist and then to a metal area on the chassis FIGURE 4 4 Antistatic Mat and Wrist Strap 7 While pressing the top cover release button slide the cover toward the rear of the server module about an inch 2 5 mm 8 Lift
29. RU List FRU Description FRU Name Replacement Instructions Service Controls the host power and monitors host system SC Section 4 3 4 Removing the processor events power and environmental Socketed Service Processor on page 4 15 card EEPROM stores system configuration all Ethernet MAC addresses and the host ID Service Lithium battery SC BAT Section 4 5 Removing the processor Battery on the Service battery Processor on page 4 20 DIMMs 1 Gbyte 2 Gbyte 4 Gbyte MB CMPx Section 4 3 2 Removing the CHx Rx Dx DIMMs on page 4 9 Cable kit Two HDD signal cables and one HDD power cable n a Section 4 4 Removing the Disk Backplane Cables on page 4 18 Hard SFF SAS or SATA 2 5 inch hard drive in NEMO HDDO 1 2 Section 3 2 2 Removing a drive bracket 3 Hard Drive on page 3 2 Server Chassis wtih CPU hard disk backplane and MB New server module Module backplane cables The FRU name is used in system messages 1 6 Sun Blade T6300 Server Module Service Manual April 2007 BL blade server module NEMO NETO NET1 NET2 NET3 NET4 NET5 NET6 NET7 NET8 NET9 NETO NET1 NET2 NET3 NET4 NET5 NET6 NET7 NET8 NET9 FIGURE 1 5 PCI Express and Ethernet Connections Between Sun Blade 6000 Chassis and Sun Blade T6300 Server Module Chapter 1 Sun Blade T6300 Server Module Product Description 1 7 Lid LL2 Chip Multithreaded CMT Multicore Processor and Memory Technology The UltraSPARC T1 multicor
30. SD sensitive components such as the motherboard memory and other PCB cards on an antistatic mat 4 2 Sun Blade T6300 Server Module Service Manual April 2007 4 2 4 2 1 4 2 2 Common Procedures for Parts Replacement Before you can remove and replace internal components you must perform the procedures in this section Required Tools You can service the Sun Blade T6300 server module with the following tools m Antistatic wrist strap m Antistatic mat Shutting Down the System Performing a graceful shutdown ensures that all of your data is saved and that the system is ready for restart Log in as superuser or eguivalent Depending on the nature of the problem you might want to view the system status the log files or run diagnostics before you shut down the system For more information refer to m Sun Blade T6300 Seroer Module Administration Guide 820 0277 a Advanced Lights out Management ALOM CMT v1 3 Guide 819 7981 Notify affected users Refer to your Solaris system administration documentation for additional information Save any open files and quit all running programs Refer to your application documentation for specific information on these processes Shut down the Solaris OS Switch from the system console to the ALOM CMT sc gt prompt by typing the Hash Period key sequence Chapter 4 Replacing Cold Swappable Components 4 3 6 At the ALOM CMT sc gt prompt issue the pow
31. Sun Blade T6300 Server Module Service Manual April 2007 TABLE 2 7 shows typical combinations of ALOM CMT variables and associated POST modes TABLE 2 7 ALOM CMT Parameters and POST Modes Keyswitch Normal Diagnostic Mode Diagnostic Service Diagnostic Preset Parameter default settings No POST Execution Mode Values diag_mode normal off service normal setkeyswitch normal normal normal diag diag_level min n a max max diag_trigger power on reset none all resets all resets error reset diag_verbosity normal n a max Description of POST This is the default POST POST does not POST runs the execution configuration This run resulting in full spectrum of configuration tests the quick system tests with the system thoroughly and initialization but maximum output suppresses some of the this is not a displayed detailed POST output suggested configuration max POST runs the full spectrum of tests with the maximum output displayed The setkeyswitch parameter when set to diag overrides all the other ALOM CMT POST variables 2 4 2 Changing POST Parameters 1 Access the ALOM CMT sc gt prompt At the console issue the key sequence ooo I Chapter2 Sun Blade T6300 Server Module Diagnostics 2 25 2 4 3 2 4 3 1 2 4 3 2 2 4 4 2 At the ALOM CMT sc gt prompt use the setsc command to set the POST parameter Example sc gt setsc diag_mode service The setkeyswitch para
32. T prompt use the showfaults command to identify PSH detected faults PSH detected faults are distinguished from other kinds of faults by the text Host detected fault Example sc gt showfaults v ID Time FRU Fault O SEP 09 11 09 26 MB CMP0 CHO RO DO Host detected fault MSGID SUN4U 8000 2S UUID 7ee0e46b ea64 6565 e684 e996963f7b86 If no fault is reported you do not need to do anything else Do not perform the subseguent step Clear the fault from all persistent fault records In some cases even though the fault is cleared some persistent fault information remains and results in erroneous fault messages at boot time To ensure that these messages are not displayed perform the following command fmadm repair UUID Example sc gt fmadm repair 7ee0e46b ea64 6565 e684 e996963 7b86 Clearing the PSH Fault From the ALOM CMT Logs When the Solaris PSH facility detects faults the faults are also logged by the ALOM CMT service processor After the fault condition is corrected for example by replacing a faulty FRU you must clear the fault from the ALOM CMT logs Note If you are dealing with faulty DIMMs do not follow this procedure Instead perform the procedure in Section 4 3 3 Replacing a DIMM on page 4 13 2 38 Sun Blade T6300 Server Module Service Manual April 2007 1 After replacing a faulty FRU at the ALOM CMT prompt use the showfaults command to identify PSH detected fa
33. UID 7ee0e46b ea64 6565 e684 e996963f7b86 2 Displaying the Environmental Status The showenvironment command displays a snapshot of the server module environmental status This command displays system temperatures hard drive status power supply and fan status front panel LED status voltage and current sensors The output uses a format similar to the Solaris OS command prtdiag 1m At the sc gt prompt type the showenvironment command The output differs according to your system s model and configuration Example sc gt showenvironment Environmental Status System Indicator Status SYS LOCATE SYS SERVICE SYS ACT SYS OK_TO_RM OFF ON ON OFF Disk Status Service OKtoRem HDDO OK OFF OFF HDD1 OK OFF OFF HDD2 OK OFF OFF HDD3 OK OFF OFF 2 18 Sun Blade T6300 Server Module Service Manual April 2007 System Temperatures Temperatures in Celsius Sensor Status Temp LowHard LowSoft LowWarn HighWarn HighSoft HighHard MB T_AMB OK 32 10 5 0 45 50 55 MB CMPO T_TCORE OK 45 10 5 0 80 80 85 MB CMPO T_BCORE OK 46 10 5 0 80 80 85 MB FO T_CORE OK 47 10 5 0 95 100 105 Fans Status Speeds in Revolutions Per Minute Sensor Status Speed MP FMO FIN OK 3970 MP FMO FOUT OK 3970 MP FM1 FIN OK 3970 MP FM1 FOUT OK 4017 MP FM2 FIN OK 4066 MP FM2 FOUT OK 4066 MP FM3 FIN OK 3970 MP FM3 FOUT OK 4017 MP FM4 FIN OK 4017 MP FM4 FOUT OK 4017 MP
34. a PSH detected fault Identify the fault condition from the fault message Identify and replace the faulty FRU from the PSH message and perform the procedure to clear the PSH detected fault 8 The fault is a POST detected fault Identify and replace the faulty FRU from the POST F message and perform the procedure to clear the POST detected faults 9 Contact Sun Support if the fault condition persists Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 3 TABLE 2 1 Diagnostic Flowchart Actions Action For more information see No Diagnostic Action Resulting Action these sections 1 Check the OK The OK LED is located on the front of the chassis Section 2 2 Interpreting LED If the LED is not lit check that the blade is properly System LEDs on page 2 9 plugged in and the chassis has power 2 Run the ALOM The showfaults command displays the following Section 2 3 2 Displaying CMT kinds of faults System Faults on showfaults Environmental faults page 2 17 command to check for faults 3 Check the Solaris log files for fault information 4 Run SunVTS software 5 Run POST 6 Determine if the fault is an environmental fault Solaris Predictive Self Healing PSH detected faults e POST detected faults Faulty FRUs are identified in fault messages using the FRU name For a list of FRU names s
35. agnostics 2 41 Example with no disabled components sc gt showcomponent Keys MB CMP0 PO MB CMPO P1 MB CMPO0 P2 MB CMP0 P3 MB CMP0 P4 MB CMP0 P5 MB CMP0 P6 MB CMPO0 P7 MB CMP0 P8 MB CMPO P9 MB CMP0 P10 MB CMPO P11 MB CMP0 P12 MB CMP0 P13 MB CMP0 P14 MB CMPO P15 MB CMP0 P16 MB CMP0 P17 MB CMPO P18 MB CMP0 P19 MB CMP0 P20 MB CMPO P21 MB CMP0 P22 MB CMP0 P23 MB CMP0 P24 MB CMP0 P25 MB CMP0 P26 MB CMP0 P27 MB CMP0 P28 MB CMP0 P29 MB CMP0 P30 MB CMP0 P31 MB CMPO CHO RO DO MB CMPO CHO RO D1 MB CMPO CH1 RO DO MB CMP0 CH1 R0 D1 MB CMPO CH2 R0 DO MB CMP0 CH2 R0 D1 MB CMPO CH3 R0 DO MB CMPO CH3 RO D1 MB PCIEa MB PCIEb MB EMO MB EM1 MB NEMO MB NEM1 MB PCI BRIDGE MB USB TTYA MB NET MB SAS SATA HBA State clean Example showing a disabled component sc gt showcomponent ASR state Disabled Devices MB CMP0 CH3 R1 D1 dimm15 deemed faulty 22 Disabling Components With the disablecomponent Command The disablecomponent command disables a component by adding it to the ASR blacklist 1 At the sc gt prompt enter the disablecomponent command sc gt disablecomponent MB CMP0 CH3 R1 D1 SC Alert MB CMP0 CH3 R1 D1 disabled 2 42 Sun Blade T6300 Server Module Service Manual April 2007 2 After receiving confirmation that the disablecomponent command is complete reset the server module so that the ASR command takes effect sc gt reset 2 7 3
36. ain support for your system you need the serial number The serial number is located on a sticker on the front of the server module FIGURE 1 6 1 8 Sun Blade T6300 Server Module Service Manual April 2007 ee Serial number FIGURE 1 6 Serial Number and MAC Address Location You can also run the ALOM CMT showplat form command to obtain the chassis serial number sc gt showplatform SUNW Sun Blade T6300 Chassis Serial Number YB0079 Slot number 2 Domain Status SO OS Standby sc gt 1 3 Additional Service Related Information Documentation for the Sun Blade T6300 server module and related hardware and software is listed in Accessing Sun Documentation on page xiii Chapter 1 Sun Blade T6300 Server Module Product Description 1 9 The following resources are also available a SunSolves Online Provides a collection of support resources Depending on the level of your service contract you have access to Sun patches the Sun System Handbook the SunSolve knowledge base the Sun Support Forum and additional documents bulletins and related links Access this site at http www sunsolve sun com handbook_pub m Predictive Self Healing Knowledge Database You can access the knowledge article corresponding to a self healing message by taking the Sun Message Identifier SUNW MSG ID and entering it into the field on this page http www sun com msg 1 10 Sun Blade T6300 Serve
37. are logged in sending the alert through email to a configured email address and writing the event to the ALOM CMT event log ALOM CMT can detect when a fault is no longer present and clears the fault in several ways a Fault recovery The system automatically detects that the fault condition is no longer present ALOM CMT extinguishes the Service Action Required LED and updates the FRU PROM indicating that the fault is no longer present a Fault repair The fault has been repaired by human intervention In most cases ALOM CMT detects the repair and extinguishes the Service Required LED In the event that ALOM CMT does not perform these actions you must perform these tasks manually with clearfault or enablecomponent commands ALOM CMT can detect the removal of a FRU in many cases even if the FRU is removed while ALOM CMT is powered off This enables ALOM CMT to know that a fault diagnosed to a specific FRU has been repaired The ALOM CMT clearfault command enables you to manually clear certain types of faults without a FRU replacement or if ALOM CMT was unable to automatically detect the FRU replacement ALOM CMT does not automatically detect hard drive replacement Many environmental faults can automatically recover For example a temperature that is exceeding a threshold might return to normal limits An unplugged power supply can be plugged in The recovery of environmental faults is automatically detected Recovery events ar
38. ault is cleared and should not show up when you run the showfaults command Additionally the Service Action Reguired LED is no longer on Reboot the server module You must reboot the server module for the enablecomponent command to take effect At the ALOM CMT prompt use the showfaults command to verify that no faults are reported sc gt showfaults Last POST run THU MAR 09 16 52 44 2006 POST status Passed all devices No failures found in System 20 2 34 Using the Solaris Predictive Self Healing Feature The Solaris Predictive Self Healing PSH technology enables the Sun Blade T6300 server module to diagnose problems while the Solaris OS is running and mitigate many problems before they negatively affect operations The Solaris OS uses the fault manager daemon fmd 1M which starts at boot time and runs in the background to monitor the system If a component generates an error the daemon handles the error by correlating the error with data from previous errors and other related information to diagnose the problem Once diagnosed the fault manager daemon assigns the problem a Universal Unique Identifier UUID that distinguishes the problem across any set of systems When possible the fault manager daemon initiates steps to self heal the failed component and take the component offline The daemon also logs the fault to the syslogd daemon and Sun Blade T6300 Server Module Service Manual April 2007
39. aulty After the cause of the fault is repaired FRU replacement loose connector reseated and so on you must remove the component from the ASR blacklist The ASR commands TABLE 2 8 enable you to view and manually add or remove components from the ASR blacklist These commands are run from the ALOM CMT sc gt prompt TABLE 2 8 ASR Commands Command Description showcomponent Displays system components and their current state enablecomponent asrkey Removes a component from the asr db blacklist where asrkey is the component to enable disablecomponent asrkey Adds a component to the asr db blacklist where asrkey is the component to disable clearasrdb Removes all entries from the asr db blacklist The showcomponent command might not report all blacklisted DIMMS 8 P Note The components asrkeys vary from system to system depending on how many cores and memory are present Use the showcomponent command to see the asrkeys on a given system Note A reset or powercycle is required after disabling or enabling a component If the status of a component is changed with power on there is no effect to the system until the next reset or powercycle Displaying System Components With the showcomponent Command The showcomponent command displays the system components asrkeys and reports their status 1 At the sc gt prompt enter the showcomponent command Chapter 2 Sun Blade T6300 Server Module Di
40. ber Warning and informational messages use the following syntax INFO or WARNING message The following example shows a POST error message po 2 gt ERROR TEST Data Bitwalk 2 gt H W under test MB CMPO CH2 RO DO SO MB CMPO CH2 RO DO 2 gt Repair Instructions Replace items in order listed by H W under test above 7 2 gt MSG Pin 149 failed on MB CMP0 CH2 R0 D0 S0 J6901 7 2 gt END_ERROR I 1 1 7 2 gt Decode of Dram Error Log Reg Channel 2 bits 60000000 0000108c 7 2 gt 1 MEC 62 R W1C Multiple corrected errors one or more CE not logged 7 2 gt 1 DAC 61 R W1C Set to 1 if the error was a DRAM access CE 7 2 gt 108c SYND 15 0 RW ECC syndrome 7 2 gt 7 2 gt Dram Error AFAR channel 2 00000000 00000000 7 2 gt L2 AFAR channel 2 00000000 00000000 In this example POST is reporting a memory error at DIMM location MB CMP0 CH2 R0 D0 This error was detected by POST running on core 7 strand 2 2 32 Sun Blade T6300 Server Module Service Manual April 2007 2 4 5 b Run the showfaults command to obtain additional fault information The fault is captured by ALOM where the fault is logged the Service Action Required LED is lit and the faulty component is disabled Example ok sc gt showfaults v ID Time FRU Fault 1 APR 24 12 47 27 MB CMPO CH2 RO DO MB CMPO CH2 RO0 DO deemed faulty and disabled In this example MB CMP0 CH2 RO0 DO0 is disabled The system
41. book title Document Title Description Sun Blade T6300 Server Module Product Notes 820 0278 Sun Blade T6300 Server Module Installation Guide 820 0275 Sun Blade T6300 Server Module Administration Guide 820 0277 Sun Blade T6300 Server Module Safety and Compliance Manual 820 0279 Sun Blade 6000 Chassis Documentation Sun Blade 6000 Modular System Service Manual 820 0051 Sun Blade 6000 Modular System Product Notes 820 0055 Software Documentation Advanced Lights out Management ALOM CMT v1 3 Guide 819 7981 Configuring Jumpstart Servers to Provision Sun x86 64 Systems 819 1962 10 Solaris 10 6 06 Installation Guide Network Based Installations Sun VTS 6 3 User s Guide 820 0080 Solaris Operating System documentation Important late breaking information about the server module and related software Basic information about installing powering on and installing software Administrative tasks that are specific to the Sun Blade T6300 server module Important safety information for the Sun Blade T6300 server module Component removal and replacement procedures diagnostics information and specifications Late breaking information about the Sun Blade 6000 chassis and related software Advanced Lights Out Manager ALOM CMT software Configuring JumpStart servers Setting up network based installations and JumpStart servers Testing the server module and creating custom hardware tests
42. can boot using memory that was not disabled until the faulty component is replaced Note You can use ASR commands to display and control disabled components See Section 2 7 Managing Components With Automatic System Recovery Commands on page 2 40 Clearing POST Detected Faults In most cases when POST detects a faulty component POST logs the fault and automatically takes the failed component out of operation by placing the component in the ASR blacklist see Section 2 7 Managing Components With Automatic System Recovery Commands on page 2 40 After the faulty FRU is replaced you must clear the fault by removing the component from the ASR blacklist At the ALOM CMT prompt use the showfaults command to identify POST detected faults POST detected faults are distinguished from other kinds of faults by the text deemed faulty and disabled and no UUID number is reported Example sc gt showfaults v ID Time FRU Fault 1 APR 24 12 47 27 MB CMPO CH2 RO DO MB CMPO CH2 R0 DO deemed faulty and disabled If no fault is reported you do not need to do anything else Do not perform the subseguent steps Chapter2 Sun Blade T6300 Server Module Diagnostics 2 33 2 Use the enablecomponent command to clear the fault and remove the component from the ASR blacklist Use the FRU name that was reported in the fault in the previous step Example sc gt enablecomponent MB CMPO CHO RO DO The f
43. console output to the ALOM CMT sc gt prompt type Hash Period m To switch from the sc gt prompt to the console type console 2 14 Sun Blade T6300 Server Module Service Manual April 2007 2 3 1 3 Service Related ALOM CMT Commands TABLE 2 5 describes the typical ALOM CMT commands for servicing a Sun Blade T6300 server module For descriptions of all ALOM CMT commands issue the help command or refer to the Advanced Lights out Management ALOM CMT v1 3 Guide 819 7981 TABLE 2 5 Service Related ALOM CMT Commands ALOM CMT Command Description help command break y c clearfault UUID console f consolehistory b lines e lines v g lines boot run bootmode normal reset_nvram bootscript string powercycle f poweroff y f poweron c Displays a list of all ALOM CMT commands with syntax and descriptions Specifying a command name as an option displays help for that command Takes the host server from the OS to either kmdb or OpenBoot PROM equivalent to a Stop A command depending on the Solaris mode that was booted The y option skips the confirmation question The c option executes a console command after completion of the break command Manually clears host detected faults The UUID is the unique fault ID of the fault to be cleared Connects you to the host system The option forces the console to have read and write capabilities Displays the co
44. d drives displaying FRU status 2 20 dmesg command 2 40 Drive Ready to Remove LED 2 10 E electrostatic discharge ESD prevention 4 2 enablecomponent command 2 34 2 41 2 43 4 14 environmental faults 2 4 2 13 2 17 Ethernet MAC addresses 4 16 Ethernet ports about 1 5 LEDs 2 12 specifications 1 5 event log checking the PSH 2 36 exercising the system with SunVTS 2 44 F fan status displaying 2 18 fault manager daemon fmd 1M 2 34 fault message ID 2 17 fault records 2 38 faults 2 17 ALOM handling 2 12 environmental 2 4 managing DIMM faults 4 13 recovery 2 13 repair 2 13 types of 2 17 feature specifications 1 2 features server module 1 1 field replaceable units FRUs also see FRUs 4 1 fmadm command 2 38 4 15 fmdump command 2 35 front panel LED status displaying 2 18 LEDs 2 9 FRU disabled messages 4 14 enablecomponent command 4 14 replacement common procedures 4 3 status displaying 2 20 FRU ID PROMs 2 12 FRUs hot swapping 3 1 G guide organization 2 xii H hard drives 1 6 backplane cable replacing 4 18 hot plugging 3 1 identification 3 2 latch release button 3 3 mirroring 1 8 replacing 3 2 replacing or installing new 3 4 specifications 1 5 status displaying 2 18 hardware component test and verification 2 26 HDD hard drive FRU names 1 6 help command 2 15 host ID 4 16 hot plugging hard drives 3 1 Index 2 Sun Blade T6300 Server Module Se
45. d in the socket You hear a click when the ejector levers lock into place Perform the procedures described in Section 4 6 Finishing Component Replacement on page 4 22 4 4 4 18 Removing the Disk Backplane Cables Perform the procedures described in Section 4 2 Common Procedures for Parts Replacement on page 4 3 Disconnect the power cable from the power cable plug Note which data cable is plugged into each connector and disconnect the four data cables from the disk backplane Remove the five screws that secure the disk backplane to the chassis FIGURE 4 10 Sun Blade T6300 Server Module Service Manual April 2007 FIGURE 4 10 Removing the Disk Backplane Cables 4 4 1 Replacing the Disk Backplane Cables Each cable FRU kit contains two hard drive signal cables and one power signal cable 1 Ensure that the system is on an antistatic mat and remove the cables from the package 2 Connect the power cable to the controller and to the motherboard FIGURE 4 10 Caution Ensure that the hard drive signal cables are connected to the appropriate connectors at each end 3 Connect the signal cable to the controller and to the motherboard FIGURE 4 10 4 Verify that the cables are tight Chapter 4 Replacing Cold Swappable Components 4 19 5 Perform the procedures described in Section 4 6 Finishing Component Replacement on page 4 22 4 5 Removing the Battery on t
46. des the operating system and the operating system is not mirrored on another drive m The hard drive cannot be logically isolated from the online operations of the server module If your drive falls into these conditions you must shut the system down before you replace the hard drive See Section 4 2 2 Shutting Down the System on page 4 3 Removing a Hard Drive For more information about the cfgadm and hard drive management commands see the Solaris man pages Identify the physical location of the hard drive that you want to replace FIGURE 3 2 Issue the Solaris OS commands reguired to stop using the hard drive Exact commands reguired depend on the configuration of your hard drives You might need to unmount file systems or perform RAID commands One command that you may use to take the drive offline is cfgadm For more information see the cfgadm man page Verify that the blue Drive Ready to Remove LED is illuminated on the front of the hard drive 3 2 Sun Blade T6300 Server Module Service Manual April 2007 Blue LED Drive Ready to Remove Amber LED Service Action Required Green LED Drive OK FIGURE 3 1 Hard Drive Locations and LEDs 4 Push the latch release button FIGURE 3 2 Caution The latch is not an ejector The latch can be damaged if you bend it too much 5 Grasp the latch and pull the drive out of the drive slot Chapter 3 Replacing Hot Swappable and Hot Pl
47. ds refer to the Sun Blade T6300 Server Module Administration Guide 820 0277 sc gt powercycle Are you sure you want to powercycle the system y n y Powering host off at MON JAN 10 02 52 02 2000 Waiting for host to Power Off hit any key to abort SC Alert SC Request to Power Off Host SC Alert Host system has shut down Powering host on at MON JAN 10 02 52 13 2000 SC Alert SC Request to Power On Host 4 Switch to the system console to view the post output sc gt console Example of POST output with some output omitted 0 0 gt 0 0 gt Sun Blade T6300 Server Module POST 4 25 0 2007 01 16 11 57 0 0 gt Copyright 2007 Sun Microsystems Inc All rights reserved SUN PROPRIETARY CONFIDENTIAL Use is subject to license terms 0 0 gt VBSC selecting POST MAX Testing 0 0 gt POST enabling threads f00fffff 0 0 gt VBSC setting verbosity level 3 Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 27 0 gt Start Selftest 0 gt Begin Init CPU 0 gt End Init CPU 0 gt Master CPU Tests Basic 0 gt CPU 0 0 gt Begin DMMU Registers Access 0 gt End DMMU Registers Access 0 gt Begin Common MMU regs 0 gt End Common MMU regs 0 gt Begin Init mmu regs 0 gt End Init mmu regs 0 gt Begin D Cache RAM 0 gt End D Cache RAM 0 gt Init MMU 0 gt Begin DMMU TLB DATA RAM Access 0 gt End DMMU TLB DATA RAM Access 0 gt Begin DMMU TLB TAGS
48. ds 2 15 ALOM CMT Parameters Used For POST Configuration 2 22 ALOM CMT Parameters and POST Modes 2 25 ASR Commands 2 41 Sample of installed SunVTS Packages 2 44 DIMM Names and Socket Numbers 4 12 Exterior Dimensions A 1 x Sun Blade T6300 Server Module Service Manual April 2007 Preface The Sun Blade Server Module Service Manual provides information to aid in diagnosing hardware problems and describes how to replace components This manual also describes how to add components such as hard drives and memory This manual is written for technicians service personnel and system administrators who service and repair computer systems The person qualified to use this manual Can open a system chassis and can identify and replace internal components Understands the Solaris Operating System and the command line interface Has superuser privileges for the system being serviced Understands typical hardware troubleshooting tasks xi How This Book Is Organized This guide is organized into the following chapters Chapter 1 describes the main features of the Sun Blade T6300 server module Chapter 2 describes diagnostics procedures and related information Chapter 3 explains how to remove and replace hot pluggable hard drives Chapter 4 describes how to remove and replace components that cannot be hot swapped Appendix A provides specifications xii Using UNIX Commands This document might not contain informatio
49. dware and OS ALOM CMT also tracks and reports the health of key server module components ALOM CMT works closely with POST and Solaris Predictive Self Healing technology to keep the system up and running even when there is a faulty component a Power on self test POST POST performs diagnostics on system components upon system reset to ensure the integrity of those components POST is configureable and works with ALOM CMT to take faulty components offline if needed m Solaris OS Predictive Self Healing PSH This technology continuously monitors the health of the CPU and memory and works with ALOM CMT to take a faulty component offline if needed The Predictive Self Healing technology enables Sun systems to accurately predict component failures and mitigate many serious problems before they occur m Log files and console messages Provide the standard Solaris OS log files and investigative commands that can be accessed and displayed on the device of your choice m SunVTS An application that exercises the system provides hardware validation and discloses possible faulty components with recommendations for repair The LEDs ALOM Solaris OS PSH and many of the log files and console messages are integrated For example a fault detected by the Solaris software will display the fault log it pass information to ALOM CMT where it is logged and depending on the fault might illuminate one or more LEDs The diagnostic flowchart in
50. e fault message ID SUNW MSG ID for PSH detected faults m To verify that the replacement of a FRU has cleared the fault and not generated any additional faults At the sc gt prompt type the showfaults command The following showfaults command examples show the different kinds of output from the showfaults command m Example of the showfaults command when no faults are present sc gt showfaults Last POST run THU MAR 09 16 52 44 2006 POST status Passed all devices No failures found in System m Example of the showfaults command displaying an environmental fault POST status ID FRU 0 IOBD sc gt showfaults v Last POST run TUE FEB 07 18 51 Passed all devices Fault VOLTAGE_SI low warning threshold 02 2006 ENSOR at IOBD V_ 1V has exceeded Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 17 m Example showing a fault that was detected by POST These kinds of faults are identified by the message deemed faulty and disabled and by a FRU name sc gt showfaults v ID Time FRU Fault 1 OCT 13 12 47 27 MB CMPO CHO R0 DO MB CMPO CHO RO DO deemed faulty and disabled m Example showing a fault that was detected by the PSH technology These kinds of faults are identified by the text Host detected fault and by a Universal Unigue Identifier UUID sc gt showfaults v ID Time FRU Fault O SEP 09 11 09 26 MB CMP0 CHO RO DO Host detected fault MSGID SUN4U 8000 2S U
51. e processor is the basis of the Sun Blade T6300 server module The UltraSPARC T1 multicore processor is based on chip multithreading CMT technology that is optimized for highly threaded transactional processing This processor improves throughput while using less power and dissipating less heat than conventional processor designs The processor has six or eight UltraSPARC cores Each core eguates to a 64 bit execution pipeline capable of running four threads The result is that the 8 core processor handles up to 32 active threads concurrently Additional processor components such as L1 cache L2 cache memory access crossbar DDR 2 memory controllers and a JBus I O interface have been carefully tuned for optimal performance For more information about the UltraSPARC T1 multicore processor refer to the coolthreads white papers at http www sun com servers wp jsp tab 1 Support for RAID Storage Configurations In addition to software RAID configurations you can set up hardware RAID 1 mirroring and hardware RAID 0 striping configurations for any pair of internal hard drives using the on board controller providing a high performance solution for hard drive mirroring By attaching one or more external storage devices to the Sun Blade T6300 server module you can use a redundant array of independent drives RAID to configure system drive storage in a variety of different RAID levels 1 2 Finding the Serial Number To obt
52. e reported using one of two forms m fru at location is OK m sensor at location is within normal range Environmental faults can be repaired through hot removal of the faulty FRU FRU removal is automatically detected by the environmental monitoring and all faults associated with the removed FRU are cleared The message for that case and the alert sent for all FRU removals is Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 13 fru at location has been removed There is no ALOM CMT command to manually repair an environmental fault ALOM CMT does not handle hard drive faults Use the Solaris message files to view hard drive faults See Section 2 6 Collecting Information From Solaris OS Files and Commands on page 2 39 2 3 1 Running ALOM CMT Service Related Commands This section describes the ALOM CMT commands that are commonly used for service related activities 2 3 1 1 Connecting to ALOM Before you can run ALOM CMT commands you must connect to the ALOM There are several ways to connect to the service processor m Connect an ASCII terminal directly to the serial management port m Use the telnet command to connect to ALOM CMT through an Ethernet connection on the network management port Note Refer to the Advanced Lights out Management ALOM CMT v1 3 Guide 819 7981 for instructions on configuring and connecting to ALOM 2 3 1 2 Switching Between the System Console and ALOM m To switch from the
53. ed Meanings Color Behavior Definition Description White Off Steady state Fast blink 4 Hz repeating This indicator helps you to locate a particular sequence equal enclosure board or subsystem for example the intervals On Locator LED The LED is activated using one of the and Off following methods Issuing the setlocator on or off command e Pressing the button to toggle the indicator on or off This LED provides the following indications Off Normal operating state Fast blink The server module received a signal as a result of one of the preceding methods and is indicating that the server module is active Blue Off Steady state Steady state it is safe to remove the server module from the chassis Steady on Steady state If blue is on a service action can be performed on the applicable component with no adverse consequences for example the OK to Remove LED Yellow or Off Steady state Amber 2 10 Sun Blade T6300 Server Module Service Manual April 2007 TABLE 2 3 LED Behaviors With Assigned Meanings Continued Color Behavior Definition Description Steady on Steady state This indicator signals the existance of a fault condition Service is required for example the Service Required LED The ALOM CMT showfaults command provides details about any faults that cause this indicator to be lit Green Off Steady state Off The system is unavailable Either it has no power or ALOM CMT is not running Standby blink Repeatin
54. ee TABLE 1 3 The Solaris message buffer and log files record system events and provide information about faults e If system messages indicate a faulty device replace the FRU e To obtain more diagnostic information go to Action 4 SunVTS can exercise and diagnose FRUs To run SunVTS the server module must be running the Solaris OS If SunVTS reports a faulty device replace the FRU If SunVTS does not report a faulty device go to Action 5 POST performs basic tests of the server module components and reports faulty FRUs e If POST indicates a faulty FRU replace the FRU e If POST does not indicate a faulty FRU go to Action 9 If the fault listed by the showfaults command displays a temperature or voltage fault then the fault is an environmental fault Environmental faults can be caused by faulty FRUs power supply fan or blower or by environmental conditions such as high ambient temperature or blocked airflow Section 2 6 Collecting Information From Solaris OS Files and Commands on page 2 39 Section 2 8 Exercising the System With SunVTS on page 2 43 Section 2 4 Running POST on page 2 22 Section 2 3 2 Displaying System Faults on page 2 17 See the Sun Blade 6000 Modular System Service Manual 820 0051 2 4 Sun Blade T6300 Server Module Service Manual April 2007 TABLE 2 1 Action No Diagnostic Flowchart Actions Continued Diagnostic Action
55. em LEDs The Sun Blade T6300 server module has LEDs on the front panel and the hard drives The behavior of LEDs on your server module conform to the American National Standards Institute ANSI Status Indicator Standard SIS These standard LED behaviors are described in TABLE 2 2 Front Panel LEDs and Buttons The front panel LEDs and buttons are located in the center of the server module FIGURE 2 3 TABLE 2 2 and TABLE 2 3 and TABLE 2 4 White Locator LED press to reset the LED Blue Ready to Remove LED Amber Service Action Required LED Green OK LED Power button Not functional Universal connector port UCP Green Disk OK LED Amber Disk Service Action Required LED Blue Disk Ready to Remove LED FIGURE 2 3 Front Panel and Hard Drive LEDs Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 9 TABLE 2 2 LED Behavior and Meaning LED Behavior Meaning Off The condition represented by the color is not true Steady on The condition represented by the color is true Standby blink The system is functioning at a minimal level and ready to resume full function Slow blink Transitory activity or new activity represented by the color is taking place Fast blink Attention is required Feedback flash Activity is taking place commensurate with the flash rate such as disk drive activity The LEDs have assigned meanings described in TABLE 2 3 TABLE 2 3 LED Behaviors With Assign
56. em Message Log Files The error logging daemon syslogd automatically records various system warnings errors and faults in message files These messages can alert you to system problems such as a device that is about to fail The var adm directory contains several message files The most recent messages are in the var adm messages file After a period of time usually every ten days a new messages file is automatically created The original contents of the messages file are rotated to a file named messages 1 Over a period of time the messages are further rotated to messages 2 and messages 3 and then deleted 1 Log in as superuser 2 Issue the following command more var adm messages 3 If you want to view all logged messages issue the following command more var adm messages 2 7 Managing Components With Automatic System Recovery Commands The Automatic System Recovery ASR feature enables the server module to automatically unconfigure failed components to remove them from operation until they can be replaced In the Sun Blade T6300 server module the following components are managed by the ASR feature m UltraSPARC T1 processor strands m Memory DIMMS a I O bus 2 40 Sun Blade T6300 Server Module Service Manual April 2007 AF The database that contains the list of disabled components is called the ASR blacklist asr db In most cases POST automatically disables a component when it is f
57. er module is also known as a blade The following topics are covered m Section 1 1 Component Overview on page 1 1 m Section 1 2 Finding the Serial Number on page 1 8 m Section 1 3 Additional Service Related Information on page 1 9 1 1 Component Overview FIGURE 1 2 shows the main St Paul components and some basic connections to the Sun Blade 6000 chassis For information about connectivity to system fans PCI ExpressModules Ethernet modules and other components see the Sun Blade 6000 chassis documentation at http www sun com documentation FIGURE 1 1 Sun Blade T6300 Server Module With the Sun Blade 6000 Chassis FIGURE 1 5 shows the physical characteristics of the Sun Blade T6300 server module TABLE 1 1 lists the Sun Blade T6300 server module features TABLE 1 2 lists some of Sun Blade 6000 chassis input output features 1 2 Sun Blade T6300 Server Module Service Manual April 2007 Front View White Locator LED Rear View r press to reset the LED Power connector Blue Ready to Remove LED Amber Service Action Required LED Signal connector Green OK LED Power button Not functional Universal connector port UCP Green Drive OK LED Amber Drive Service Action Required LED Blue Drive Ready to Remove LED FIGURE 1 2 Front and Rear Panels and Optional Cable Dongle Chapter 1 Sun Blade T6300 Server Module Product Description 1 3 RJ45 virtual co
58. eroff command sc gt poweroff y SC Alert SC Request to Power Off Host Note You can also use the Power button on the front of the server module to initiate a graceful system shutdown Use a paper clip to press this button Feb 10 17 17 11 dt90 107 unix WARNING Power off requested system will now shutdown Shutdown started Sat Feb 10 17 17 11 PST 2007 Changing to init state 5 please wait Broadcast Message from root msglog on dt90 107 Sat Feb 10 TLF es THE SYSTEM dt90 107 IS BEING SHUT DOWN NOW Log off now or risk your files being damaged svc startd The system is coming down Please wait svc startd 87 system services are now being stopped Feb 10 17 17 27 dt90 107 syslogd going down on signal 15 SC Alert Host system has shut down Refer to the Advanced Lights out Management ALOM CMT v1 3 Guide 819 7981 for more information about the ALOM CMT poweroff command 4 2 3 Removing the Sun Blade T6300 Server Module From the Sun Blade 6000 Chassis 1 To perform an orderly shutdown run the removeblade command sc gt removeblade The top white LED is the Locator LED Once you have located the server module you can press the Locator LED to turn it off Or you can use the setlocator off command to reset the LED 4 4 Sun Blade T6300 Server Module Service Manual April 2007 2 If a cable is connected to the front of the server module disconnect it
59. g The system is running at a minimum level and is sequence ready to be quickly revived to full function for consisting of a example the System Activity LED brief 0 1 sec on flash followed by a long off period 2 9 sec Steady on Steady state Status normal system or component functioning with no service actions required Slow blink A transitory temporary event is taking place for which direct proportional feedback is not needed or not feasible ALOM is enabled but the server module is not fully powered on Indicates that the service processor is running while the system is running at a minimum level in standby mode and ready to be returned to its normal operating state TABLE 2 4 Front Panel Buttons LED Color Description Power gray Turns the host system on and off Use a paper clip or other small button tipped object to completely press this button reset gray This button does not function on the Sun Blade T6300 server module Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 11 2 2 2 Ethernet Port LEDs For information about Ethernet LEDs see the Sun Blade 6000 Modular System Service Manual 820 0051 at http www sun com documentation 20 Using ALOM CMT for Diagnosis and Repair Verification The Sun Advanced Lights Out Manager ALOM CMT is a service processor in the Sun Blade T6300 server module that enables you to remotely manage and administer your server module ALOM C
60. ge Configurations 1 8 1 2 Finding the Serial Number 1 8 1 3 Additional Service Related Information 1 9 Sun Blade T6300 Server Module Diagnostics 2 1 2 1 Sun Blade T6300 Server Module Diagnostics Overview 2 1 2 11 Memory Configuration and Fault Handling 2 5 2 1 1 1 2 1 1 2 2 1 1 3 2 1 1 4 2 1 1 5 Memory Configuration 2 6 Capacity Restrictions 2 6 DIMM Installation Rules 2 6 Memory Fault Handling 2 7 Troubleshooting Memory Faults 2 8 2 2 Interpreting System LEDs 2 9 2 2 1 Front Panel LEDs and Buttons 2 9 2 2 2 Ethernet Port LEDs 2 12 iv 2 3 2 4 2 5 2 6 2 7 Using ALOM CMT for Diagnosis and Repair Verification 2 12 2 3 1 Running ALOM CMT Service Related Commands 2 14 2 3 1 1 Connecting to ALOM 2 14 2 3 1 2 Switching Between the System Console and ALOM 2 14 2 3 1 3 Service Related ALOM CMT Commands 2 15 2 3 2 Displaying System Faults 2 17 2 3 3 Displaying the Environmental Status 2 18 2 3 4 Displaying FRU Information 2 20 Running POST 2 22 2 4 1 Controlling How POST Runs 2 22 2 42 Changing POST Parameters 2 25 2 43 Reasons to Run POST 2 26 2 4 3 1 Verifying Hardware Functionality 2 26 243 2 Diagnosing the System Hardware 2 26 2 44 Running POST 2 26 2 4 5 Clearing POST Detected Faults 2 33 Using the Solaris Predictive Self Healing Feature 2 34 2 5 1 Identifying Faults With the fmdump Command 2 35 2 5 2 Clearing PSH Detected Faults 2 37 2 5 3 Clearing the PSH Fault From the ALOM CMT Logs 2 38 Collect
61. he Service Processor 1 Perform the procedures described in Section 4 2 Common Procedures for Parts Replacement on page 4 3 2 Remove the service processor from the chassis Section 4 3 4 Removing the Service Processor on page 4 15 and place it on an antistatic mat 3 Carefully remove the battery FIGURE 4 11 from the service processor 4 20 Sun Blade T6300 Server Module Service Manual April 2007 FIGURE 4 11 Removing the Battery From the Service Processor 4 5 1 Replacing the Battery on the Service Processor Remove the replacement battery from the package Press the new battery into the service processor FIGURE 4 11 with the positive side facing upward away from the card Replace the service processor See Section 4 3 5 Replacing the Service Processor on page 4 17 Perform the procedures described in Section 4 6 Finishing Component Replacement on page 4 22 Chapter 4 Replacing Cold Swappable Components 4 21 5 Use the ALOM CMT setdate command to set the day and time Use the setdate command before you poweron the host system For details about this command refer to the Advanced Lights out Management ALOM CMT v1 3 Guide 819 7981 4 6 Finishing Component Replacement 4 6 1 Replacing the Cover 1 Place the cover on the chassis Set the cover down so that it hangs over the rear of the server module by about an inch 2 5 mm 2 Slide the cover forward
62. ide 820 0277 See Reconfiguring and Unconfiguring Devices Chapter 3 Replacing Hot Swappable and Hot Pluggable Components 3 5 3 6 Sun Blade T6300 Server Module Service Manual April 2007 CHAPTER 4 Replacing Cold Swappable Components This chapter describes how to remove and replace field replaceable units FRUs that must be cold swapped The following topics are covered 4 1 Section 4 1 Safety Information on page 4 1 Section 4 2 Common Procedures for Parts Replacement on page 4 3 Section 4 3 Removing and Replacing DIMMS on page 4 8 Section 4 4 Removing the Disk Backplane Cables on page 4 18 Section 4 5 Removing the Battery on the Service Processor on page 4 20 Section 4 6 Finishing Component Replacement on page 4 22 Safety Information This section describes important safety information you need to know prior to removing or installing parts in the Sun Blade T6300 server module For your protection observe the following safety precautions when setting up your equipment Follow all Sun standard cautions warnings and instructions marked on the equipment and described in Important Safety Information for Sun Hardware Systems 816 7190 10 Ensure that the voltage and frequency of your power source match the voltage and frequency inscribed on the equipment s electrical rating label Follow the electrostatic discharge safety practices as described in this section 4 1
63. ill run in service mode sc gt setkeyswitch diag b Issue the poweron command sc gt poweron c Switch to the system console to view POST output sc gt console Watch the POST output for possible fault messages The following output is a sign that POST did not detect any faults 2000 02 11 21 33 43 045 0 0 gt POST Passed all devices 2000 02 11 21 33 43 272 0 0 gt POST Return to VBSC 2000 02 11 21 33 43 280 0 0 gt Master set ACK for vbsc runpost command and spin Note Depending on the configuration of ALOM CMT POST variables and whether POST detected faults or not the system might boot or the system might remain at the ok prompt If the system is at the ok prompt type boot 4 14 Sun Blade T6300 Server Module Service Manual April 2007 4 3 4 10 11 d Issue the Solaris OS fmadm faulty command fmadm faulty No memory or DIMM faults should be displayed If faults are reported refer to FIGURE 2 1 to diagnose the fault Access the ALOM CMT sc gt prompt Run the showfaults command m If the fault was detected by the host and the fault information persists the output will be similar to the following example sc gt showfaults v ID Time FRU Fault O SEP 09 11 09 26 MB CMPO CHO R0 DO Host detected fault MSGID SUN4U 8000 2S UUID 7ee0e46b ea64 6565 e684 e996963f7b86 m If the showfaults command does not report a fault with a UUID then
64. ing Information From Solaris OS Files and Commands 2 39 2 6 1 Checking the Message Buffer 2 39 2 6 2 Viewing the System Message Log Files 2 40 Managing Components With Automatic System Recovery Commands 2 40 2 71 Displaying System Components With the showcomponent Command 2 41 2 7 2 Disabling Components With the disablecomponent Command 2 42 Sun Blade T6300 Server Module Service Manual April 2007 2 8 2 7 3 Enabling a Disabled Component With the enablecomponent Command 2 43 Exercising the System With SunVTS 2 43 2 8 1 Checking SunVTS Software Installation 2 43 2 8 2 Exercising the System Using SunVTS Software 2 44 Replacing Hot Swappable and Hot Pluggable Components 3 1 3 1 3 2 3 3 Hot Pluggable Hard Drives 3 1 Hot Plugging a Hard Drive 3 1 3 2 1 Rules for Hot Plugging 3 2 3 22 Removing a Hard Drive 3 2 3 23 Replacing a Hard Drive or Installing a New Hard Drive 3 4 Adding PCI ExpressModules 3 5 Replacing Cold Swappable Components 4 1 4 1 4 2 4 3 Safety Information 4 1 4 1 1 Safety Symbols 4 2 4 1 2 Electrostatic Discharge Safety 4 2 4 1 2 1 Using an Antistatic Wrist Strap 4 2 4 1 2 2 Using an AntistaticMat 4 2 Common Procedures for Parts Replacement 4 3 42 1 Required Tools 4 3 4 2 2 Shutting Down the System 4 3 42 3 Removing the Sun Blade T6300 Server Module From the Sun Blade 6000 Chassis 4 4 Removing and Replacing DIMMS 4 8 4 3 1 Memory Configuration 4 8 4 3 1 1 Capacity Restrictions
65. ion 2 4 Running POST on page 2 22 Section 4 2 Common Procedures for Parts Replacement on page 4 3 Section 2 4 5 Clearing POST Detected Faults on page 2 33 Sun Support information http www sun com support Section 1 2 Finding the Serial Number on page 1 8 ALI Memory Configuration and Fault Handling This section describes how the memory is configured and how the server module deals with memory faults Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 5 2 1 1 1 2 1 1 2 2 1 1 3 Memory Configuration The Sun Blade T6300 server module has eight slots that hold DDR 2 memory DIMMs in the following DIMM sizes m 1 Gbyte maximum of 8 Gbyte m 2 Gbyte maximum of 16 Gbyte m 4 Gbyte maximum of 32 Gbyte The Sun Blade T6300 server module performs best if all eight connectors are populated with eight DIMMs This configuration also enables the system to continue operating even when a DIMM fails or if an entire channel fails Capacity Restrictions Due to interleaving rules for the CPU the system will operate at the lowest capacity of all the DIMMs installed Therefore it is ideal to install eight identical DIMMs not four DIMMs of one capacity and four DIMMs of another capacity DIMM Installation Rules Caution The following DIMM rules must be followed The server module might not operate correctly if the DIMM rules are not followed Always use DIMMs that have been
66. laris are trademarks or registered trademarks of Sun Microsystems Inc in the U S and in other countries All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International Inc in the U S and in other countries Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems Inc The OPEN LOOK and Sun Graphical User Interface was developed by Sun Microsystems Inc for its users and licensees Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry Sun holds a non exclusive license from Xerox to the Xerox Graphical User Interface which license also covers Sun s licensees Wna implement OPEN LOOK GUIs and otherwise comply with Sun s written license agreements U S Government Rights Commercial use Government users are subject to the Sun Microsystems Inc standard license agreement and applicable provisions of the FAR and its supplements DOCUMENTATION IS PROVIDED AS IS AND ALL EXPRESS OR IMPLIED CONDITIONS REPRESENTATIONS AND WARRANTIES INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY FITNESS FOR A PARTICULAR PURPOSE OR NON INFRINGEMENT ARE DISCLAIMED EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID Copyright 2007 Sun Microsystems Inc 4150 Network Circle Santa Clara Californie 95054 Etats Unis Tous droits r serv s S
67. marques SPARC sont utilis es sous licence et sont des marques de fabrique ou des marques d pos es de SPARC International Inc aux Etats Unis et dans d autres pays Les produits portant les marques SPARC sont bas s sur une architecture d velopp e par Sun Microsystems Inc L interface d utilisation graphique OPEN LOOK et Sun a t d velopp e par Sun Microsystems Inc pour ses utilisateurs et licenci s Sun reconnait les efforts de pionnier de Xerox pour la recherche et le d veloppement du concept des interfaces d utilisation visuelle ou graphique pour l industrie de l informatique Sun d tient une license non exclusive de Xerox sur l interface d utilisation grephique Xerox cette licence couvrant galement les licenci es de Sun qui mettent en place l interface d utilisation graphique OPEN LOOK et qui en outre se conforment aux licences crites de Sun LA DOCUMENTATION EST FOURNIE EN L TAT ET TOUTES AUTRES CONDITIONS DECLARATIONS ET GARANTIES EXPRESSES OU TACITES SONT FORMELLEMENT EXCLUES DANS LA MESURE AUTORISEE PAR LA LOI APPLICABLE Y COMPRIS NOTAMMENT TOUTE GARANTIE IMPLICITE RELATIVE A LA QUALITE MARCHANDE A UAPTITUDE A UNE UTILISATION PARTICULIERE OU A L ABSENCE DE CONTREFA ON Ca Adobe PostScript Contents Preface xi Sun Blade T6300 Server Module Product Description 1 1 1 1 Component Overview 1 1 111 Chip Multithreaded CMT Multicore Processor and Memory Technology 1 8 1 1 2 Support for RAID Stora
68. meter is a command that sets the virtual keyswitch so it does not use the setsc command Example sc gt setkeyswitch diag Reasons to Run POST You can use POST to test and verify server module hardware Verifying Hardware Functionality POST tests critical hardware components to verify functionality before the system boots and accesses software If POST detects an error the faulty component is disabled automatically preventing faulty hardware from potentially harming software Under normal operating conditions the server module is usually configured to run POST in maximum mode for all power on or error generated resets Diagnosing the System Hardware You can use POST as an initial diagnostic tool for the system hardware In this case configure POST to run in diagnostic service mode for maximum test coverage and verbose output Running POST This procedure describes how to run POST when you want maximum testing as in the case when you are troubleshooting a system 2 26 Sun Blade T6300 Server Module Service Manual April 2007 1 Switch from the system console prompt to the ALOM CMT sc gt prompt by issuing the escape sequence ok sc gt 2 Set the virtual keyswitch to diag so that POST will run in service mode sc gt setkeyswitch diag 3 Reset the system so that POST runs There are several ways to initiate a reset The following example uses the powercycle command For other metho
69. n about basic UNIX commands and procedures such as shutting down the system booting the system and configuring devices Refer to the following for this information m Software documentation that you received with your system m Solaris Operating System documentation which is at http docs sun com Sun Blade T6300 Server Module Service Manual April 2007 Typographic Conventions Typeface Meaning Examples AaBbCc123 The names of commands files Edit your login file and directories on screen Use 1s a to list all files computer output You have mail AaBbCc123 What you type when contrasted su with on screen computer output password AaBbCc123 Book titles new words or terms Read Chapter 6 in the User s Guide words to be emphasized These are called class options Replace command line variables You must be superuser to do this with real names or values To delete a file type rm filename The settings on your browser might differ from these settings 8 y 8 8 Shell Prompts Shell Prompt C shell machine name C shell superuser machine name Bourne shell and Korn shell Bourne shell and Korn shell superuser Accessing Sun Documentation You can view print or purchase a broad selection of Sun documentation including localized versions at http www sun com documentation Search on St Paul Preface xiii To find software documents search on the software name or
70. ng 2 33 faulty components detected by 2 33 how torun 2 26 memory faults and 2 7 parameters changing 2 25 reasons to run 2 26 troubleshooting with 2 5 power on standby 4 23 Predictive Self Healing PSH about 2 34 clearing faults 2 37 2 38 knowledge database 1 10 2 35 memory faults and 2 8 Sun URL 2 35 procedures for finishing up 4 22 procedures for parts replacement 4 3 processor 1 8 processor description 1 4 product notes 2 xiv PROM system configuration 4 16 Index 3 PSH detected faults 2 17 PSH see also Predictive Self Healing PSH 2 34 Q quick visual notification 2 1 R RAID redundant array of independent disks storage configurations 1 8 remote management 1 5 removeblade command 2 16 removefru command 2 16 reset button 2 11 reset command 2 16 resetsc command 2 16 S safety information 4 1 safety symbols 4 2 SC service processor FRU name 1 6 SC BAT service processor battery FRU name 1 6 serial number chassis 1 8 finding 1 8 server module locating 2 10 weight A 1 Service Action Required LED 2 10 2 12 2 34 service information resources 1 10 service mode 2 26 service processor 2 2 battery 4 20 description 1 6 service resources 1 10 setkeyswitch parameter 2 16 2 22 2 25 2 26 4 14 setlocator command 2 10 2 16 showcomponent command 2 41 showenvironment command 2 16 2 18 showfaults command 2 4 4 13 4 15 description and examples 2
71. nsole DB 9 serial male TTYA VGA 15 pin female USB 2 0 not supported two connectors FIGURE 1 3 Optional Dongle Cable Connecting to the Universal Connector Port TABLE 1 1 Sun Blade T6300 Server Module Features Feature Description Processor One UltraSPARC T1 multicore processor e 1 0 GHz six core 1 2 GHz eight core 1 4 GHz eight core Memory Eight DIMM slots for DDR 2 DIMMS 1 Gbyte 8 Gbyte maximum 2 Gbyte 16 Gbyte maximum 4 Gbyte 32 Gbyte maximum 1 4 Sun Blade T6300 Server Module Service Manual April 2007 Feature TABLE 1 1 Sun Blade T6300 Server Module Features Continued Description Internal hard drives Universal Connector Port Architecture Up to four hot pluggable 2 5 inch hard drives with RAID 0 and RAID 1 support e SFF SAS 73 Gbyte 10k rpm e SFF SATA 80 Gbyte 5 4k rpm e SFF SAS 146 Gbyte 10k rpm Filler panels are inserted anywhere hard drives are not installed One universal connector port UCB in the front panel A universal cable is included with the Sun Blade 6000 chassis The cable is also available as an optional component and has the following connectors Two USB 2 0 e VGA video 15 pin female not functional on this server module e RJ45 virtual console Three wire interface no hardware handshaking DB 9 male TTYA posix serial port SPARC V9 architecture ECC protected Platform group sun4v Platform name SUNW S
72. ntents of the system s console buffer The following options enable you to specify how the output is displayed e g lines option specifies the number of lines to display before pausing e e lines option displays n lines from the end of the buffer e b lines option displays n lines from beginning of the buffer e v option displays the entire buffer boot run option specifies the log to display run is the default log Enables control of the firmware during system initialization with the following options normal is the default boot mode reset_nvram resets OpenBoot PROM parameters to their default values bootscript string enables the passing of a string to the boot command Performs a poweroff followed by poweron The f option forces an immediate poweroff otherwise the command attempts a graceful shutdown Powers off the host server The y option enables you to skip the confirmation question The option forces an immediate shutdown Powers on the host server Using the c option executes a console command after completion of the poweron command Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 15 TABLE 2 5 ALOM CMT Command Service Related ALOM CMT Commands Continued Description removefru PS0 PS1 removeblade unremoveblade reset y c resetsc y setkeyswitch y normal stby diag locked setlocator on off showenvironment showfaults v showfru
73. of lines to display before pausing the output to the screen The s option displays static information about system FRUs defaults to all FRUs unless one is specified The d option displays dynamic information about system FRUs defaults to all FRUs unless one is specified See Section 2 3 4 Displaying FRU Information on page 2 20 Displays the status of the virtual keyswitch Displays the current state of the Locator LED as either on or off Displays the history of all events logged in the ALOM CMT event buffers in RAM or the persistent buffers Displays information about the host system s hardware configuration the system serial number and whether the hardware is providing service 2 16 Sun Blade T6300 Server Module Service Manual April 2007 Note See TABLE 2 8 for the ALOM CMT ASR commands 202 Displaying System Faults The ALOM CMT showfaults command displays the following kinds of faults a Environmental faults Temperature or voltage problems that might be caused by faulty FRUs power supplies fans or blower or by room temperature or blocked air flow m POST detected faults Faults on devices detected by the power on self test diagnostics a PSH detected faults Faults detected by the Solaris Predictive Self healing PSH technology Use the showfaults command for the following reasons m To see if any faults have been passed to or detected by ALOM CMT m To obtain th
74. on variables and thoroughly tests the memory subsystem Chapter 2 Sun Blade T6300 Server Module Diagnostics 2 7 21 15 If a memory fault is detected POST displays the fault with the FRU name of the faulty DIMMS logs the fault and disables the faulty DIMMs by placing them in the ASR blacklist For a given memory fault POST disables half of the physical memory in the system When this occurs you must replace the faulty DIMMs based on the fault message and enable the disabled DIMMs with the ALOM CMT enablecomponent command Solaris Predictive Self healing PSH technology Afeature of the Solaris OS uses the fault manager daemon fmd to watch for various kinds of faults When a fault occurs the fault is assigned a unique fault ID UUID and logged PSH reports the fault and provides a recommended proactive replacement for the DIMMs associated with the fault Troubleshooting Memory Faults If you suspect that the server module has a memory problem follow the flowchart see FIGURE 2 1 Run the ALOM CMT showfaults command The showfaults command lists memory faults and lists the specific DIMMS that are associated with the fault Once you ve identified which DIMMs to replace see Chapter 4 for DIMM removal and replacement instructions You must perform the instructions in that chapter to clear the faults and enable the replaced DIMMs 2 8 Sun Blade T6300 Server Module Service Manual April 2007 2 2 22 1 Interpreting Syst
75. processor cores is deemed faulty by POST the core will be disabled and the system will boot and run using the remaining cores Note Devices can be manually enabled or disabled using ASR commands see Section 2 7 Managing Components With Automatic System Recovery Commands on page 2 40 Controlling How POST Runs The server module can be configured for normal extensive or no POST execution You can also control the level of tests that run the amount of POST output that is displayed and which reset events trigger POST by using ALOM CMT variables TABLE 2 6 lists the ALOM CMT variables used to configure POST and FIGURE 2 5 shows how the variables work together TABLE 2 6 ALOM CMT Parameters Used For POST Configuration Parameter Values Description setkeyswitch normal The system can power on and run POST based on the other parameter settings For details see FIGURE 2 5 This parameter overrides all other commands diag The system runs POST based on predetermined settings stby The system cannot power on locked The system can power on and run POST but no flash updates can be made diag_ mode off POST does not run normal Runs POST according to diag_level value 2 22 Sun Blade T6300 Server Module Service Manual April 2007 TABLE 2 6 ALOM CMT Parameters Used For POST Configuration Continued Parameter Values Description service Runs POST with preset values for diag_level and diag_verbosity
76. pter 2 Sun Blade T6300 Server Module Diagnostics 2 45 2 46 Sun Blade T6300 Server Module Service Manual April 2007 CHAPTER 3 Replacing Hot Swappable and Hot Pluggable Components This chapter describes how to remove and replace the hot swappable and hot pluggable field replaceable units FRUs in the Sun Blade T6300 server module The following topics are covered m Section 3 1 Hot Pluggable Hard Drives on page 3 1 m Section 3 2 Hot Plugging a Hard Drive on page 3 1 m Section 3 3 Adding PCI ExpressModules on page 3 5 3 1 Hot Pluggable Hard Drives Hot pluggable devices are those devices that can be removed and installed while the system is running but you must perform administrative tasks first The Sun Blade T6300 server module hard drives can be hot swappable depending on how they are configured For information about a hot swappable or hot pluggable PCI ExpressModule PCI EM or network express module NEM see the Sun Blade 6000 Modular System Service Manual 820 0051 3 2 Hot Plugging a Hard Drive The hard drives in the Sun Blade T6300 server module are hot pluggable but this capability depends on how the hard drives are configured 3 1 CPAN TAL Rules for Hot Plugging To safely remove a hard drive you must m Prevent any applications from accessing the hard drive m Remove the logical software links Hard drives cannot be hot plugged if m The hard drive provi
77. r Module Service Manual April 2007 CHAPTER 2 Sun Blade T6300 Server Module Diagnostics This chapter describes the diagnostics that are available for monitoring and troubleshooting the Sun Blade T6300 server module This chapter is intended for technicians service personnel and system administrators who service and repair computer systems The following topics are covered m Section 2 1 Sun Blade T6300 Server Module Diagnostics Overview on page 2 1 m Section 2 2 Interpreting System LEDs on page 2 9 m Section 2 3 Using ALOM CMT for Diagnosis and Repair Verification on page 2 12 m Section 2 4 Running POST on page 2 22 m Section 2 5 Using the Solaris Predictive Self Healing Feature on page 2 34 m Section 2 6 Collecting Information From Solaris OS Files and Commands on page 2 39 m Section 2 7 Managing Components With Automatic System Recovery Commands on page 2 40 m Section 2 8 Exercising the System With SunVTS on page 2 43 2 1 Sun Blade T6300 Server Module Diagnostics Overview There are a variety of diagnostic tools commands and indicators you can use to monitor and troubleshoot a Sun Blade T6300 server module m LEDs Provide a quick visual notification of the status of the server module and of some of the FRUs 2 1 2 2 a ALOM CMT firmware This system firmware runs on the service processor In addition to providing the interface between the har
78. rectory clear 0 gt L2 Scrub VD amp UA 0 gt L2 Scrub Tags 0 gt L2 Disable 0 gt End Data Bitwalk 0 gt Begin Address Bitwalk 0 gt Testing Memory Channel 0 gt Testing Memory Channel 0 gt End Address Bitwalk 0 gt Test Slave Threads Basic 0 gt Begin Test Mailbox region 0 gt Begin Run POST from Memory 0 gt End Run POST from Memory 0 gt Begin L2 Cache Ram Test 0 gt End L2 Cache Ram Test 0 gt Begin Enable L2 Cache 0 gt L2 Scrub Data 0 gt L2 Enable Rank 0 Rank 0 Rank 0 Rank 0 0 gt End Test Mailbox region 0 gt Begin Set Mailbox 0 gt End Set Mailbox 0 gt Begin Setup Final DMMU Entries 0 gt End Setup Final DMMU Entries 0 gt Begin Post Image Region Scrub 0 gt End Post Image Region Scrub 0 gt Verifying checksum on copied image 0 gt The Memory s CHECKSUM value is 242 0 gt The Memory s Content Size value is 84f42 0 gt End Enable L2 Cache 0 gt CPU 0 4 8 12 16 28 0 gt Begin DMMU Registers Access 0 gt End DMMU Registers Access 0 gt Begin DMMU Registers Access 0 gt End DMMU Registers Access Stack 0 Stack 0 Stack 0 Stack 0 0 gt Success Checksum on Memory Validated 0 gt INFO No memory detected at Memory Channel 1 2 Rank 0 Stack 1 0 gt INFO No memory detected at Memory Channel 1 2 Rank 1 Stack 0 0 gt INFO No memory detected at Memory Channel 1 2 Rank 1 Stack 1 0 gt 0 gt End Probe and Setup
79. rvice Manual April 2007 hot swapping FRUs 3 1 l I O port front panel 1 5 identifying the chassis 1 8 J JBus I O interface 1 8 K knowledge database PSH 1 10 2 35 L L1 and L2 cache 1 8 latch release button hard drive 3 3 LEDs descriptions 2 9 2 10 Ethernet port chassis 2 12 OK 2 4 system interpreting 2 9 locating the server for maintenance 4 4 locating the server module 2 10 Locator LED 4 4 log files viewing 2 40 M MAC address label 1 8 MB server module FRU name 1 6 memory configuration 2 6 4 8 fault handling 2 5 overview 1 4 memory access crossbar 1 8 memory also see DIMMs message ID 2 35 messages file 2 39 mirrored disk 1 8 MSG ID online 1 10 2 35 N NVRAM system controller PROM 4 16 O OK LED 2 4 2 10 operating state determining 2 10 P parts replaceable 1 6 parts replacement see FRUs PCI EM adding 3 5 PCI ExpressModules adding 3 5 platform name 1 5 POST detected faults 2 4 reasons to run 2 26 POST detected faults 2 17 POST see also power on self test POST 2 22 Power button 2 11 power supply 1 5 power supply status displaying 2 18 powercycle command 2 15 2 27 powering off the system 4 3 poweroff command 2 15 4 4 poweron command 2 15 4 14 power on self test POST 2 4 about 2 22 ALOM CMT commands 2 22 configuration flowchart 2 24 error message example 2 32 error messages 2 32 example output 2 27 fault cleari
80. ssor contains the persistent storage for the system host ID and Ethernet MAC addresses The service processor also contains the ALOM CMT configuration including the IP addresses and ALOM CMT user accounts if configured This information will be lost unless the system configuration PROM NVRAM is removed and installed in the replacement service processor The PROM does not hold the fault data and the fault data will no longer be accessible when the service processor is replaced 4 16 Sun Blade T6300 Server Module Service Manual April 2007 FIGURE 4 9 Removing the System Configuration PROM NVRAM 4 3 5 Replacing the Service Processor Remove the replacement service processor from the package and place it on an antistatic mat Install the system configuration PROM that you removed from the faulty service processor The PROM is keyed to ensure proper orientation Locate the service processor slot on the motherboard assembly Ensure that the ejector levers are open Chapter 4 Replacing Cold Swappable Components 4 17 5 Holding the bottom edge of the service processor parallel to its socket carefully align the service processor so that each of its contacts is centered on a socket pin Ensure that the service processor is correctly oriented A notch along the bottom of the service processor corresponds to a tab on the socket Push firmly and evenly on both ends of the service processor until it is firmly seate
81. uggable Components 3 3 FIGURE 3 2 Hard Drive Locations Release Button and Latch 329 Replacing a Hard Drive or Installing a New Hard Drive The hard drive is physically addressed to the slot in which it is installed Note If you removed a hard drive ensure that you install the replacement drive in the same slot 1 If necessary remove the hard drive filler panel 2 Slide the drive into the bay until it is fully seated FIGURE 3 2 3 Close the latch to lock the drive in place 3 4 Sun Blade T6300 Server Module Service Manual April 2007 4 Perform administrative tasks to reconfigure the hard drive The procedures that you perform at this point depend on how your data is configured You might need to partition the drive create file systems load data from backups or have data updated from a RAID configuration m You can use the Solaris command cfgadm al to list all disks in the device tree including unconfigured disks m If the disk is not in the list such as with a newly installed disk you can use devfsadm to configure it into the tree See the devfsadm man page for details 3 3 Adding PCI ExpressModules The PCI ExpressModules PCI EMs plug into the Sun Blade 6000 chassis To verify installation and to set up the PCI EMs see m Sun Blade 6000 Modular System Service Manual 820 0051 a Advanced Lights out Management ALOM CMT v1 3 Guide 819 7981 m Sun Blade T6300 Server Module Administration Gu
82. ult Management 2 13 Flowchart of ALOM CMT Variables for POST Configuration 2 24 Hard Drive Locations and LEDs 3 3 Hard Drive Locations Release Button and Latch 3 4 Disconnecting the Cable Dongle 4 5 Removing the Sun Blade T6300 Server Module From the Sun Blade 6000 Chassis 4 6 Stack Five Server Modules or Fewer 4 7 Antistatic Mat and Wrist Strap 4 7 DIMM Test Button and DIMM Ejector LEDs 4 10 DIMM Installation Rules 4 11 Removing DIMMs 4 12 vii FIGURE 4 8 FIGURE 4 9 FIGURE 4 10 FIGURE 4 11 FIGURE 4 12 FIGURE 4 13 FIGURE A 1 FIGURE A 2 Ejecting and Removing the Service Processor 4 16 Removing the System Configuration PROM NVRAM 4 17 Removing the Disk Backplane Cables 4 19 Removing the Battery From the Service Processor 4 21 Replacing the Cover 4 22 Inserting the Server Module in the Chassis 4 23 Server Module Dimensions A 2 Motherboard Block Diagram A 3 viii Sun Blade T6300 Server Module Service Manual April 2007 Tables TABLE 1 1 TABLE 1 2 TABLE 1 3 TABLE 2 1 TABLE 2 2 TABLE 2 3 TABLE 2 4 TABLE 2 5 TABLE 2 6 TABLE 2 7 TABLE 2 8 TABLE 2 9 TABLE 4 1 TABLE A 1 Sun Blade T6300 Server Module Features 1 4 Interfaces With the Sun Blade 6000 Chassis 1 5 Sun Blade T6300 Server Module FRU List 1 6 Diagnostic Flowchart Actions 2 4 LED Behavior and Meaning 2 10 LED Behaviors With Assigned Meanings 2 10 Front Panel Buttons 2 11 Service Related ALOM CMT Comman
83. ults PSH detected faults are distinguished from other kinds of faults by the text Host detected fault Example sc gt showfaults ID FRU Fault 0 MB Host detected fault MSGID SUNW TESTO7 UUID 7ee0e46b ea64 6565 e684 e996963f 7b86 If no fault is reported you do not need to do anything else Do not perform the subseguent steps Run the clearfault command with the UUID provided in the showfaults output sc gt clearfault 7ee0e46b ea64 6565 e684 e996963f7b86 Clearing fault from all indicted FRUs Fault cleared 2 6 2 6 1 Collecting Information From Solaris OS Files and Commands With the Solaris OS running on the Sun Blade T6300 server module you have all the Solaris OS files and commands available for collecting information and for troubleshooting In the event that POST ALOM or the Solaris PSH features did not indicate the source of a fault check the message buffer and log files for notifications for faults Hard drive faults are usually captured by the Solaris message files Use the dmesg command to view the most recent system message To view the system messages log file view the contents of the var adm messages file Checking the Message Buffer 1 Log in as superuser Chapter2 Sun Blade T6300 Server Module Diagnostics 2 39 2 6 2 2 Issue the dmesg command dmesg The dmesg command displays the most recent messages generated by the system Viewing the Syst
84. un Blade T6300 Minimum system firmware 6 3 6 or subsequent compatible release Some USB 2 0 connectors are thick and may distort or damage the connector when you try to connect two USB 2 0 cables You can use a USB hub to avoid this problem Hardware handshaking signals are not transmitted through the RJ45 dongle connector only transmit receive and ground signals are transmitted TABLE 1 2 Interfaces With the Sun Blade 6000 Chassis Feature Description Ethernet ports PCI Express I O Remote management Power Cooling Up to two Network Express Module NEM connections in the Sun Blade 6000 chassis 10 100 1000 Mb autonegotiating FIGURE 1 5 Two x8 lane ports connect to Sun Blade 6000 chassis midplane Can support up to two PCI ExpressModules PCI EM FIGURE 1 5 ALOM CMT management controller on the service processor CLI maanagement telnet ssh and N1 system manager support Power is provided in the Sun Blade 6000 chassis Environmental controls are provided by the Sun Blade 6000 chassis For more information about Sun Blade 6000 chassis features and controls refer to the Sun Blade 6000 Modular System Service Manual 820 0051 Chapter 1 Sun Blade T6300 Server Module Product Description 1 5 Hard drive backplane cable kit 1 power cable 2 signal cables DIMMs Battery Service processor Hard drives FIGURE 1 4 Field Replaceable Units TABLE 1 3 Sun Blade T6300 Server Module F
85. un Microsystems Inc a les droits de propriete intellectuels relatants la technologie qui est d crit dans ce document En particulier et sans la limitation ces droits de propri t intellectuels peuvent inclure un ou plus des brevets am ricains num r s http www sun com patents et un ou les brevets plus suppl mentaires ou les applications de brevet en attente dans les Etats Unis et dans les autres pays Ce produit ou document est prot g par un copyright et distribu avec des licences qui en restreignent l utilisation la copie la distribution et la d compilation Aucune partie de ce produit ou document ne peut tre reproduite sous aucune forme par quelque moyen que ce soit sans l autorisation pr alable et crite de Sun et de ses bailleurs de licence s il y ena Le logiciel d tenu par des tiers et qui comprend la technologie relative aux polices de caract res est prot g par un copyright et licenci par des fournisseurs de Sun Des parties de ce produit pourront tre d riv es des syst mes Berkeley BSD licenci s par l Universit de Californie UNIX est une marque d pos e aux Etats Unis et dans d autres pays et licenci e exclusivement par X Open Company Ltd Sun Sun Microsystems le ogo Sun AnswerBook2 docs sun com Java OpenBoot SunSolve Sun VTS Sun Blade et Solaris sont des marques de fabrique ou des marques d pos es de Sun Microsystems Inc aux Etats Unis et dans d autres pays Toutes les
86. until it latches into place FIGURE 4 12 FIGURE 4 12 Replacing the Cover 4 6 2 Reinstalling the Server Module in the Chassis Caution Hold the server module firmly so that you do not drop it The server module weighs approximatley 14 to 17 pounds 6 4 8 0 kg 1 Turn the server module over so that the ejector levers are on the right side FIGURE 4 13 4 22 Sun Blade T6300 Server Module Service Manual April 2007 2 Push the server module into the chassis 3 Close the latches m The server module powers on with 3 3v standby power a The ALOM CMT software boots You can either press the Power button to fully power on the server module or use the ALOM poweron command For more information see the Advanced Lights out Management ALOM CMT v1 3 Guide 819 7981 FIGURE 4 13 Inserting the Server Module in the Chassis Chapter 4 Replacing Cold Swappable Components 4 23 4 24 Sun Blade T6300 Server Module Service Manual April 2007 APPENDIX A Specifications This appendix discusses the various specifications of the Sun Blade T6300 server module Topics covered are m Section A 1 Physical Specifications on page A 1 m Section A 2 Motherboard Block Diagram on page A 3 If you need specifications for the Sun Blade 6000 chassis see the Sun Blade 6000 Modular System Site Planning Guide 820 0426 A 1 Physical Specifications TABLE A 1 Exterior Dimensions Depth Width Height
87. y configured the system issues a message and the system does not boot 4 8 Sun Blade T6300 Server Module Service Manual April 2007 4 3 2 For more information about DIMMs see Section 2 1 1 4 Memory Fault Handling on page 2 7 Removing the DIMMs The DIMM ejectors have LEDs that indicate if a DIMM requires replacement m If the system is still powered on and installed in the chassis see Section 2 1 1 Memory Configuration and Fault Handling on page 2 5 to determine if a DIMM requires replacement m If the server module is powered off you can remove the server module from the chassis and test the DIMM LEDs on the motherboard to determine if the DIMMs require replacement Caution Ensure that you follow antistatic practices as described in Section 4 1 2 Electrostatic Discharge Safety on page 4 2 Perform the procedures described in Section 4 2 Common Procedures for Parts Replacement on page 4 3 Locate the DIMMs FIGURE 4 6 that you want to replace The server module has a DIMM test button on the motherboard Press the DIMM Locate button to illuminate the ejectors of the bad DIMMs Chapter 4 Replacing Cold Swappable Components 4 9 DIMM locate button FIGURE 4 5 DIMM Test Button and DIMM Ejector LEDs You can also use FIGURE 4 6 and TABLE 4 1 to identify the DIMMs you want to remove 4 10 Sun Blade T6300 Server Module Service Manual April 2007 DIMM locate
88. you do not need to precede with the following steps because the fault is cleared Run the clearfault command sc gt clearfault 7ee0e46b ea64 6565 e684 e996963 7b86 Switch to the system console sc gt console Issue the fmadm repair command with the UUID Use the same UUID that you used with the clearfault command in Step 9 fmadm repair 7ee0e46b ea64 6565 e684 e996963 7b86 Removing the Service Processor The service processor controls the host power and monitors host system events power and environmental The service processor holds a socketed EEPROM for storing the system configuration all Ethernet MAC addresses and the host ID Chapter 4 Replacing Cold Swappable Components 4 15 Caution The service processor card can be hot To avoid injury handle it carefully 1 Perform the procedures described in Section 4 2 Common Procedures for Parts Replacement on page 4 3 2 Locate the service processor card 3 Push down on the ejector levers on each side of the service processor until the card is released from the socket FIGURE 4 8 Ejecting and Removing the Service Processor 4 Grasp the top corners of the service processor and pull it out of the socket 5 Place the service processor card on an antistatic mat 6 Remove the system configuration PROM NVRAM FIGURE 4 9 from the service processor card and place the PROM on an antistatic mat The service proce
Download Pdf Manuals
Related Search
Related Contents
2009年ごろ(13.6MB) SMS alarm controller User manual Group 6 - Clutch FRONIUS IG 15 / 20 / 30 / 40 / 60 / 60 HV MSERV-05-00 - MANUAL DE SERVIÇO FBCN V - FB Toshiba Satellite P870/05P 取扱説明書 Copyright © All rights reserved.
Failed to retrieve file