Home
Sun SPARC Enterprise M3000 Server Service Manual
Contents
1. Chapter 6 Motherboard Unit Replacement 6 7 2 Hold the two opaque handles on both sides of the motherboard unit and lift the motherboard unit while sliding it out FIGURE 6 5 Removing the Motherboard Unit 3 Remove the motherboard unit from the server and place it on a conductive mat When removing the motherboard unit from the server take care not to damage the connector 4 Remove the DIMMs For details see Section 7 2 Removing the DIMMs on page 7 4 6 3 Mounting the Motherboard Unit 1 Hold the two opaque handles on both sides of the motherboard unit and slide the unit in while lowering it to align with the predetermined position from the front of the server When mounting the motherboard unit in the server take care not to damage the connector 6 8 SPARC Enterprise M3000 Server Service Manual October 2008 2 Install and tighten the four screws to secure the motherboard unit in position 6 4 Reassembling the Server 1 Align the shutter unit with the securing bracket on the power supply u
2. 0 ok probe scsi all pci 0 600000 pci 0 pci 0 scsi 0 MPT Version 1 05 Firmware Version 1 24 00 00 Target 0 Unit 0 Disk FUJITSU MAY2073RC 3701 143374738 Blocks 73 GB SASAddress 500000e0197292c2 PhyNum 0 Target 1 Unit 0 Disk FUJITSU MAY2073RC 3701 143374738 Blocks 73 GB SASAddress 500000e 019728f22 PhyNum 1 Target 2 Unit 0 Disk FUJITSU MAY2073RC 3701 143374738 Blocks 73 GB SASAddress 500000e019729002 PhyNum 2 Target 3 Unit 0 Disk FUJITSU MAY2073RC 3701 143374738 Blocks 73 GB SASAddress 500000e019729302 PhyNum 3 Target 4 Unit 0 Removable Read Only device MATSHITADVD RAM UJ875AS 1000 SATA device PhyNum 4 0 ok 6 Type the boot command to start the Solaris OS ok boot 4 18 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER D Internal Components Access This chapter explains how to access internal components m Section 5 1 Sliding the Server Into and Out of the Equipment Rack on page 5 1 m Section 5 2 Removing and Attaching the Top Cover on page 5 3 m Section 5 3 Removing and Attaching the Air Duct on page 5 4 m Section 5 4 Removing and Attaching the Fan Cover on page 5 7 5 1 SLL Sliding the Server Into and Out of the Equipment Rack This section explains how to slide the server out from the equipment rack and how to push it into the equipment rack For details of equipment racks
3. 1 Power off the server This procedure includes the steps of turning the mode switch on the operator panel to the Service position verifying that the POWER LED is off and disconnecting the power cord For details see Section 4 5 1 Powering off the Server on page 4 11 2 Slide the server out from the equipment rack For details see Section 5 1 1 Sliding the Server Out from the Equipment Rack on page 5 1 Caution To prevent the equipment rack from tipping over you must deploy the antitilt feature if applicable before you slide the server out of the equipment rack Note When the cable management arm is not mounted remove the cable ties that fix the PCI cable to the rear of the server and slide the server out 3 Remove the top cover For details see Section 5 2 1 Removing the Top Cover on page 5 3 Chapter 8 Replacement and Installation of PCle Cards 8 3 Removing a PCIe Card 8 2 Note If a new PCle card is to be installed in an empty disk slot remove the PCIe slot cover first 1 Remove the one screw securing the PCIe card t and then place it on a conductive 2 Pull a PCIe card straight up from the slo mat FIGURE 8 2 Removing a PCIe card Mounting a PCle Card 1 Insert a PCIe card in a slot 8 3 SPARC Enterprise M3000 Server Service Manual October 2008 8 4 Note A card must be firmly secured so that it is properly mounted
4. 2 Slide the server out from the equipment rack For details see Section 5 1 1 Sliding the Server Out from the Equipment Rack on page 5 1 Caution To prevent the equipment rack from tipping over you must deploy the antitilt feature if applicable before you slide the server out of the equipment rack Note When the cable management arm is not mounted remove the cable ties that fix the PCI cable to the rear of the server and slide the server out 3 Remove the top cover For details see Section 5 2 1 Removing the Top Cover on page 5 3 4 Remove the air duct from the server For details see Section 5 3 2 Removing the Air Duct on page 5 5 5 Remove the four screws on the front panel and then remove the front panel Chapter 15 Operator Panel Replacement 15 3 15 2 Removing the Operator Panel 1 Disconnect the signal cable from the operator pane 1 in FIGURE 15 2 2 Loosen the one thumbscrew 3 Pull the operator panel out to remove it and place it on a conductive mat 2 in FIGURE 15 2 FIGURE 15 2 Removing the Operator Panel pi ANA MY QA LAG ks SN RK NS il Thumbscrew 15 4 SPARC Enterprise M3000 Server Service Manual October 2008 15 3 Mounting the Operator Panel 1 Align the keyhole of the operator panel with the thumbscrew and gently push the operator panel toward the server If the operator panel will not move gently adjust
5. 3020 223b28ab 3020 223b283e 3020 223b2829 3020 223b2840 3020 223b2830 Other Other 4 10 SPARC Enterprise M3000 Server Service Manual October 2008 The showhardconf output continued OPNL Status Normal Ver 0101h Serial PP082202R8 FRU Part Number CA07082 D911 A1 541 3306 01 A PSU 0 Status Normal Serial EA08210127 FRU Part Number CA01022 0720 02B 300 2193 02 i Power Status On PSU 1 Status Normal Serial EA08210131 FRU Part Number CA01022 0720 02B 300 2193 02 i Power Status On FANBP B Status Normal Ver 0101h Serial PP0821031E FRU Part Number CA20399 B12X 004AA 541 3304 01 FAN At0 Status Normal FAN_A 1 Status Normal XSCF gt For details see the manual pages of showhardconf 2 Confirm the state of the status LEDs of the FRU For information on the LED status see TABLE 2 3 and TABLE 2 5 4 5 Cold Replacement Cold Addition In cold replacement all business operations must be stopped When accessing the server power off the server and disconnect the power cord to ensure safety For cold addition do the same operation as that for cold replacement 4 5 1 Powering off the Server This section explains how to power off the server 4 5 1 1 Power off by Using the XSCF Command 1 Notify users that the server is being powered off 2 Back up the system files and data to tape if necessary 3 Turn the mode switch on the operator panel to the Servi
6. If a hard disk drive cannot be inserted smoothly do not forcibly push the hard disk drive into the slot If the HDD is forcibly inserted despite the presence of any obstruction in a slot or any problems with the connector pin serious damage may result gt Caution Do not insert a hard disk drive into a slot when the eject lock lever is closed If an HDD is inserted in this condition it stops halfway and becomes difficult to pull out gt 1 Align the hard disk drive with the slot and then push the hard disk drive in until it stops 2 Fasten the eject lock lever 9 4 Reassembling the Server 1 Configure the hard disk drive in the domain This procedure includes the steps of using the cfgadm command to connect the hard disk drive to the domain and for verifying that it has been added to the domain For details see Section 4 3 3 Configuring a FRU in a Domain on page 4 6 2 Check the hardware This procedure includes the steps of running a program to verify that all the components have been remounted and to start the Solaris OS For details see Section 4 5 4 Verifying the Hardware Operation on page 4 15 3 Verify the state of the LED indicator of the hard disk drive For information on the LED status see TABLE 2 3 to TABLE 2 5 Chapter 9 Replacement and Installation of a Hard Disk Drive HDD 9 5 9 6 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 1 0 Re
7. domain to the completion of the Solaris OS start Chapter 3 Troubleshooting 3 9 3 4 3 Messaging Output To understand error conditions collect messaging output information For the collection of the information use the commands shown in TABLE 3 5 TABLE 3 5 Commands for Checking Messaging Output Command Operand Description showlogs XSCF env Displays the temperature history log The environmental temperature command fmdump Solaris OS command data and power status are indicated in 10 minute intervals The data is stored for a maximum of six months power Displays power and reset information event Displays information reported to the system and stored it as event logs error Displays error logs Displays FMA diagnostic results and errors This command is provided as a Solaris OS command and XSCF shell command Each error message logged by the predictive self healing architecture has a message ID and Web address associated with the message From this message ID and Web address information on the most up to date corrective measures can be retrieved For details of predictive self healing see the Solaris OS documents 39 3 10 Using Troubleshooting Commands When any message listed in TABLE 3 3 is displayed detailed information on the error may be required For details on troubleshooting commands see manual pages of the Solaris OS or XSCF shell This section provides detailed explanations of the fo
8. You are about to replace FAN_A 0 Do you want to continue r replace c cancel r Please confirm the Check LED is blinking If this is the case please replace FAN_A 0 After replacement has been completed please select f finish 4 8 SPARC Enterprise M3000 Server Service Manual October 2008 The replacefru command automatically tests the status of the component after the completion of removal and replacement Diagnostic tests for FAN_A 0 have started This operation may take up to 3 minute s progress scale reported in seconds AES 30 done Maintenance Replacement Menu Status of the replaced FRU FRU Status Maintenance Replacement Menu Please select a type of FRU to be replaced 1 FAN Fan Unit 2 PSU Power Supply Unit Select 1 2 c cancel e The display may vary depending on the XCP version When the tests are complete the program displays the original menu again To return to the XSCF shell prompt select cancel For details see the manual pages of replacefru Chapter 4 FRU Replacement Preparation 4 9 4 4 2 Verifying the Hardware Operation 1 Type the showhardconf command to confirm that the new component has been installed XSCF gt SPARC A Fa showhardconf Enterprise M3000 Serial IKK0813023 Power_Supply_System Single SCF ID XSCF 0 System Power On System_Phase Cabinet Power On MBU A Status Normal Ver 0101h Serial PP0822020X FRU Part Nu
9. 2 Install and tighten the one screw to secure the PCIe card in position 8 4 Reassembling the Server 1 Attach the top cover For details see Section 5 2 2 Attaching the Top Cover on page 5 4 2 Slide the server into the equipment rack and secure the stabilizer bar For details see Section 5 1 2 Sliding the Server into the Equipment Rack on page 5 3 3 Power on the server This procedure includes the steps of reconnecting the power cord checking the LED status and turning the mode switch on the operator panel to the Locked position For details see Section 4 5 3 1 Power on by Using the XSCF Command on page 4 13 Note If automatic startup of the Solaris OS is specified use the sendbreak d domain_id command of the XSCF shell to display the ok prompt after the display console banner is displayed but before the system starts booting the Solaris OS 4 Check the hardware This procedure includes the steps for running a program to verify that all the components have been remounted and to start the Solaris OS For details see Section 4 5 4 Verifying the Hardware Operation on page 4 15 Chapter 8 Replacement and Installation of PCle Cards 8 5 8 6 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 9 Replacement and Installation of a Hard Disk Drive HDD This chapter explains how to replace and install a hard disk drive m Section 9 1 Accessing a H
10. Connect all the power cords to AC power outlets 3 Verify that the XSCF STANDBY LED on the operator panel is on 4 Turn the mode switch on the operator panel to the desired mode position Locked or Service Chapter 4 FRU Replacement Preparation 4 13 5 Press the power button on the operator panel Soon the following activity is executed m The POWER LED on the operator panel is turned on m The power on self test POST is executed Then the server is completely powered on Note If automatic startup of the Solaris OS is specified use the sendbreak d domain_id command of the XSCF shell to display the ok prompt after the display console banner is displayed but before the system starts booting the Solaris OS 4 14 SPARC Enterprise M3000 Server Service Manual October 2008 4 5 4 Verifying the Hardware Operation 1 In response to the ok prompt press the ENTER key and enter default value and then press the period key The domain console is switched to the XSCF console 2 Use the showhardconf command to confirm that the new component has been installed XSCF gt showhardconf SPARC Enterprise M3000 Serial IKK0813023 Operator Panel Switch Locked Power Supply System Single SCF ID XSCF 0 System Power On System Phase Cabinet Power On Domain 0 Domain Status OpenBoot Execution Completed MBU_A Status Normal Ver 0101h Serial PP0822020X FRU Part Number CA07082 D901 A1 541 3
11. Remove the cables connected to the CD RW DVD RW drive unit 1 in FIGURE 11 2 At this time disconnect the cable while pressing the tab on the cable Chapter 11 CD RW DVD RW Drive Unit DVDU Replacement 11 3 2 While pushing the black lever upward 2 in FIGURE 11 2 push the CD RW DVD RW drive unit toward the front side to remove it from the server 3 in FIGURE 11 2 3 Place the drive unit on the conductive mat FIGURE 11 2 Removing the CD RW DVD RW Drive Unit 11 3 Mounting the CD RW DVD RW Drive Unit Caution Do not forcibly push the CD RW DVD RW drive unit to insert in the slot Forcibly pushing the component in a slot may damage the component or the server 1 Align the CD RW DVD RW drive unit with the drive slot and then slowly push the unit in until it locks in position 11 4 SPARC Enterprise M3000 Server Service Manual October 2008 2 11 4 Connect the cable for the CD RW DVD RW drive unit Reassembling the Server 1 Attach the top cover For details see Section 5 2 2 Attaching the Top Cover on page 5 4 Slide the server into the equipment rack and secure the stabilizer bar For details see Section 5 1 2 Sliding the Server into the Equipment Rack on page 5 3 Power on the server This procedure includes the steps of reconnecting the power cords and turning the mode switch on the operator panel to the Locked position For details see S
12. Uni XSB showdcl Displays the configuration information of a domain hardware resource information showfru Displays the setting information of a device For details see the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF User s Guide 3 3 2 1 Checking the Software Configuration The following procedure can be used to check the software configuration from the domain console Type showrev showrev The showrev command displays system configuration information on the screen 3 6 SPARC Enterprise M3000 Server Service Manual October 2008 3 3 2 2 3 3 2 3 Checking the Firmware Configuration User authority is required to check the firmware configuration The procedure below can be used to check the configuration from the maintenance terminal 1 Log in to XSCF shell 2 Type version XSCF gt version The version command displays firmware version information on the screen For details see the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF User s Guide Downloading Error Log Information To download error log information use the XSCF log fetch function The XSCF unit has an interface with external units so that the service engineer can easily obtain useful maintenance information such as error logs Connect the maintenance terminal and use the XSCF shell or XSCF Web to download error log information to the maintenance terminal 3 4 Error Conditions To int
13. driver not attached cpu driver not attached core driver not attached cpu driver not attached cpu driver not attached core driver not attached cpu driver not attached cpu driver not attached core driver not attached cpu driver not attached cpu driver not attached instance 0 ebus instance 0 pci flashprom driver not attached serial instance 0 scfc instance 0 panel instance 0 instance 0 pci instance 0 pci instance 1 scsi instance 0 tape disk sd instance 1 sd instance 0 instance 2 pci instance 0 network instance network instance instance 3 pci instance 1 network instance network instance pci instance 4 instance 1 pci instance 5 pci instance 6 pci instance 7 pci instance 8 os io driver not attached iscsi instance 0 pci pci pci pci 0 1 2 3 driver not attached driver not attached driver not attached driver not attached driver not attached Chapter 3 Troubleshooting 3 23 pseudo The prtconf output continued instance 0 3 6 4 Using the netstat Command The netstat command displays the network status and protocol statistics 3 6 4 1 Options TABLE 3 9 lists the options of the netstat command and how those options can help troubleshooting TABLE 3 9 Options for netstat Option Description How it can help i Displays the interface status The information Provides a
14. see Section 5 2 2 Attaching the Top Cover on page 5 4 Slide the server into the equipment rack and secure the stabilizer bar For details see Section 5 1 2 Sliding the Server into the Equipment Rack on page 5 3 Power on the server This procedure includes the steps of reconnecting the power cord checking the LED status and turning the mode switch on the operator panel to the Locked position For details see Section 4 5 3 1 Power on by Using the XSCF Command on page 4 13 14 6 SPARC Enterprise M3000 Server Service Manual October 2008 Note If automatic startup of the Solaris OS is specified use the sendbreak d domain_id command of the XSCF shell to display the ok prompt after the display console banner is displayed but before the system starts booting the Solaris OS 7 Check the hardware This procedure includes the steps of running a program to verify that all the components have been remounted and starting the Solaris OS For details see Section 4 5 4 Verifying the Hardware Operation on page 4 15 Chapter 14 Fan Backplane Replacement 14 7 14 8 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 15 Operator Panel Replacement This chapter explains how to replace the operator panel m Section 15 1 Accessing the Operator Panel on page 15 3 m Section 15 2 Removing the Operator Panel on page 15 4 m Section 15 3 Mounting the Operator Panel o
15. 2 SPARC Enterprise M3000 Server Service Manual October 2008 E 3 Configuration of Signal Lines This section provides the signal definitions FIGURE E 2 shows the configuration of signal lines when the UPS is connected FIGURE E 2 Connection between the Server and UPS BTL BPS UALM ACOFF SG UPS UPS cable Server TABLEE 1 Signal Line Definitions Signal name Definitions Pin number Remarks BPS UALM Indicates a UPS error 6 Normal state Off Abnormal state On BTL Indicates that the battery level of the UPS has 7 Normal state Off decreased and warns that battery power will be Abnormal warning unavailable after a certain period of time has On elapsed Note 1 Appendix E UPS Controller E 3 TABLEE 1 Signal Line Definitions Continued Signal name Definitions Pin number Remarks ACOFF Indicates that a power failure has occurred inthe 9 Normal state Off commercial power supply connected to the UPS Abnormal state On Note 2 SG Signal ground 5 ER Indicates that the server is operating 1 Do not connect Equipment Ready anything to this pin On Indicates that the contact is closed Off Indicates that the contact is open Note 1 Use a UPS that can normally supply power from the battery at least 10 to 60 seconds after BTL is turned on Note 2 Use a UPS that can normally supply power from the battery within 2 seconds of a momentary commercial power interruption even if ACOFF is not turned
16. A single rank B dual rank B 2 SPARC Enterprise M3000 Server Service Manual October 2008 B 2 Memory Mounting Conditions Memory mounting conditions are as follows m A maximum of eight DIMMs can be mounted They are mounted in units of four DIMMs in each of group A and group B m The DIMM capacity of group A must be equal to or greater than that of group B The DIMM capacity of group A must not be smaller than that of group B m The DIMMs included in the same group must have the same capacity and same rank Appendix B Memory Mounting Conditions B 3 FIGURE B 2 shows an example of mounting DIMM FIGURE B 2 Example Mounting DIMM MEM 00A MEM 01A 4GB dual rank MEM 02A MEM 03A MEM 00B MEM 01B 2GB single rank MEM 02B MEM 03B 1 Mount DIMMs in group A first B 4 SPARC Enterprise M3000 Server Service Manual October 2008 2 Mount DIMMs in group B DIMMs may not necessarily be mounted in group B Appendix B Memory Mounting Conditions B 5 B 6 SPARC Enterprise M3000 Server Service Manual October 2008 APPENDIX C FRU List This appendix shows the FRUs of the M3000 server The FRU list consists of the following items m Section C 1 Server Overview on page C 1 m Section C 2 Motherboard Unit on page C 2 m Section C 3 Drive on page C 5 m Section C 4 Power Supply Unit on page C 6 m Section C 5 Fan Unit on page C 7 C 1 Server Overview TABLE
17. Advanced Technology Attachment Packet Interface ATAPI Location Above hard disk drive Active replacement No Hot replacement No Cold replacement Yes For the replacement procedure see Chapter 11 C 4 C 6 Power Supply Unit The power supply unit provides the server with power received from the external AC power source The redundant configuration allows continued server operation if a power supply unit fails TABLE C 7 shows the power supply unit specifications TABLE C 7 Power Supply Unit Specifications Item Specifications Number of power cords 2 1 for each power supply unit Power cord length 3 m 9 84 ft Cooling fan unit 2 1 for each power supply unit Redundancy 1 1 redundant configuration Input voltage AC 100 to 120 V AC 200 to 240 V Rated current 4 80 A AC 100 to 120 V 2 59 A AC 200 to 240 V Frequency 50 60 Hz SPARC Enterprise M3000 Server Service Manual October 2008 TABLE C 7 Power Supply Unit Specifications Continued Item Specifications Power consumption 470 W AC 100 to 120 V 460 W AC 200 to 240 V Apparent power 480 VA AC 100 to 120 V 517 VA AC 200 to 240 V Maximum heat dissipation 1 603 7 BTU hr 1 692 kJ hr AC100 to 120V 1 569 6 BTU hr 1 656 kJ hr AC200 to 240V Power factor 0 98 AC 100 to 120 V for the maximum configuration 0 89 AC 200 to 240 V for the maximum configuration Active replacement Yes Hot replacement Yes Cold replacement Yes In th
18. C 1 lists the M3000 FRU components TABLE C 1 FRU components Cold Hot Active Cold Hot Active Component Redundant replacement replacement replacement addition addition addition Motherboard unit No Yes MBU_A Memory DIMM No Yes Yes PCIe card PCIe No Yes Yes Hard disk drive HDD Yes Yes Yes Yes Yes Yes Yes Hard disk drive backplane No Yes HDDBP CD RW DVD RW drive No Yes unit DVDU C 1 TABLE C 1 FRU components Continued Cold Hot Active Cold Hot Active Component Redundant replacement replacement replacement addition addition addition Power supply unit PSU Yes Yes Yes Yes Fan unit FAN_A Yes Yes Yes Yes Fan backplane FANBP_B No Yes Operator panel OPNL No Yes mA redundant configuration is created with the hard disk drive only when the disk mirroring software is used a If a hard disk drive is an unmirrored boot device it must be replaced by using the cold replacement proce dure However if a boot device can be disconnected by means of a Solaris OS function or disk mirroring soft ware function active replacement can also be performed The procedure for disconnecting a hard disk drive varies depending on the software being used For details see the manuals for the relevant software CZ C 2 1 Motherboard Unit The motherboard unit is the main circuit board of the M3000 server The following components connect to the motherboard unit m Memory DIMM on page C 2 m PCI
19. Inc aux Etats Unis et dans d autres pays Les produits portant les marques SPARC sont bas s sur une architecture d velopp e par Sun Microsystems Inc SPARC64 est une marques d pos e de SPARC International Inc utilis e sous le permis par Fujitsu Microelectronics Inc et Fujitsu Limited L interface d utilisation graphique OPEN LOOK et Sun a t d velopp e par Sun Microsystems Inc pour ses utilisateurs et licenci s Sun reconna t les efforts de pionniers de Xerox pour la recherche et le d veloppement du concept des interfaces d utilisation visuelle ou graphique pour l industrie de l informatique Sun d tient une license non exclusive de Xerox sur l interface d utilisation graphique Xerox cette licence couvrant galement les licenci s de Sun qui mettent en place l interface d utilisation graphique OPEN LOOK et qui en outre se conforment aux licences crites de Sun Droits du gouvernement am ricain logiciel commercial Les utilisateurs du gouvernement am ricain sont soumis aux contrats de licence standard de Sun Microsystems Inc et de Fujitsu Limited ainsi qu aux clauses applicables stipul es dans le FAR et ses suppl ments Avis de non responsabilit les seules garanties octroy es par Fujitsu Limited Sun Microsystems Inc ou toute soci t affili e de l une ou l autre entit en rapport avec ce document ou tout produit ou toute technologie d crit e dans les pr sentes correspondent aux garanties express
20. Online Online Online Online Online Preface Xv Documentation Support and Training Sun Function URL Documentation http www sun com documentation Support http www sun com support Training http www sun com training Sun Welcomes Your Comments Sun is interested in improving its documentation and welcomes your comments and suggestions You can submit your comments by going to http www sun com hwdocs feedback Please include the title and part number of your document with your feedback Sun SPARC Enterprise M3000 Server Service Manual part number 820 5683 10 xvi SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 1 Safety Precautions for Maintenance This chapter provides safety precautions required for maintenance m Section 1 1 ESD Precautions on page 1 1 m Section 1 2 Server Precautions on page 1 3 1 1 ESD Precautions To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the following safety precautions TABLE 1 1 ESD Precautions Item Precaution ESD jack wrist strap Connect the ESD connector to your server and wear the antistatic wrist strap when handling printed circuit boards See FIGURE 1 1 for the wrist strap connection destination Conductive mat An approved conductive mat provides protection from static damage when used with a wrist strap The mat also cushions and protects
21. PCIe 1 aa 8533 10b5 3 8 0 okay 0 8 pci pciex10b5 8533 N A pci 1 700000 pci 0 pci 8 00 PCIe 1 aa 8533 10b5 3 9 0 okay 0 8 pci pciex10b5 8533 N A pci 1 700000 pci 0 pci 9 Hardware Revisions System PROM revisions OBP 4 24 8 2008 04 23 15 15 Environmental Status Mode switch is in LOCK mode EE System Processor Mod ES SPARC64 VII mode 3 6 3 Using the prtconf Command Similar to the show devs command executed at the ok prompt the prtconf command displays the devices that are configured The prtconf command identifies hardware that is recognized by the Solaris OS If software applications are having problems with hardware but the hardware is not suspected of being faulty the prtconf command can be used to check whether the Solaris software recognizes the hardware and whether a driver for the hardware is loaded Chapter 3 Troubleshooting 3 21 3 6 3 1 Options TABLE 3 8 lists the options of the prtconf command and how those options can help troubleshooting TABLE 3 8 Options for prtconf Option Description How it can help No option Displays the device tree of devices recognized If a hardware device is recognized then it is by the operating system considered to be functioning properly If the message driver not attached is displayed for the device or sub device the
22. Release the hard disk drive from the domain This procedure includes the steps of using the cfgadm command to check the Ap Id and for releasing the hard disk drive For details see Section 4 3 1 Releasing a FRU from a Domain on page 4 5 9 2 Removing a Hard Disk Drive Note If a new hard disk drive is to be installed in an empty disk slot remove the HDD filler panel first 1 Push the square button on the front of the hard disk drive 1 in FIGURE 9 2 to release the eject lock lever 2 in FIGURE 9 2 2 Pull the eject lock lever 3 in FIGURE 9 2 so that the hard disk drive is pulled straight out from the unit 4 in FIGURE 9 2 Chapter 9 Replacement and Installation of a Hard Disk Drive HDD 9 3 3 Remove the hard disk drive and place it on a conductive mat FIGURE 9 2 Removing a Hard Disk Drive ED 228222 000000 m oe SOOD 000 ff Bf 000 2 9 g 9 g g g 22222222222222 22222222222220 2000090000 DODOOOODOOO III TIIGI ODOODOOODOOO 2D0DD0DDDD0000E D9D0DDDD0DDDD0E 282222820002228 22022222002220 DODDDDDDDDDDDE SOOOOODOOODODO OOOOOODODOOOOO 2222202220022090 222222222222209 2222022022222020 22222220022228 2220222220202220 QOQIDODQOBODOO 222222209022299 2282022220900220 222222902228028 000000808808 2222222220 2222222229 22222222222229 22222222222222 SPARC Enterprise M3000 Server Service Manual October 2008 9 4 Installing a Hard Disk Drive Caution
23. Section 4 5 1 Powering off the Server on page 4 11 2 Slide the server out from the equipment rack For details see Section 5 1 1 Sliding the Server Out from the Equipment Rack on page 5 1 Caution To prevent the equipment rack from tipping over you must deploy the antitilt feature if applicable before you slide the server out of the equipment rack Note When the cable management arm is not mounted remove the cable ties that fix the PCI cable to the rear of the server and slide the server out 3 Remove the top cover For details see Section 5 2 1 Removing the Top Cover on page 5 3 4 Remove the air duct from the server For details see Section 5 3 2 Removing the Air Duct on page 5 5 5 Removing the fan units For details see Section 13 2 Removing a Fan Unit on page 13 3 6 Press on the projections to remove the two tabs 1 in FIGURE 14 2 and incline the unit diagonally backward 2 in FIGURE 14 2 Chapter 14 Fan Backplane Replacement 14 3 7 Pull the fan cage upward to remove it 3 in FIGURE 14 2 FIGURE 14 2 Removing the Fan Cage 22222222222220 22222222222229 22222222222222 22222222222229 OOOOOOODOOOOOOO 22222222222220 222222222220222 22222222222222 2222222222222 222222222222292 22222222222229 22222222222220 n 22222222222222 4 XK OOOOOOOOOOOOOED g C SPARC Enterprise M3000 Server Service Manual October 2008 14 4 14 2 Remo
24. Status Normal Serial EA08210127 FRU Part Number CA01022 0720 02B 300 2193 02 Power Status On PSU 1 Status Normal Serial EA08210131 FRU Part Number CA01022 0720 02B 300 2193 02 Power Status On FANBP B Status Normal Ver 0101h Serial PP0821031E FRU Part Number CA20399 B12X 004AA 541 3304 01 E FAN At0 Status Normal FAN Atl Status Normal XSCF gt For details see the manual pages of showhardconf 3 Type the console command to switch from the XSCF console to the ok prompt domain console again XSCF gt console d 0 4 From the ok prompt type the show devs command to confirm that all the PCle cards are mounted 0 ok show devs pci 1 700000 pci 0 600000 pci 8 4000 cmp 400 0 pseudo mc 200 200 nvram pseudo console virtual memory memoryem0 aliases options openprom chosen packages 4 16 SPARC Enterprise M3000 Server Service Manual October 2008 The show devs output continued pci pci pci pci pci pci pci pci pci pci pci pci pci 1 700000 pci 0 1 700000 pci 0 pci 9 1 700000 pci 0 pci 8 1 700000 pci 0 pci 0 1 700000 pci 0 pci 9 pci 0 1 700000 pci 0 pci 9 pci 0 FJSV e2ta 4 1 1 700000 pci 0 pci 9 pci 0 FUSV e2tae 4 1 700000 pci 0 pci 8 pci 0 1 700000 pci 0 pci 8 pci 0 FJSV e2ta 4 1 1 700000 pci 0 pci 8 pci 0 FUSV e2tae 4 1 700000 pci 0 pci 0 pci 0 1 700000 pci 0 pci 0 pci 0 FJSV e2ta 4 1 1 7
25. WARRANTIES OF ANY KIND EXPRESS OR IMPLIED REGARDING SUCH PRODUCT OR TECHNOLOGY OR THIS DOCUMENT WHICH ARE ALL PROVIDED AS IS AND ALL EXPRESS OR IMPLIED CONDITIONS REPRESENTATIONS AND WARRANTIES INCLUDING WITHOUT LIMITATION ANY IMPLIED WARRANTY OF MERCHANTABILITY FITNESS FOR A PARTICULAR PURPOSE OR NON INFRINGEMENT ARE DISCLAIMED EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID Unless otherwise expressly set forth in such agreement to the extent allowed by applicable law in no event shall Fujitsu Limited Sun Microsystems Inc or any of their affiliates have any liability to any third party under any legal theory for any loss of revenues or profits loss of use or data or business interruptions or for any indirect special incidental or consequential damages even if advised of the possibility of such damages DOCUMENTATION IS PROVIDED AS IS AND ALL EXPRESS OR IMPLIED CONDITIONS REPRESENTATIONS AND WARRANTIES INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY FITNESS FOR A PARTICULAR PURPOSE OR NON INFRINGEMENT ARE DISCLAIMED EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID SO mad KA Adobe PostScript Copyright 2008 FUJITSU LIMITED 1 1 Kamikodanaka 4 chome Nakahara ku Kawasaki shi Kanagawa ken 211 8588 Japon Tous droits r serv s Entr e et revue tecnical fournies par Sun Microsystems Incl sur des parties de ce mat riel Sun Microsystems Inc et Fujitsu Limited
26. all the domains are stopped and then the server is powered off a FRU is operated Note Do not operate a target FRU while the OpenBoot PROM is running the ok prompt is displayed After stopping the relevant domain power off or starting the Solaris OS operate the target FRU FRU Installation For empty slots without hard disk drives or PCle cards the number of mounted FRUs can be changed from 1 to the maximum number as reguired There are some components that are tentatively mounted physically in the server If such a SPARC Enterprise M3000 Server Service Manual October 2008 component is a hard disk drive it is called an HDD dummy and if such a component is a PCle card it is called a PCle slot cover These components are necessary to protect the server from noise and to properly cool the server The same methods as those used for replacement are used for installation Note When installing a new component in an empty slot remove the HDD dummy or PCle slot cover and then install a new FRU TABLE 4 2 lists the access locations and applicable replacement methods for each FRU TABLE 4 2 FRU Access Locations and Replacement Methods Access Cold Hot Active Where to find the FRU location replacement replacement replacement procedure Motherboard unit Top Yes No No Chapter 6 MBU_A Memory DIMM Top Yes No No Chapter 7 PCIe card PCIe Top Yes No No Chapter 8 Hard disk drive HDD Front Yes Ye
27. and number of hops Comparing multiple routes can identify bottlenecks The following example shows output for the ping s command ping s san ff2 17 a C PING san ff2 17 a 56 data bytes 64 bytes from san ff2 17 a 10 1 67 31 icmp_seq 0 time 0 427 ms 64 bytes from san ff2 17 a 10 1 67 31 icmp_seq 1 time 0 194 ms san ff2 17 a PING Statistics 2 packets transmitted 2 packets received 0 packet loss round trip ms min avg max stddev 0 172 0 256 0 427 0 102 3 6 6 Using the ps Command The ps commands lists the status of processes If no option is specified the ps command outputs information about the processes that have the same execution user ID as the user who is executing this command and are controlled from the same control terminal as this command If any option is specified the output information is controlled according to the specified option 3 26 SPARC Enterprise M3000 Server Service Manual October 2008 3 6 6 1 Options TABLE 3 11 lists the options of the ps command and how those options can help troubleshooting TABLE 3 11 Options for ps Option Description How it can help e Displays information for every process Identifies the process ID and the executable files f Generates a full listing Provides the following process information user ID parent process ID time when executed and the paths to the executable files o option Enables configurable outp
28. confirm that the server and other components are operating correctly and to collect status information and log data on the server and components Work for mounting removing or replacing a specific component requires special tools including screwdrivers and an antistatic wrist strap These items are generally named maintenance tools and are listed in TABLE 4 1 TABLE 4 1 Maintenance Tools Item Part name Use 1 Phillips screwdriver No 2 2 Wrist strap For electrostatic control 3 Conductive mat For electrostatic control 4 SunVTS Test program 4 1 4 2 4 2 1 4 2 2 4 2 FRU Replacement and Installation Methods This section explains how to replace and install FRUs FRU Replacement There are three methods of replacing FRUs as follows m Active replacement A target FRU is operated while the Solaris OS of the domain to which the FRU belongs is operating The target FRU is operated by using Solaris OS commands or XSCF commands Because the power supply unit PSU and fan unit FAN do not belong to any domain they are operated by using XSCF commands regardless of the operating state of the Solaris OS m Hot replacement A target FRU is operated while the domain to which the FRU belongs is stopped Depending on the target FRU there are two cases as follows Power supply unit Fan unit operated with XSCF commands Hard disk drive operated directly not by using XSCF commands m Cold replacement After
29. de quelque type que ce soit de la part de Fujitsu Limited ou de Sun Microsystems Inc ou des soci t s affili es Ce document et le produit et les technologies qu il d crit peuvent inclure des droits de propri t intellectuelle de parties tierces prot g s par copyright et ou c d s sous licence par des fournisseurs Fujitsu Limited et ou Sun Microsystems Inc y compris des logiciels et des technologies relatives aux polices de caract res Par limites du GPL ou du LGPL une copie du code source r gi par le GPL ou LGPL comme applicable est sur demande vers la fin utilsateur disponible veuillez contacter Fujitsu Limted ou Sun Microsystems Inc Cette distribution peut comprendre des composants d velopp s par des tierces parties Des parties de ce produit pourront tre d riv es des syst mes Berkeley BSD licenci s par l Universit de Californie UNIX est une marque d pos e aux Etats Unis et dans d autres pays et licenci e exclusivement par X Open Company Ltd Sun Sun Microsystems le logo Sun Java Netra Solaris Sun Ray Answerbook2 docs sun com OpenBoot et Sun Fire sont des marques de fabrique ou des marques d pos es de Sun Microsystems Inc ou ses filiales aux Etats Unis et dans d autres pays Fujitsu et le logo Fujitsu sont des marques d pos es de Fujitsu Limited Toutes les marques SPARC sont utilis es sous licence et sont des marques de fabrique ou des marques d pos es de SPARC International
30. error statistics Provides information about devices manufacturer model number serial number size and errors n Displays names in a descriptive format The descriptive format helps identify devices x Reports extended drive statistics of each drive Similar to the e option but provides rate The output is in a tabular form information This helps identify internal devices with poor performance and other I O devices with poor performance across the network 3 18 SPARC Enterprise M3000 Server Service Manual October 2008 The following example shows output for the iostat command iostat En c0t0d0 Soft Errors 0 Hard Errors 0 Transport Errors 0 Model ST3120026A Revision 8 01 Serial No 3JT4H4C2 Size 120 03GB lt 120031641600 bytes gt Media Error 0 Device Not Ready 0 No Device 0 Recoverable 0 Illegal Request 0 c0t2d0 Soft Errors 0 Hard Errors 0 Transport Errors 0 Vendor LITE ON Product COMBO SOHC 4832K Revision O3K1 Serial No Size 0 00GB lt 0 bytes gt Media Error 0 Device Not Ready 0 No Device 0 Recoverable 0 Illegal Request 0 Predictive Failure Analysis 0 3 6 2 Using the prtdiag Command The prtdiag command displays system configuration and diagnostic information The diagnostic information identifies any failed FRU in the system The prtdiag command is located in the usr platform platform name sbin directory The prtdiag command may indicate a slot number different from t
31. ment stipul es dans le contrat de licence r gissant le produit ou la technologie fourni e SAUF MENTION CONTRAIRE EXPRESSEMENT STIPULEE DANS CE CONTRAT FUJITSU LIMITED SUN MICROSYSTEMS INC ET LES SOCIETES AFFILIEES REJETTENT TOUTE REPRESENTATION OU TOUTE GARANTIE QUELLE QU EN SOIT LA NATURE EXPRESSE OU IMPLICITE CONCERNANT CE PRODUIT CETTE TECHNOLOGIE OU CE DOCUMENT LESQUELS SONT FOURNIS EN L TAT EN OUTRE TOUTES LES CONDITIONS REPRESENTATIONS ET GARANTIES EXPRESSES OU TACITES Y COMPRIS NOTAMMENT TOUTE GARANTIE IMPLICITE RELATIVE A LA QUALIT MARCHANDE L APTITUDE UNE UTILISATION PARTICULI RE OU LABSENCE DE CONTREFA ON SONT EXCLUES DANS LA MESURE AUTORIS E PAR LA LOI APPLICABLE Sauf mention contraire express ment stipul e dans ce contrat dans la mesure autoris e par la loi applicable en aucun cas Fujitsu Limited Sun Microsystems Inc ou l une de leurs filiales ne sauraient tre tenues responsables envers une quelconque partie tierce sous quelque th orie juridique que ce soit de tout manque gagner ou de perte de profit de probl mes d utilisation ou de perte de donn es ou d interruptions d activit s ou de tout dommage indirect sp cial secondaire ou cons cutif m me si ces entit s ont t pr alablement inform es d une telle ventualit LA DOCUMENTATION EST FOURNIE EN L ETAT ET TOUTES AUTRES CONDITIONS DECLARATIONS ET GARANTIES EXPRESSES OU TACITES SONT FORMELLEMENT EXCLUES DANS LA M
32. quick overview of the network includes packets in out errors in out status collisions and queues i interval Repeats the setstat command in the Identifies intermittent or long duration intervals of as many seconds as specified after network events By piping setstat output to the i option a file overnight activity can be viewed all at once p Displays the media table Provides the MAC address for hosts on the subnet r Displays the routing table Provides routing information n Replaces host names with IP addresses and Used when an IP address is more useful than a displays them host name 3 24 SPARC Enterprise M3000 Server Service Manual October 2008 The following example shows the output for the netstat p command netstat p Net to Media Table IPv4 Device IP Address Mask Flags Phys Addr bge0 san ff1 14 a 255 255 255 255 o 00 14 4f 3a 93 61 bge0 san ff2 40 a 255 255 255 255 0 00 14 4f 3a 93 85 sppp0 224 0 0 22 255 255 255 255 bge0 san ff2 42 a 255 255 255 255 o 00 14 4f 3a 93 af bge0 san09 lab r01 66 255 255 255 255 o 00 e0 52 ec 1a 00 spppO 192 168 1 1 255 255 255 255 bge0 san ff2 9 b 255 255 255 255 o 00 03 ba dc af 2a bge0 bizzaro 255 255 255 255 o 00 03 ba 11 b3 c1 bge0 san ff2 9 a 255 255 255 255 o 00 03 ba dc af 29 bge0 racerx b 255 255 255 255 o 00 0b 5d dc 08 b0 bge0 224 0 0 0 240 0 0 0 SM 01 00 5e 00 00 00 3 6 5 Using the ping Command The ping command sends an ICMP ECHO _ REQUEST packet to a
33. small parts that are attached to printed circuit boards ESD safe packaging box Place a printed board or component in the ESD safe packaging box after you remove it FIGURE 1 1 Wrist Strap Connection Destination m FRU other than the units on the left 1 place either at the upper right on the front of the server or at the upper left at the back m Hard disk drive or fan unit One of 2 thumbscrews on the front of the server Caution Do not connect the wrist strap cable to the conductive mat Connect it directly to the server The wrist strap and FRU must have the same level of potential 1 2 SPARC Enterprise M3000 Server Service Manual October 2008 1 2 1 21 1 22 Server Precautions When maintaining the server observe the following precautions for your protection m Follow all cautions warnings and instructions marked on the server Caution Do not insert any object in an opening of the server If any object comes into contact with a high voltage part or short circuits a component fire or electric shock might result m Refer servicing of the server to the service engineer Electrical Safety Precautions m Ensure that the voltage and frequency of the AC power source to be used matches the electrical rating labels on the server m Wear antistatic wrist straps when handling hard disk drives motherboard units or other printed circuit boards m Use grounded power outlets as described in
34. the position of the operator panel so that the light pipe on the front of the operator panel is aligned with the socket on the server front Connect the signal cable of the operator panel Attach the thumbscrew Tighten the four screws to install the front panel 15 4 Reassembling the Server 1 Attach the air duct For details see Section 5 3 3 Attaching the Air Duct on page 5 6 Attach the top cover For details see Section 5 2 2 Attaching the Top Cover on page 5 4 Slide the server into the equipment rack and secure the stabilizer bar For details see Section 5 1 2 Sliding the Server into the Equipment Rack on page 5 3 Power on the server This procedure includes the steps of reconnecting the power cord checking the LED status and turning the mode switch on the operator panel to the Locked position For details see Section 4 5 3 1 Power on by Using the XSCF Command on page 4 13 Check the hardware This procedure includes the steps of running a program to verify that all the components have been remounted and starting the Solaris OS For details see Section 4 5 4 Verifying the Hardware Operation on page 4 15 Chapter 15 Operator Panel Replacement 15 5 15 6 SPARC Enterprise M3000 Server Service Manual October 2008 APPENDIX A Components List This appendix explains the server nomenclature and component numbering If multiple FRUs of the same type are mounted the s
35. to start the Solaris OS For details see Section 4 5 4 Verifying the Hardware Operation on page 4 15 Chapter 7 Replacement and Installation of Memory 7 5 7 6 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 8 Replacement and Installation of PCIe Cards This chapter explains how to replace and install PCIe cards m Section 8 1 Accessing a PCle Card on page 8 3 m Section 8 2 Removing a PCle Card on page 8 4 m Section 8 3 Mounting a PCIe Card on page 8 4 m Section 8 4 Reassembling the Server on page 8 5 PCle cards are hot replacement components The same methods as those for replacement are used for installation of PCle cards 8 1 FIGURE 8 1 shows the locations of the PCIe slots PCle Slot Locations FIGURE 8 1 Component Location number PCle 0 PCle 1 PCIe slot PCIe 2 PCle slot PCle slot PCle slot PCIe 3 SPARC Enterprise M3000 Server Service Manual October 2008 8 2 Accessing a PCle Card Caution There is a risk of electrical failure if the power cords are not disconnected All the power cords must be disconnected to completely cut the power to the server Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions See Section 1 1 ESD Precautions on page 1 1
36. 00000 pci 0 pci 0 pci 0 FUSV e2tae 4 pci 0 600000 pci 0 pci 0 600000 pci 0 pci 8 pci 0 600000 pci 0 pci 2 pci 0 600000 pci 0 pci l pci 0 600000 pci 0 pci 0 pci 0 600000 pci 0 pci 8 pci 0 pci 0 600000 pci 0 pci 8 pci 0 FISV e2ta 4 1 pci 0 600000 pci 0 pci 8 pci 0 FISV e2ta 4 pci 0 600000 pci 0 pci 2 pci 0 pci 0 600000 pci 0 pci 2 pci 0 network 4 1 pci 0 600000 pci 0 pci 2 pci 0 network 4 pci 0 600000 pci 0 pci l pci 0 pci 0 600000 pci 0 pci 1 pci 0 network 4 1 pci 0 600000 pci 0 pci 1 pci 0 networke4 pci 0 600000 pci 0 pci 0 scsi 0 pci 0 600000 pci 0 pci 0 scsi 0 disk pci 0 600000 pci 0 pci 0 scsi 0 tape pci 8 4000 ebus 1 pci 8 4000 ebus 1 panel 14 280030 pci 8 4000 ebus 1 scfc 14 200000 pci 8 4000 ebus 1 serial 14 400000 pci 8 4000 ebus 1 flashprom 10 0 cmp 400 0 core 3 cmp 400 0 core 2 cmp 400 0 core l cmp 400 0 core 0 cmp 400 0 core 3 cpu l cmp 400 0 core 3 cpu 0 cmp 400 0 core 2 cpu 1 cmp 400 0 core 2 cpu 0 cmp 400 0 core 1 cpu 1 cmp 400 0 core 1 cpu 0 Chapter 4 FRU Replacement Preparation 4 17 The show devs output continued cmp 400 0 core 0 cpu l1 cmp 400 0 core 0 cpu 0 openprom client services packages obp tftp packages terminal emulator packages disk label packages deblocker packages SUNW builtin drivers packages SUNW probe error handler 0 ok 5 Type the probe scsi all command to confirm that the storage devices are mounted
37. 13 3 5 4 Using the fmdump Command 3 14 3 5 4 1 fmdump VCommand 3 14 3 5 4 2 fmdump e Command 3 15 3 5 5 Using the fmadm Command 3 15 3 5 5 1 Using the fmadm faultyCommand 3 15 3 5 5 2 fmadm repairCommand 3 16 3 5 5 3 fmadm configCommand 3 16 3 5 6 Using the fmstat Command 3 17 3 6 General Solaris Troubleshooting Commands 3 17 3 6 1 Using the iostat Command 3 18 3 6 1 1 Options 3 18 3 6 2 Using the prtdiag Command 3 19 vi SPARC Enterprise M3000 Server Service Manual October 2008 3 6 2 1 Options 3 19 3 6 3 Using the prtconf Command 3 21 6 3 1 Options 3 22 3 6 4 sing thenetstat Command 3 24 6 4 1 Options 3 24 6 5 1 Options 3 25 3 6 6 sing the ps Command 3 26 6 6 1 Options 3 27 3 U 3 3 6 5 Using the ping Command 3 25 3 U 3 U 3 6 7 sing the prstat Command 3 27 3 6 7 1 Options 3 28 FRU Replacement Preparation 4 1 4 1 4 2 4 3 4 4 4 5 Tools Required for Maintenance 4 1 FRU Replacement and Installation Methods 4 2 42 1 FRU Replacement 4 2 4 2 2 FRU Installation 4 2 Active Replacement Active Addition 4 5 4 3 1 Releasing a FRU from a Domain 4 5 43 2 FRU Removal and Replacement 4 6 4 3 3 Configuring a FRU ina Domain 4 6 4 3 4 Verifying the Hardware Operation 4 7 Hot Replacement Hot Addition 4 7 44 1 FRU Removal and Replacement 4 8 442 Verifying the Hardware Operation 4 10 Cold Replacement Cold Addition 4 11 4 5 1 Powering off the Server 4 11 4 5 1 1 Power off by Using the XSCF Com
38. 14 1 fan cage 14 4 fan cover 5 7 fan unit FAN_A 13 1 14 2 C 7 flowchart 3 2 front panel 2 1 FRU components C 1 H hard disk drive HDD 10 1 Hard Disk Drive Backplane HDDBP 10 2 hardware overview 2 1 K key 2 4 L LED 2 8 2 10 M maintenance tool 4 1 mode switch 2 6 motherboard unit MBU 6 1 CPU C 4 memory DIMM C 2 O operator panel 2 4 15 2 P PCle card 8 1 PCle slot 8 1 power supply unit PSU 12 1 C 6 Index 1 power switch 2 6 Predictive self healing 3 8 R rear panel 2 12 replacement method active replacement 4 5 cold replacement 4 11 hot replacement 4 7 S safety precautions 1 3 server precautions 1 3 shutter unit 6 4 slide rail 5 1 storage device CD RW DVD RW drive unit DVDU C 6 hard disk drive 9 1 C 5 switch 2 6 T top cover 5 3 5 4 troubleshooting command 3 10 U UPS controller UPC E 1 Cable Connector E 6 Configuration of Signal Lines E 3 Connection E 7 overview E 1 Power supply conditions E 5 Signal Cable E 2 UPC Port E 1 X XSCF unit C 4 Index 2 SPARC Enterprise M3000 Server Service Manual October 2008
39. 22222229 22222022222220 DODDDODDIDODODE 22222222222222 22222222222220 22222222222209 22222222222220 222222202222220 22222222222222 22222222222220 222222222222200 y ODOOOOOOOOOOOOO y FIGURE 5 5 Removing the Fan Cover 5 7 Internal Components Access Chapter 5 5 4 2 Attaching the Fan Cover 1 Align the tab on the left end of the fan cover in the predetermined position and then secure the fan cover in position 2 Tighten the one screw on the right side of the fan cover 3 Push the server back into the equipment rack For details see Section 5 1 2 Sliding the Server into the Equipment Rack on page 5 3 5 8 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 6 Motherboard Unit Replacement This chapter explains how to replace the motherboard unit m Section 6 1 Accessing the Motherboard Unit on page 6 4 m Section 6 2 Removing the Motherboard Unit on page 6 7 m Section 6 3 Mounting the Motherboard Unit on page 6 8 m Section 6 4 Reassembling the Server on page 6 9 The motherboard unit is a cold replacement component The server must be completely powered off the power cords must be disconnected and all DIMMs and PCle cards must be removed before the motherboard unit is replaced See Chapter 7 Replacement and Installation of Memory and Chapter 8 Replacement and Installation of PCIe Cards Note Do not replace the mother
40. 302 01 CPU Status Normal Freq 2 520 GHz Type 32 Core 4 Strand 2 Memory_Size 8 GB MEM 0A Status Normal Code ce0000000000000001M3 93T2950EZA CE6 4145 473b3c23 Type 1A Size 1 GB MEM 0B Status Normal Code 7f7ffe00000000004aEBE10RD4AJFA 5C E 3020 223b2918 Type 1A Size 1 GB MEM 1A Status Normal Code 7f7ffe00000000004aEBE10RD4AJFA 5C E 3020 223b28af Type 1A Size 1 GB MEM 1B Status Normal Code 7f7ffe00000000004aEBE10RD4AJFA 5C E 3020 223b28ab Type 1A Size 1 GB MEM 2A Status Normal Code 7 7 e00000000004aEBE1 0RD4AUFA 5C 1 Type 1A Size 1 GB MEM 2B Status Normal Code 7 7 e00000000004aEBE1 0RD4ANFA 5C 1 Type 1A Size 1 GB MEM 3A Status Normal Code 7 7 e00000000004aEBE10RD4ANFA 5C E 3020 223b2840 Type 1A Size 1 GB MEM 3B Status Normal Code 7 7 e00000000004aEBE10RD4AJFA 5C E 3020 223b2830 Type 1A Size 1 GB PCI 0 Name_Property fibre channel Card_Type Other PCI 1 Name_Property fibre channel Card_Type Other T Gl 3020 223b283e Lii 3020 223b2829 T Chapter 4 FRU Replacement Preparation 4 15 The showhardconf output continued PCI 2 Name_Property pci Card_Type Other PCI 3 Name_Property pci Card_Type Other OPNL Status Normal Ver 0101h Serial PP082202R8 FRU Part Number CA07082 D911 A1 541 3306 01 PSU 0
41. 4ATFA 5C E 3020 223b28ab Type 1A Size 1 GB MEM 2A Status Normal Code 7 7 fe00000000004aEBE1ORD4ATFA 5C E 3020 223b283e Type 1A Size 1 GB MEM 2B Status Normal Code 7 7 fe00000000004aEBE1ORD4ATFA 5C E 3020 223b2829 Type 1A Size 1 GB MEM 3A Status Normal Code 7 7 fe00000000004aEBE1ORD4ATFA 5C E 3020 223b2840 Type 1A Size 1 GB MEM 3B Status Normal Code 7 7 fe00000000004aEBE1ORD4ATFA 5C E 3020 223b2830 Type 1A Size 1 GB Chapter 3 Troubleshooting 3 11 The showhardconf output continued PCI 0 Name Property fibre channel Card_Type Other PCI 1 Name Property fibre channel Card Type Other PCI 2 Name Property pci Card Type Other PCI 3 Name _Property pci Card Type Other OPNL Status Normal Ver 0101h Serial PP0829045Y FRU Part Number CA07082 D912 AO 541 3306 01 i PSU 0 Status Normal Serial EA08260208 FR Part Number CA01022 0720 03C 300 2193 03 i Power_Status On PSU 1 Status Normal Serial EA08260210 FR Part Number CA01022 0720 03C 300 2193 03 Power_Status On FANBP_B Status Normal Ver 0101h Serial PPO82704TD FR Part Number CA20399 B12X 006AB 541 3304 02 FAN_A 0 Status Normal FAN_A 1 Status Normal For details see the showhardconf manual pages 5 92 Using the showlogs Command The showlogs command displays information of specified logs in the order o
42. C Enterprise M3000 Server Service Manual October 2008 10 4 5 Loosen the two screws to remove the hard disk drive backplane FIGURE 10 3 Removing the Hard Disk Drive Backplane 6 Place the hard disk drive backplane on a conductive mat 10 3 Mounting the Hard Disk Drive Backplane 1 Mount the hard disk drive backplane 2 Tighten the two screws to secure the hard disk drive backplane 3 Secure the two cables to the rear of the hard disk drive backplane Caution Do not forcibly mount the hard disk drive in a slot Forcibly pushing the component in a slot may damage the component or the server Chapter 10 Replacing the Hard Disk Drive Backplane 10 5 10 4 10 6 4 Mount the hard disk drives For details see Section 9 3 Installing a Hard Disk Drive on page 9 5 5 Mount the CD RW DVD RW drive unit For details see Section 11 3 Mounting the CD RW DVD RW Drive Unit on page 11 4 Reassembling the Server 1 Attach the air duct For details see Section 5 3 3 Attaching the Air Duct on page 5 6 Attach the top cover For details see Section 5 2 2 Attaching the Top Cover on page 5 4 Slide the server into the equipment rack and secure the stabilizer bar For details see Section 5 1 2 Sliding the Server into the Equipment Rack on page 5 3 Power on the server This procedure includes the steps of reconnecting the power cord checking the LED st
43. D 5 GND 5 GND 6 RXD 3 TXD 8 CTS 7RTS D 4 SPARC Enterprise M3000 Server Service Manual October 2008 APPENDIX E UPS Controller This appendix explains the UPS controller UPC that controls the uninterruptible power supply UPS unit E 1 Section E 1 Overview on page E 1 Section E 2 Signal Cable on page E 2 Section E 3 Configuration of Signal Lines on page E 3 Section E 4 Power Supply Conditions on page E 5 Section E 5 UPS Cable on page E 6 Section E 6 Connections on page E 7 Overview The UPS is a unit to ensure a stable supply of power to the system even in the event of a power supply failure or a massive blackout When the UPC port on the server is connected to the UPS signal cable that has a UPC interface if the UPS detects a power failure it notifies the server of the failure so that the server can perform emergency shutdown processing This stops the server safely E 1 FIGURE E 1 shows the location of the UPC port on the M3000 server FIGURE E 1 UPC Port Locations Rear view A s 12000000 GS 000 gt a a RH ET gt a oo SESE co UPC1 port UPCO port E 2 Signal Cable Use the shielded pair cable with the following specifications m Direct current resistance roundtrip pair 400 W km or less m Cable length Up to 10 m 33 ft E
44. ESURE AUTORISEE PAR LA LOI APPLICABLE Y COMPRIS NOTAMMENT TOUTE GARANTIE IMPLICITE RELATIVE A LA QUALITE MARCHANDE A LAPTITUDE A UNE UTILISATION PARTICULIERE OU A L ABSENCE DE CONTREFACON Contents Preface xiii Safety Precautions for Maintenance 1 1 1 1 ESD Precautions 1 1 1 2 Server Precautions 1 3 12 1 Electrical Safety Precautions 1 3 1 2 2 Equipment Rack Safety Precautions 12 3 Component Handling Precautions Hardware Overview 2 1 2 1 Name of Each Part 2 1 2 2 Operator Panel 2 4 2 21 Operator Panel Overview 2 5 2 2 2 Switches on the Operator Panel 2 6 2 2 3 LEDs on the Operator Panel 2 8 2 3 LED Functions of Components 2 10 2 4 External Interface Port on Rear Panel 2 12 2 5 Labels 2 16 Troubleshooting 3 1 3 1 Emergency Power Off 3 1 1 3 1 4 3 2 Failure Diagnostic Method 3 2 3 3 Checking the Server and System Configuration 3 4 3 3 1 Checking the Hardware Configuration and FRU Status 3 4 3 3 1 1 Checking the Hardware Configuration 3 5 3 3 2 Checking the Software and Firmware Configurations 3 5 3 3 2 1 Checking the Software Configuration 3 6 3 3 2 2 Checking the Firmware Configuration 3 7 3 3 2 3 Downloading Error Log Information 3 7 3 4 Error Conditions 3 7 3 4 1 Predictive Self Healing Tools 3 8 3 4 2 Monitoring Output 3 9 3 4 3 Messaging Output 3 10 3 5 Using Troubleshooting Commands 3 10 3 5 1 Using the showhardconf Command 3 11 3 5 2 Using the showlogs Command 3 12 3 5 3 Using the showstatus Command 3
45. Enterprise M3000 server with one exception When the motherboard of a Sun SPARC Enterprise M3000 server must be removed and that server is mounted above waist high in the cabinet two engineers or a platform must be used for safety xiii How This Document Is Organized This document is organized into the following chapters Chapter 1 explains safety precautions required for maintenance Chapter 2 explains the names of components and the LEDs on the operator panel and rear panel Chapter 3 explains fault diagnosis information Chapter 4 explains the method of preparing for the safe replacement of FRUs Chapter 5 explains how to access internal components Chapter 6 explains how to replace the motherboard unit Chapter 7 explains how to replace and install memory DIMMs Chapter 8 explains how to replace and install PCIe cards Chapter 9 explains how to replace and install a hard disk drive Chapter 10 explains how to replace the hard disk drive backplane Chapter 11 explains how to replace the CD RW DVD RW drive unit Chapter 12 explains how to replace a power supply unit Chapter 13 explains how to replace a fan unit Chapter 14 explains how to replace the fan backplane Chapter 15 explains how to replace the operator panel Appendix A explains the server nomenclature and component numbering Appendix B explains memory DIMM mounting conditions Appendix C explains FRUs Appendix D explains connector specifications for exter
46. Extends the PCI data transfer rate to twice the value of PCI X TABLE C 3 PCIe slot specifications Item Description Maximum number of PCIe cards 4 Location Rear of the server Active replacement No Hot replacement No Cold replacement Yes For the replacement procedure see Chapter 8 Appendix C FRU List C 3 C 2 3 C 2 4 C 4 CPU The SPARC 64 VII processor is mounted on the CPU The SPARC64 VII processor has the following features m Design adopting chip multithreading CMT in which a CPU seguentially performs multiple processes m CPU consisting of four core processors m SPARC instruction set architecture ISA m Visual instruction set VIS extended instructions that accelerate processing of multimedia networking encryption and Java TABLE C 4 shows the CPU specifications TABLE C 4 CPU Specifications Item Description Maximum number of CPUs 1 Number of CPU cores 4 cores Location On the motherboard unit Because the CPU is mounted on the motherboard unit it cannot be replaced singly For information on how to replace the motherboard unit see Chapter 6 XSCF Unit The service processor set that operates and manages the server is mounted in the XSCF unit The XSCF unit diagnoses and starts the entire system configures domains and detects and reports errors SPARC Enterprise M3000 Server Service Manual October 2008 The XSCF units provides the following functions by means of the
47. Manager Self Diagnosis 1 0 active 1 0 Retire Agent 1 0 active SNMP Trap Generation Agent 1 0 active SysEvent Transport Agent 1 0 active Syslog Messaging Agent 1 0 active ZFS Diagnosis Engine 1 0 active ZFS Retire Agent 3 16 SPARC Enterprise M3000 Server Service Manual October 2008 3 5 6 Using the fmstat Command The fmstat command reports statistical information and a set of modules that are associated with the module called Solaris Fault Manager By using the fmstat command statistical information about the diagnostic engine and diagnostic agent that are currently involved in fault management can be displayed The following output example shows that the fmd self diagnosis DE module displayed also on the console output has received accepted events fmstat module eft io retire syslog msgs zfs retire cpumem diagnosis cpumem retire disk transport event transport fabric xlate fmd self diagnosis snmp trapgen sysevent transport zfs diagnosis ev_recv ev_acpt wait svc_t w b open solve memsz bufsz 0 0 0 0 0 0 0 0 0 0 3 0K 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1793 8 0 0 0 0 40b 0 0 0 0 0 0 0 0 0 0 0 1 2M 0 0 0 0 0 0 0 0 0 0 0 210b 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 32b 0 0 0 0 0 2395 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 6 General Solaris Troubleshooting Commands Sup
48. PCle cards and shutter unit FIGURE 6 2 Locations of DIMMs PCIe Cards and Shutter Unit a N N lt TS SG CL EE 14 GEL oJ L L ae ll Z Location number Component 1 Memory DIMM 2 PCle card 3 Shutter unit 6 1 Accessing the Motherboard Unit disconnected All the power cords must be disconnected to completely cut the power Caution There is a risk of electrical failure if the power cords are not to the server 6 4 SPARC Enterprise M3000 Server Service Manual October 2008 Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions See Section 1 1 ESD Precautions on page 1 1 Power off the server This procedure includes the steps of turning the mode switch on the operator panel to the Service position verifying that the POWER LED is off and disconnecting the power cord For details see Section 4 5 1 Powering off the Server on page 4 11 Remove all the cables from the external interface block on the rear panel Caution To prevent the equipment rack from tipping over you must deploy the antitilt feature if applicable before you slide the server out of the equipment rack Note When the cable management arm is not mounted remove the cable ties that fix the PCI cable to the rear of the server and slide the server out 10 11 Pull the pow
49. S KYUN microsystems Sun SPARC Enterprise M3000 Server Service Manual Sun Microsystems Inc www sun com Part No 820 5683 10 October 2008 Revision A we weve Nt ati http www sun com hwdocs feedback Copyright 2008 FUJITSU LIMITED 1 1 Kamikodanaka 4 chome Nakahara ku Kawasaki shi Kanagawa ken 211 8588 Japan All rights reserved Sun Microsystems Inc provided technical input and review on portions of this material Sun Microsystems Inc and Fujitsu Limited each own or control intellectual property rights relating to products and technology described in this document and such products technology and this document are protected by copyright laws patents and other intellectual property laws and international treaties The intellectual property rights of Sun Microsystems Inc and Fujitsu Limited in such products technology and this document include without limitation one or more of the United States patents listed at http www sun com patents and one or more additional patents or patent applications in the United States or other countries This document and the product and technology to which it pertains are distributed under licenses restricting their use copying distribution and decompilation No part of such product or technology or of this document may be reproduced in any form by any means without prior written authorization of Fujitsu Limited and Sun Microsystems Inc and their applicable licensors i
50. Top No No No CD RW DVD RW drive unit Front top No No No DVDU Power supply unit PSU Rear No No No Fan unit FAN_A Top No No No Fan backplane FANBP_B Top No No No Operator panel OPNL Front top No No No The FRU is operated directly without using XSCF commands 4 4 SPARC Enterprise M3000 Server Service Manual October 2008 4 3 4 3 1 Active Replacement Active Addition In active replacement the target FRU is operated while the Solaris OS of the domain to which the FRU belongs is operating The target FRU is operated using Solaris OS commands or XSCF commands Because the power supply unit PSU and fan unit FAN do not belong to any domain they are operated by using XSCF commands regardless of the operating state of the Solaris OS Active replacement has the following four stages m Releasing a FRU from a Domain on page 4 5 m FRU Removal and Replacement on page 4 6 m Configuring a FRU in a Domain on page 4 6 m Verifying the Hardware Operation on page 4 7 For active installation see Section 4 3 3 Configuring a FRU in a Domain on page 4 6 and Section 4 3 4 Verifying the Hardware Operation on page 4 7 Releasing a FRU from a Domain 1 From the Solaris OS type the cfgadm command to obtain the component status t cfgadm a 2 Stop the application from using the component and disconnect the component from the Solaris OS The READY LED green of t
51. Type the cfga m x command to confirm that the CHECK LED is off cfgadm x led fault mode off Ap Id The Ap Id is shown in the output of cfgadm for example disk 0 The CHECK LED amber of the HDD is turned off 4 6 SPARC Enterprise M3000 Server Service Manual October 2008 3 Type the cfgadm command to verify that the component has been configured The configured component is displayed as being configured The READY LED green of the HDD goes on 4 3 4 Verifying the Hardware Operation m Confirm the status of the LED indicators For information on the LED status see TABLE 2 3 and TABLE 2 5 4 4 Hot Replacement Hot Addition In hot replacement the target FRU is operated while the domain to which the FRU belongs is stopped Depending on the target FRU there are two cases as follows m Power supply unit Fan unit operated with XSCF commands m Hard disk drive operated directly not by using XSCF commands For hot addition do the same operation as that for hot replacement Chapter 4 FRU Replacement Preparation 4 7 44 1 FRU Removal and Replacement Type the replacefru command from the XSCF shell prompt XSCF gt replacefru Maintenance Replacement Menu Please select a type of FRU to be replaced 1 FAN Fan Unit 2 PSU Power Supply Unit Maintenance Replacement Menu Please select a FAN to be replaced No FRU Status 1 FAN_A 0 Normal 2 FAN_A 1 Normal Select 1 2 b back 1
52. XSCF firmware m Control and monitoring of the main unit hardware m Monitoring of Solaris OS power on self test POST and OpenBoot PROM m Control and management of system administrator interfaces such as the maintenance terminal m Control of device information m Remote message output for various events Because the XSCF unit is mounted on the motherboard unit it cannot be replaced singly For information on how to replace the motherboard unit see Chapter 6 C 3 C 3 1 Drive The M3000 server consists of up to four hard disk drives and one CD RW DVD RW drive unit Hard Disk Drive TABLE C 5 shows hard disk drive specifications TABLE C 5 Hard Disk Drive Specifications Item Description Number of hard disk drives 4 Interface SAS Location Front of the server Active replacement active addition Yes only when software mirroring is used Hot replacement hot addition Yes Cold replacement cold addition Yes An SAS device has two data ports Because each data port resides in a different SAS domain this enables complete failover redundancy If one path fails communication is performed through the other independent path For the replacement procedure see Chapter 9 Appendix C FRU List C 5 3 2 CD RW DVD RW Drive Unit DVDU TABLE C 6 shows CD RW DVD RW drive unit specifications TABLE C 6 CD RW DVD RW Drive Unit Specifications Item Description Number of CD RW DVD RW drive units 1 Interface
53. acement In cold replacement a FRU is removed and replaced while the power is turned off After the FRU replacement power on the server SPARC Enterprise M3000 Server Service Manual October 2008 4 5 3 4 5 3 1 4 5 3 2 Powering on the Server This section explains how to power on the server Power on by Using the XSCF Command 1 Verify that the server has enough power supply units to operate in the desired configuration 2 Connect all the power cords to AC power outlets 3 Verify that the XSCF STANDBY LED on the operator panel is on 4 Turn the mode switch on the operator panel to the desired mode position Locked or Service 5 A user with platadm or fieldeng authority must log in to the XSCF shell and type the poweron command XSCF gt poweron a Soon the following activity is executed m The POWER LED on the operator panel is turned on m The power on self test POST is executed Then the server is completely powered on Note If automatic startup of the Solaris OS is specified use the sendbreak d domain_id command of the XSCF shell to display the ok prompt after the display console banner is displayed but before the system starts booting the Solaris OS For details see the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF User s Guide Power on by Using the Operator Panel 1 Verify that the server has enough power supply units to operate in the desired configuration 2
54. ands for Checking Hardware Configuration Continued Command Description showboards Displays information on the system board XSB showdc1 Displays the hardware resource configuration information of a domain showfru Displays the setting information of a device The status of each component can be checked based on the On or blinking state of the component LEDs For the component types and LED states see TABLE 2 3 and TABLE 2 5 For details of commands see the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF User s Guide and the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF Reference Manual Checking the Hardware Configuration To check the hardware configuration authority user authority to log in with the XSCF user account to the XSCF is required The following procedure can be used to check the hardware configuration from the maintenance terminal Ask the system administrator for the required information such as the user account and password For details see the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF User s Guide 1 Log in to XSCF shell 2 Type showhardconf XSCF gt showhardconf The showhardconf command displays hardware configuration information For details see the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCE User s Guide Checking the Software and Firmware Configurations The software and firmware configurations and versions affect t
55. ard Disk Drive on page 9 3 m Section 9 2 Removing a Hard Disk Drive on page 9 3 m Section 9 3 Installing a Hard Disk Drive on page 9 5 m Section 9 4 Reassembling the Server on page 9 5 Hard disk drives are active hot replacement components Note Only when the disk mirroring software is used a redundant configuration is created with the hard disk drive Note If a hard disk drive is an unmirrored boot device it must be replaced by using the cold replacement procedure However if a boot device can be disconnected by means of a Solaris OS function or disk mirroring software function active replacement can also be performed The procedure for disconnecting a hard disk drive varies depending on the software being used For details see the manuals for the relevant software The same methods as those for replacement are used for installation of a hard disk drive 9 1 FIGURE 9 1 shows the locations of the hard disk drives FIGURE 9 1 Hard Disk Drive Location 4 Location number Component 1 Hard disk drive HDD 0 2 Hard disk drive HDD 1 3 Hard disk drive HDD 2 4 Hard disk drive HDD 3 9 2 SPARC Enterprise M3000 Server Service Manual October 2008 9 1 Accessing a Hard Disk Drive Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions Section 1 1 ESD Precautions on page 1 1 m
56. atus and turning the mode switch on the operator panel to the Locked position For details see Section 4 5 3 1 Power on by Using the XSCF Command on page 4 13 Note If automatic startup of the Solaris OS is specified use the sendbreak d domain_id command of the XSCF shell to display the ok prompt after the display console banner is displayed but before the system starts booting the Solaris OS 5 Check the hardware This procedure includes the steps of running a program to verify that all the components have been remounted and to start the Solaris OS For details see Section 4 5 4 Verifying the Hardware Operation on page 4 15 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 1 1 CD RW DVD RW Drive Unit DVDU Replacement This chapter explains how to replace the CD RW DVD RW driver unit m Section 11 1 Accessing the CD RW DVD RW Drive Unit on page 11 2 m Section 11 2 Removing the CD RW DVD RW Drive Unit on page 11 3 m Section 11 3 Mounting the CD RW DVD RW Drive Unit on page 11 4 m Section 11 4 Reassembling the Server on page 11 5 CD RW DVD RW drive unit is a cold replacement component The entire server must be powered off and the power cords must be disconnected to replace the CD RW DVD RW drive unit 11 1 FIGURE 11 1 shows the location of the CD RW DVD RW drive unit FIGURE 11 1 Location of the CD RW DVD RW Drive Unit Location number Comp
57. board unit and the operator panel at the same time Otherwise the system may not operate correctly Use the showhardconf command or showstatus command to verify that the replacement unit of the first replaced FRU is fully operational before replacing the other FRU Note When replacing the motherboard unit attach connection destination labels to each of the LAN cable and UPS cable connected to the XSCF unit before removing these cables Note When mounting the motherboard unit connect the LAN cable and UPS cable to the XSCF unit 6 1 Note After the replacement of the motherboard unit is completed the system clock must be reset For details of the setting method see the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF User s Guide Note After the replacement of the motherboard unit is completed the versions of the XCP and Solaris OS must be checked For details of version number checking and other such tasks see the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF User s Guide Because the XSCF unit is mounted on the motherboard unit it cannot be replaced singly For details of the XSCF unit see the Appendix C 2 4 6 2 SPARC Enterprise M3000 Server Service Manual October 2008 FIGURE 6 1 indicates the location of the motherboard unit FIGURE 6 1 Motherboard Unit Location 6 3 Chapter 6 Motherboard Unit Replacement FIGURE 6 2 indicates the locations of DIMMs
58. ce position Chapter 4 FRU Replacement Preparation 4 11 4 5 1 2 4 5 2 4 12 4 A user with platadm or fieldeng authority must log in to the XSCF shell and enter the password command XSCF gt poweroff a The following activity is executed when the poweroff command is used m The Solaris OS shuts down completely m The server is powered off and the server enters standby mode The power to the XSCF unit remains on For details see the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF User s Guide 5 Verify that the POWER LED on the operator panel is off 6 Disconnect all the power cords from the AC power outlets Caution There is a risk of electrical failure if the power cords are not disconnected All the power cords must be disconnected to completely cut the power to the server Power off by Using the Operator Panel 1 Notify users that the server is being powered off 2 Back up the system files and data to tape if necessary 3 Turn the mode switch on the operator panel to the Service position 4 Press the power switch on the operator panel for 4 seconds or more 5 Verify that the POWER LED on the operator panel is off 6 Disconnect all the power cords from the AC power outlets Caution There is a risk of electrical failure if the power cords are not disconnected All the power cords must be disconnected to completely cut the power to the server FRU Removal and Repl
59. chitecture I O backplane Liquid crystal display Light emitting diode Logical system board Large scale integration Media access control address Maintenance bus controller Memory modules Motherboard unit Network time protocol Non volatile random access memory Operator panel Operating system PCI express Power on self test POR PSB PSU PROM RAM SAS SATA SRAM SC TC TOD UE UPC UPS XCP XIR XSB XSCF XSCFU Power on reset Physical system board Power supply unit Programmable read only memory Random access memory Serial attached SCSI Serial ATA Static RAM System controller Throughput computing Time of day Uncorrectable error UPS controller Uninterruptible Power Supply XCSF control package Externally initiated reset eXtended system board eXtended System Control Facility eXtended system control facility unit SPARC Enterprise M3000 Server Service Manual October 2008 Index A air duct 5 4 appearance of the server 2 1 approval standards 2 16 C CD RW DVD RW Drive Unit DVDU 11 1 component A 2 configuration 3 4 D diagnosis 3 2 3 4 3 7 diganosis flowchart 3 2 DIMM 7 1 E error message 3 7 eXtended system control facility XSCF shell 3 4 eXtended System Control Facility XSCF C 5 external interface port GbE interface port 2 15 LAN port 2 14 RCI port 2 14 UPC port 2 14 SAS port 2 15 USB port 2 14 F fan backplane
60. ciex10b5 8533 N A pci 0 600000 pci 0 00 PCIe 0 aa 8533 10b5 3 0 0 okay 4 8 pci pciex10b5 8533 N A pci 0 600000 pci 0 pci 0 00 PCIe 0 aa 8533 10b5 3 1 0 okay 4 4 pci pciex10b5 8533 N A pci 0 600000 pci 0 pci l 00 PCIe 0 aa 8533 10b5 3 2 0 okay 4 4 pci pciex10b5 8533 N A pci 0 600000 pci 0 pci 2 00 PCIe 0 aa 8533 10b5 3 8 0 okay 0 8 pci pciex10b5 8533 N A pci 0 600000 pci 0 pci s 00 PCIe 0 8 58 1000 4 0 0 okay 4 8 scsi pciex1000 58 LSI 1068E pci 0 600000 pci 0 pci 0 scsi 0 00 PCIx 0 b5 103 1166 5 0 0 okay 133 133 pci pciex1166 103 N A pci 0 600000 pci 0 pci 1 pci 0 00 PCI 0 a3 1678 14e4 6 4 0 okay 133 network pcil4e4 1678 N A pci 0 600000 pci 0 pci l pci 0 network 4 3 20 SPARC Enterprise M3000 Server Service Manual October 2008 The prtdiag output continued 00 PCI 0 a3 1678 14e4 6 4 1 okay 133 network pcil4e4 1678 N A pci 0 600000 pci 0 pci l1 pci 0 network 4 1 00 PCIx 0 b5 103 1166 7 0 0 okay 133 133 pci pciex1166 103 N A pci 0 600000 pci 0 pci 2 pci 0 00 PCI 0 a3 1678 14e4 8 4 0 okay 133 network pcil4e4 1678 N A pci 0 600000 pci 0 pci 2 pci 0 network 4 00 PCI 0 a3 1678 14e4 8 4 1 okay 133 network pcil4e4 1678 N A pci 0 600000 pci 0 pci 2 pci 0 network 4 1 00 PCIe 1 aa 8533 10b5 2 0 0 okay 8 8 pci pciex10b5 8533 N A pci 1 700000 pci 0 00 PCIe 1 aa 8533 10b5 3 0 0 okay 0 8 pci pciex10b5 8533 N A pci 1 700000 pci 0 pci 0 00
61. ction 2 4 External Interface Port on Rear Panel on page 2 12 2 1 FIGURE 2 1 FIGURE 2 2 and FIGURE 2 3 are the internal view front view and rear view of the server respectively and they indicate the names and abbreviated names of main components FIGURE 2 1 Server Internal View Fan backplane FANBP_B CPU Memory DIMM XSCF unit XSCFU Fan unit FAN_A DC DC converter DDC Hard disk drive backplane HDDBP Motherboard unit MBU_A Power supply unit PSU CD RW DVD RW drive unit DVDU PCle card PCle 2 2 SPARC Enterprise M3000 Server Service Manual October 2008 FIGURE 2 2 Server Front View CD RW DVD RW drive unit DVDU Operator panel OPNL QP RS 2000 te Fan unit FAN_A Hard disk drive HDD FIGURE 2 3 Server Rear View Power supply unit PSU RCI port Serial port PCle slot USB port LAN port UPS controller SAS port Gigabit Ethernet GbE port UPC port Chapter 2 Hardware Overview 2 3 2 2 2 4 Operator Panel The operator panel has the important function of controlling the power of the server The operator panel is usually locked with a key to prevent the server from being mistakenly powered off during system operation Before starting maintenance work ask the system administrator to unlock the operator panel SPARC Enterprise M3000 Server Service Manual October 2008 2 21 Operato
62. d tiennent et contrdlent toutes deux des droits de propri t intellectuelle relatifs aux produits et technologies d crits dans ce document De m me ces produits technologies et ce document sont prot g s par des lois sur le copyright des brevets d autres lois sur la propri t intellectuelle et des trait s internationaux Les droits de propri t intellectuelle de Sun Microsystems Inc et Fujitsu Limited concernant ces produits ces technologies et ce document comprennent sans que cette liste soit exhaustive un ou plusieurs des brevets d pos s aux Etats Unis et indiqu s l adresse http www sun com patents de m me qu un ou plusieurs brevets ou applications brevet es suppl mentaires aux Etats Unis et dans d autres pays Ce document le produit et les technologies aff rents sont exclusivement distribu s avec des licences qui en restreignent l utilisation la copie la distribution et la d compilation Aucune partie de ce produit de ces technologies ou de ce document ne peut tre reproduite sous quelque forme que ce soit par quelque moyen que ce soit sans l autorisation crite pr alable de Fujitsu Limited et de Sun Microsystems Inc et de leurs ventuels bailleurs de licence Ce document bien qu il vous ait t fourni ne vous conf re aucun droit et aucune licence expresses ou tacites concernant le produit ou la technologie auxquels il se rapporte Par ailleurs il ne contient ni ne repr sente aucun engagement
63. e DIMMs on a motherboard unit are classified into group A and group B see FIGURE 7 1 Caution To replace or install a DIMM do not fail to confirm the DIMM information and comply with the conditions to mount the memory gt B 1 Confirmation of DIMM Information Confirm the DIMM information size rank in the following way m Execute the showhardconf 8 command on XSCFU B 1 The Type field shows the DIMM size and the DIMM rank XSCF gt showhardconf MBU_A Status Normal Ver 0101h Serial PP0829045F FRU Part Number CA07082 D902 A1 541 3302 01 CPU Status Normal Freq 2 520 GHz Type 32 Core 4 Strand 2 Memory_Size 8 GB MEM 0A Status Normal Code ce0000000000000001M3 93T2950EZA CE6 4145 473b3c23 Type 1A Size 1 GB MEM 0B Status Normal Code 7 7 e00000000004aEBE10RD4AUFA 5C E 3020 223b2918 Type 1A Size 1 GB MEM 1A Status Normal Code 7 7 e00000000004aEBE10RD4AUFA 5C E 3020 223b28af Type 1A Size 1 GB MEM 1B Status Normal Code 7 7 e00000000004aEBE10RD4AUFA 5C E 3020 223b28ab Type 1A Size 1 GB FIGURE B 1 shows the explanation of DIMM information FIGURE B 1 Explanation of DIMM Information MEM 0A Status Normal Memory slot No Code ce0000000000000001M3 93T2950EZA CE64145 473b3c23 DIMM size 1 1 GB Type 1A Size 1 GB 2 2 GB DIMM rank
64. e Server 8 5 Replacement and Installation of a Hard Disk Drive HDD 9 1 9 1 Accessing a Hard Disk Drive 9 3 9 2 Removing a Hard Disk Drive 9 3 9 3 Installing a Hard Disk Drive 9 5 9 4 Reassembling the Server 9 5 Replacing the Hard Disk Drive Backplane 10 1 10 1 Accessing the Hard Disk Drive Backplane 10 2 10 2 Removing the Hard Disk Drive Backplane 10 3 10 3 Mounting the Hard Disk Drive Backplane 10 5 10 4 Reassembling the Server 10 6 CD RW DVD RW Drive Unit DVDU Replacement 11 1 11 1 Accessing the CD RW DVD RW Drive Unit 11 2 11 2 Removing the CD RW DVD RW Drive Unit 11 3 113 Mounting the CD RW DVD RW Drive Unit 11 4 11 4 Reassembling the Server 11 5 Power Supply Unit Replacement 12 1 12 1 Accessing a Power Supply Unit 12 3 12 2 Removing the Power Supply Unit 12 3 123 Mounting the Power Supply Unit 12 5 Contents ix 12 4 Reassembling the Server 12 5 13 Fan Unit Replacement 13 1 13 1 Accessing a Fan Unit 13 3 13 2 Removing a Fan Unit 13 3 13 3 Mounting a Fan Unit 13 5 13 4 Reassembling the Server 13 5 14 Fan Backplane Replacement 14 1 141 Accessing the Fan Backplane 14 2 14 2 Removing the Fan Backplane 14 5 143 Mounting the Fan Backplane 14 6 144 Reassembling the Server 14 6 15 Operator Panel Replacement 15 1 15 1 Accessing the Operator Panel 15 3 15 2 Removing the Operator Panel 15 4 15 3 Mounting the Operator Panel 15 5 15 4 Reassembling the Server 15 5 A Components List A 1 B Memory Mounting Condition
65. e Slot on page C 3 m CPU on page C 4 m XSCF Unit on page C 4 Power is supplied from the power supply units to the motherboard unit To remove and replace the motherboard unit power off the server The replacement work is performed from the top of the server Because the CPU and the XSCF unit are connected directly to the motherboard unit neither of them can be replaced individually For the replacement procedure see Chapter 6 Memory DIMM The motherboard unit which includes an ASIC called a memory access controller JSC MAC and provides memory access control has eight memory slots To remove or mount DIMMs power off the server TABLE C 2 shows DIMM specifications C 2 SPARC Enterprise M3000 Server Service Manual October 2008 22 TABLE C 2 DIMM Specifications Item Description Maximum number of DIMMs 8 Location On the motherboard unit Active replacement No Hot replacement No Cold replacement Yes The memory consists of eight DIMMs in units of four The server uses DDR2 SDRAM memory that has the following functions m ECC error protection m Recovery from a memory chip error For the replacement procedure see Chapter 7 PCle Slot The M3000 server provides four PCle slots Up to four low profile type PCle slots can be mounted in these PCle slots A PCle slot has the following functions Supports the eight lane PCI Express bus High speed serial point to point interconnect
66. e redundant configuration the rated current supplied for each power cord is half of the value listed in TABLE C 7 For the replacement procedure see Chapter 12 C 5 Fan Unit The fan units generate an airflow between the inside and outside of the server If an error occurs in one fan unit the XSCF detects the error and the normally operating fan unit starts rotating at a higher speed to cool the server The M3000 server uses the fan units FAN A as its main cooling system TABLE C 8 shows the fan unit specifications TABLE C 8 Fan Unit Specifications Item Description Number of fan units 2 Redundancy 2 units 1 1 redundant configuration Location Front of the server Appendix C FRU List C 7 TABLE C 8 Fan Unit Specifications Continued Item Description Active replacement Yes Hot replacement Yes Cold replacement Yes For the replacement procedure see Chapter 13 C 8 SPARC Enterprise M3000 Server Service Manual October 2008 APPENDIX D External Interface Specifications This appendix explains the specifications of the external interface connectors provided on the M3000 server m Section D 1 Serial Port on page D 2 m Section D 2 UPC Port on page D 2 m Section D 3 USB Port on page D 3 m Section D 4 SAS port on page D 3 m Section D 5 Connection Diagram for Serial Cable on page D 4 D 1 D 1 Serial Port TABLE D 1 Serial Port Pin assignmen
67. ected by the system software m Diagnoses the errors m Initiates predictive self healing activities For example Solaris Fault Manager can disable faulty components m When possible causes the faulty FRU to provide an LED indication of the error in addition to populating system console messages with more details TABLE 3 3 shows typical messages generated when an error occurs Messages are displayed on your console and are recorded in the var adm messages file A message in TABLE 3 3 indicates that the fault has already been diagnosed If there was any corrective action that the system could take the system has already taken it If your server is still running the corrective action continues to be taken TABLE 3 3 Predictive Self Healing Messages Output displayed Description Nov 1 16 30 20 dt88 292 EVENT TIME Tue Nov 1 EVENT TIME The time stamp of the diagnosis 16 30 20 PST 2005 Nov 1 16 30 20 dt88 292 PLATFORM SUNW A70 PLATFORM A description of the server encountering CSN HOSTNAME dt88 292 the error Nov 1 16 30 20 dt88 292 SOURCE eft REV 1 13 SOURCE Information on the Diagnosis Engine used to determine the error Nov 1 16 30 20 dt88 292 EVENT ID afc7e660 d609 EVENT ID The Universally Unique event ID for this 4b2f 86b8 ae7c6b8d50c4 error Nov 1 16 30 20 dt88 292 DESC DESC A basic description of the error Nov 1 16 30 20 dt88 292 A problem was detected in the PCI Express subsystem 3 8 SPARC Enterpri
68. ection 4 5 3 1 Power on by Using the XSCF Command on page 4 13 Note If automatic startup of the Solaris OS is specified use the sendbreak d domain_id command of the XSCF shell to display the ok prompt after the display console banner is displayed but before the system starts booting the Solaris OS 4 Check the hardware This procedure includes the steps for running a program to verify that all the components have been remounted and to start the Solaris OS For details see Section 4 5 4 Verifying the Hardware Operation on page 4 15 Chapter 11 CD RW DVD RW Drive Unit DVDU Replacement 11 5 11 6 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 1 2 Power Supply Unit Replacement This chapter explains how to replace a power supply unit m Section 12 1 Accessing a Power Supply Unit on page 12 3 m Section 12 2 Removing the Power Supply Unit on page 12 3 m Section 12 3 Mounting the Power Supply Unit on page 12 5 m Section 12 4 Reassembling the Server on page 12 5 The power supply unit is an active hot replacement component To guarantee the redundant configuration only one power supply unit must be replaced at a time 12 1 FIGURE 12 1 shows the locations of power supply units FIGURE 12 1 Locations of Power Supply Units Location number Component 1 Power supply unit PSU 0 2 Power supply unit PSU 1 12 2 SPARC Enterprise M3000 Server Serv
69. ed or disabled for each domain using setdomainmode Only Power On is enabled Enabled auto boot Powered on or off Not powered on or off Chapter 2 Hardware Overview 2 7 220 LEDs on the Operator Panel TABLE 2 3 lists the server states displayed with the LEDs on the operator panel The three LED indicators on the operator panel indicate the following m General system status m System error warning m System error location Besides the states listed in TABLE 2 3 the operator panel also displays various states of the server using combinations of the three LEDs TABLE 2 4 indicates the states that are displayed in the course of operation from power on to power off of the server The blinking interval is 1 second 1 Hz TABLE 2 3 LEDs on the Operator Panel Icon Name Color Description POWER LED Green Indicates the server power status D On The power to the server is on e Off The power to the server is off Blinking The server is powered off e XSCF Green Indicates the XSCF unit status O STANDBY e On XSCF unit is functioning normally XSCF LED e Off XSCF unit is stopped e Blinking System initialization is in progress after AC power was turned on CHECK LED Amber Indicates that the server has detected an error This is AN sometimes called a locator e On An error that hinders startup was detected e Off Normal or AC power is not being supplied e Blinking Indicates that the unit is a maintena
70. er supply unit out several centimeters to the rear side Slide the server out from the equipment rack For details see Section 5 1 1 Sliding the Server Out from the Equipment Rack on page 5 1 Loosen the screws securing the cable management arm to the left rear of the server Pull the cable management arm out from the equipment rack Remove the top cover For details see Section 5 2 1 Removing the Top Cover on page 5 3 Remove the PCIe cards For details see Section 8 2 Removing a PCle Card on page 8 4 Remove the air duct For details see Section 5 3 2 Removing the Air Duct on page 5 5 Disconnect all the cables from the motherboard unit Loosen the two screws securing the shutter unit and slide the securing bracket on the power supply unit Chapter 6 Motherboard Unit Replacement 6 5 12 Remove the shutter unit FIGURE 6 3 Removing the Shutter Unit October 2008 SPARC Enterprise M3000 Server Service Manual 6 6 6 2 Removing the Motherboard Unit 1 Remove the four screws securing the motherboard unit FIGURE 6 4 Securing Screw Locations on Motherboard Unit L Screws HH QU
71. erpret error information of the Solaris OS and obtain information on fault management architecture FMA messages access the following web site http www sun com msg In addition see the SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF User s Guide to obtain more information on faults This web site can be used in the event of a Solaris OS error or domain error or to look up specific FMA error messages that do not provide details on XSCF errors This web site provides the message ID displayed by software technical information on the error and corrective action for the error Information on errors and documentation are updated regularly Chapter 3 Troubleshooting 3 7 3 4 1 Predictive self healing is an architecture and methodology for automatically diagnosing reporting and handling software and hardware error conditions This new technology reduces the time required to debug a hardware or software problem and provides the administrator and service engineer with detailed data about each error Predictive Self Healing Tools In the Solaris OS Solaris Fault Manager runs in the background When an error occurs the system software recognizes the error and attempts to determine the faulty hardware component The system software also takes steps to prevent the faulty component from being used until it has been replaced The system software performs the following activities m Receives telemetry information about errors det
72. eruser commands of this type are useful to determine whether there is a problem with the server network or another server connected via the network This section explains the following commands m Using the iostat Command on page 3 18 m Using the prtdiag Command on page 3 19 m Using the prtconf Command on page 3 21 m Using the netstat Command on page 3 24 m Using the ping Command on page 3 25 m Using the ps Command on page 3 26 m Using the prstat Command on page 3 27 Chapter 3 Troubleshooting 3 17 3 6 1 Most of these commands are located in the usr bin directory or usr sbin directory Using the iostat Command The iostat command repeatedly reports terminal drive and I O activity as well as CPU utilization 3 6 1 1 Options TABLE 3 6 lists the options of the iostat command and how those options can help troubleshoot the server TABLE 3 6 Options for iostat Option Description How it can help No option Reports status of local I O devices A quick three line output of device status information c Reports the percentages of time the system has Quick report of CPU status spent in user mode in system mode waiting for I O and idling e Displays device error summary statistics Provides a short table with accumulated Displays the total number of errors hardware errors Identifies suspect I O devices errors software errors and transfer errors E Displays all device
73. f any The furnishing of this document to you does not give you any rights or licenses express or implied with respect to the product or technology to which it pertains and this document does not contain or represent any commitment of any kind on the part of Fujitsu Limited or Sun Microsystems Inc or any affiliate of either of them This document and the product and technology described in this document may incorporate third party intellectual property copyrighted by and or licensed from suppliers to Fujitsu Limited and or Sun Microsystems Inc including software and font technology Per the terms of the GPL or LGPL a copy of the source code governed by the GPL or LGPL as applicable is available upon request by the End User Please contact Fujitsu Limited or Sun Microsystems Inc This distribution may include materials developed by third parties Parts of the product may be derived from Berkeley BSD systems licensed from the University of California UNIX is a registered trademark in the U S and in other countries exclusively licensed through X Open Company Ltd Sun Sun Microsystems the Sun logo Java Netra Solaris Sun Ray Answerbook2 docs sun com OpenBoot and Sun Fire are trademarks or registered trademarks of Sun Microsystems Inc or its subsidiaries in the U S and other countries Fujitsu and the Fujitsu logo are registered trademarks of Fujitsu Limited All SPARC trademarks are used under license and are registered t
74. f time stamps The information with the oldest time stamp is displayed first The showlogs command displays the following logs m Error log m Power log n Event log m Temperature and humidity record m Monitoring message log m Console message log m Panic message log m IPL message log 3 12 SPARC Enterprise M3000 Server Service Manual October 2008 XSCF gt showlogs error Date Jun 17 Status FRU PS 11 05 32 JST 2008 Code 80000000 c3 0000 0173000600000000 Alarm Occurred Jun 17 11 05 32 522 JST 2008 U 1 Msg PSU shortage Date Jun 17 Status 13 41 46 JST 2008 Code 80002080 7801c201 0130000000000000 Alarm Occurred Jun 17 13 41 44 861 JST 2008 FRU MBU_A Msg Board control error MBC link error Date Jun 17 13 46 31 JST 2008 Code 60000000 cd01c701 0164010100000000 Warning Occurred Jun 17 13 46 31 158 JST 2008 FRU OPNL FANBP_B Status Msg TWI XSCF gt access error 3 5 3 Using the showstatus Command The showstatus command displays information about faulty or degraded units that are among the FRUs composing the server and information on the units on the layers immediately above the layers of the faulty or degraded units For each of the displayed units an asterisk indicating that the unit is faulty is displayed with any of the following status indicators which is displayed after Status m Normal Normal state m Faulted The unit is faulty and is not operati
75. g maintenance can be prevented Power button Used to control the server power Power on and power off are controlled by pressing this button in different patterns as described below Holding down the button Regardless of the mode switch setting the server is powered for a short time on less than 4 seconds At this time processing for waiting for facility air conditioners power on and warm up completion if set in the XSCE is skipped Holding down the button If power to the server is on OS shutdown processing is for a long time in Service executed for all domains before the system is powered off mode If the server is being powered on the power on processing 4 seconds or longer is cancelled and the server is powered off If the server is being powered off the operation of the power button is ignored and the power off processing is continued In normal operation the server is powered on only when the computer room environmental conditions satisfy the specified values Then the server remains in the reset state until the operating system is booted 2 6 SPARC Enterprise M3000 Server Service Manual October 2008 TABLE 2 2 shows the function of the mode switch TABLE 2 22 Mode Switch Function Function Mode switch Locked Service Break signal reception Power On Off by power button Power On Off by automatic power control Enabled Reception of the Disabled Break signal can be enabl
76. guration and component status information Confirm that the recovered state after maintenance is the same as that before maintenance If an error occurs in the server one of the following messages is displayed m Solaris Operating System message file m XSCF shell showhardconf 8 command and showstatus 8 command m Management console m Service processor log 3 9 1 Checking the Hardware Configuration and FRU Status To replace a faulty FRU and perform the maintenance on the server it is important to check and understand the hardware configuration of the server and the state of each hardware component The hardware configuration refers to information that indicates to which layer a hardware component belongs The status of each hardware component refers to information on the conditions of a standard or optional component in the server temperature power supply voltage CPU operating conditions and other status information To check the hardware configuration and the status of each hardware component use XSCF shell commands from the maintenance terminal See TABLE 3 1 for the commands used TABLE 3 1 Commands for Checking Hardware Configuration Command Description showhardconf Displays hardware configuration showstatus Displays the status of a component This command is used only when a faulty component is checked 3 4 SPARC Enterprise M3000 Server Service Manual October 2008 3 3 1 1 Ahd TABLE 3 1 Comm
77. hat shown elsewhere in this document This is normal 3 6 2 1 Options TABLE 3 7 lists the options of the prtdiag command and how those options can help troubleshooting TABLE 3 7 Options for prtdiag Option Description How it can help No option Lists components Shows CPU timing and PCIe cards installed v Verbose mode Displays the time of the most Provides the same information as no option recent AC power failure the most recent Additionally displays fan unit status hardware fatal error information and if temperatures and ASIC and PROM revisions necessary the environmental status Chapter 3 Troubleshooting 3 19 The following example shows output for the prtdiag command in verbose mode prtdiag v System Configuration Sun Microsystems sun4u SPARC Enterprise M3000 Server System clock frequency 1064 MHz Memory size 7808 Megabytes SSS gt gt S gt gt S gt gt gt CPUJS EE gt e CPU CPU Run L2 CPU CPU LSB Chip ID MHz MB Impl Mask 00 0 0 17 2 8 4 Deby 7 2520 5 0 7 145 Memory Configuration Memory Available Memory DIMM of Mirror Interleave LSB Group Size Status Size DIMMs Mode Factor 00 A 4096MB okay 1024MB 4 no 2 way 00 B 3712MB okay 1024MB 4 no 2 way IO Lane Frq LSB Type LPID RvID DvID VnID BDF State Act Max Name Model Logical Path 00 PCIe 0 aa 8533 10b5 2 0 0 okay 8 8 pci p
78. he HDD goes off Note If a hard disk drive is an unmirrored boot device it must be replaced by using the cold replacement procedure However if a boot device can be disconnected by means of a Solaris OS function or disk mirroring software function active replacement can also be performed 3 Type the cfgadm c command to disconnect the component from the Solaris OS cfgadm c unconfigure Ap Id Chapter 4 FRU Replacement Preparation 4 5 4 Type the cfgadm x command to confirm that the CHECK LED blinks cfgadm x led fault mode blink Ap i The Ap Id is shown in the output of cfgadm for example disk 0 The CHECK LED amber of the HDD blinks 5 Type the cfgadm command to verify that the component has been disconnected cfgadm a The disconnected component is displayed as being unconfigured 4 3 2 FRU Removal and Replacement After the disconnection of a FRU from a domain the same procedure as that for Hot Replacement Hot Addition applies See Section 4 4 Hot Replacement Hot Addition on page 4 7 4 3 3 Configuring a FRU in a Domain This section explains the procedure for active replacement installation by using Solaris OS commands For information on using the XSCF command see Section 4 4 Hot Replacement Hot Addition on page 4 7 1 Type the cfgadm c command from the Solaris OS to integrate the component into the Solaris OS cfgadm c configure Ap Id 2
79. he operation of the server To change a configuration or investigate a problem check the latest information and check for any problems in the software Chapter 3 Troubleshooting 3 5 Because the types and versions of installed software and firmware vary depending on the system they need to be checked m The software configuration and the version can be checked in the Solaris OS For details see documents of the Solaris OS To check software and firmware configuration information you can use the commands listed in TABLE 3 2 from the maintenance terminal if you are granted user authority by the system administrator To check the firmware configuration and version use XSCF shell commands from the maintenance terminal TABLE 3 2 Commands for Checking the Software and Firmware Configurations Command Description showrev Displays system configuration information and Solaris OS patch information uname Outputs current system information version Outputs current firmware version information showhardconf Outputs information on the components mounted on the server showstatus Displays the status of a component This command is used only when a faulty component is checked showboards Displays XSB information It can display information on an XSB that belongs to the specified domain and information on all XSBs mounted An XSB combines hardware resources on physical system boards The M3000 server consists of a single physical system board
80. hich are required for maintenance and management are shown on the system faceplate label m The standards label is affixed close to the system faceplate label and shows the approval standards a Safety NRTL C a Radio wave VCCI A FCC A DOC A MIC a Safety and radio wave CE The card should be instered in such a way that the standards label faces the outside of the server and the system faceplate label faces the inside of the server FIGURE 2 6 Label Locations 2 16 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 3 Troubleshooting This chapter provides fault diagnosis information Section 3 1 Emergency Power Off on page 3 1 Section 3 2 Failure Diagnostic Method on page 3 2 Section 3 3 Checking the Server and System Configuration on page 3 4 Section 3 4 Error Conditions on page 3 7 Section 3 5 Using Troubleshooting Commands on page 3 10 Section 3 6 General Solaris Troubleshooting Commands on page 3 17 9 1 Emergency Power Off This section explains how to power off in an emergency Caution In an emergency such as smoke or flames coming from the server immediately stop using the server and turn off the power supply Regardless of the type of business give top priority to fire prevention measures 1 Press the power switch for more than 4 seconds to power off the server 3 1 FIGURE 3 1 2 Remove the AC power cord clamp a
81. ialization starts A immediately after power on This state does not indicate an error Blinking Indicates that an error requiring the replacement of the HDD amber occurred Off Indicates that the HDD is in the normal state READY On green Indicates that the HDD is operating The HDD cannot be removed cannot be replaced Blinking Indicates that the HDD is performing communication green The HDD cannot be removed cannot be replaced Off The HDD can be replaced Power supply unit DC On green Indicates that power is turned on and being supplied PSU AC On green Indicates that power is being supplied to the power supply unit but the power to the server has not been turned on Off Indicates that power is not being supplied to the PSU CHECK On amber Indicates that an error occurred in the PSU Blinking Indicates that an error requiring the replacement of the power AN amber supply unit occurred Off Indicates that the PSU is in the normal state Fan unit FAN_A CHECK On amber Indicates that an error occurred in the fan unit Blinking Indicates that an error requiring the replacement of the fan AN amber unit occurred Off Indicates that the fan unit is in the normal state Chapter 2 Hardware Overview 2 11 TABLE 2 5 Component LEDs and Their Functions Continued Component Name Color Description LAN port display ACTIVE On green Indicates that communica
82. ice Manual October 2008 12 1 Accessing a Power Supply Unit Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions See Section 1 1 ESD Precautions on page 1 1 1 From the XSCF shell prompt use the replacefru command to disable the power supply unit to be removed XSCF gt replacefru The replacefru command is a menu driven interactive command The replacefru continues operating while the power supply unit is removed Afterwards it performs the power supply unit test For details see Section 4 4 1 FRU Removal and Replacement on page 4 8 2 Loosen the thumbscrew of the cable management arm that is located on the right side when you are facing the rear of the server 3 Pull out the cable management arm while keeping it from coming into contact with the power supply unit 12 2 Removing the Power Supply Unit 1 Verify that the CHECK LED of the power supply unit is blinking and the DC LED is off 2 Release the lock of the AC cable clamp 1 in FIGURE 12 2 3 Remove the power cord from the power supply unit 2 in FIGURE 12 2 Chapter 12 Power Supply Unit Replacement 12 3 4 While pressing the lever 3 in FIGURE 12 2 pull the handle outward 4 in FIGURE 12 2 FIGURE 12 2 Removing the Power Supply Unit A S amp Geesssses OS Handle Lever in 5 With o
83. ign and a sequential number are added to each of their names to differentiate between them FIGURE A 1 shows the locations of the M3000 server components FIGURE A 1 Location of Components A 2 SPARC Enterprise M3000 Server Service Manual October 2008 Location number Component Location number Component 1 Top cover 2 Fan cover 3 Air duct 4 Memory MEM 00A 5 Memory MEM 00B 6 Memory MEM 01A 7 Memory MEM 01B 8 CPU CPU 0 9 Memory MEM 03B 10 Memory MEM 03A 11 Memory MEM 02B 12 Memory MEM 02A 13 Fan backplane FANBP_B 14 Fan unit FAN_A 1 15 Fan unit FAN A 0 16 Fan cage 17 Operator panel OPNL 18 Front panel 19 CD RW DVD RW drive unit DVDU 20 Hard disk drive HDD 0 21 Hard disk drive HDD 1 22 Hard disk drive HDD 2 23 Hard disk drive HDD 3 24 Hard disk drive backplane HDDBP 0 25 Chassis 26 Shutter unit 27 Power supply unit PSU 0 28 Power supply unit PSU 1 29 Rear panel 30 Motherboard unit MBU_A 31 PCle card PCIe 0 32 PCIe card PCIe 1 33 PCIe card PCIe 2 34 PCle card PCIe 3 35 XSCF unit Appendix A Components List A 3 A 4 SPARC Enterprise M3000 Server Service Manual October 2008 APPENDIX B Memory Mounting Conditions This appendix explains the memory DIMM mounting conditions m Section B 1 Confirmation of DIMM Information on page B 1 m Section B 2 Memory Mounting Conditions on page B 3 Th
84. ing a time stamp UUID and message ID m The second line is a declaration of the certainty of diagnosis In this case we are 100 percent sure the failure is in the ASIC described If the diagnosis may involve multiple components you may see two lines here with 50 in each of the two lines m The FRU line indicates what component must be replaced to return the server to a fully operational state m The rsrc line indicates the component that has become unusable because of this error SPARC Enterprise M3000 Server Service Manual October 2008 3 5 4 2 3 5 5 3 5 5 1 fmdump e Command To get information of the errors that caused this failure you can use the e option as shown in the following example fmdump e TIME CLASS Nov 02 10 04 14 3008 ereport io fire jbc mb_per Using the fmadm Command Using the fmadm faulty Command The fmadm faulty command can be used by administrators and service personnel to view and modify system configuration parameters that are maintained by the Solaris fault manager The command is primarily used to determine the status of a component involved in a fault as shown in the following example fmadm faulty STATERESOURCE UUID degraded dev pci le 600000 0ee65618 2218 4997 c0dc b5c410ed8ec2 fmadm repair 0ee65618 2218 4997 c0dc b5c410ed8ec2 The PCle slot has been degraded and it is associated with the same UUID as above Also the faulted status ma
85. ing the power cord For details see Section 4 5 1 Powering off the Server on page 4 11 2 Slide the server out from the equipment rack For details see Section 5 1 1 Sliding the Server Out from the Equipment Rack on page 5 1 Note To prevent the equipment rack from tipping over you must deploy the antitilt feature if applicable before you slide the server out of the equipment rack Note When the cable management arm is not mounted remove the cable ties that fix the PCI cable to the rear of the server and slide the server out 3 Remove the top cover For details see Section 5 2 1 Removing the Top Cover on page 5 3 4 Remove the air duct from the server For details see Section 5 3 2 Removing the Air Duct on page 5 5 10 2 Removing the Hard Disk Drive Backplane 1 Pull out all the hard disk drives including HDD filler panels from the server by several centimeters 1 in FIGURE 10 2 Chapter 10 Replacing the Hard Disk Drive Backplane 10 3 2 Remove the two cables from the rear of the hard disk drive backplane 2 in FIGURE 10 2 FIGURE 10 2 Removing the Cables of the Hard Disk Drive Backplane RW drive unit 3 Remove the cables connected to the CD RW DVD 4 Pull the CD RW DVD RW drive unit out by several centimeters on RW DVD RW Drive Unit Removing the CD For details see Section 11 2 page 11 3 SPAR
86. interface BUI Through CLI or BUI the user or system administrator monitors the server displays status operates domains and displays information on the console By connecting an uninterruptible power supply UPS unit that has the UPS controller UPC interface stable power supply is provided in the event of a failure in the power supply or even a large scale power failure If a single power feed is used connect a UPS cable to UPC port 0 In a dual power feed connect UPS cables to UPC ports 0 and 1 2 14 SPARC Enterprise M3000 Server Service Manual October 2008 TABLE 2 6 External Interface Port Functions Continued Location number Component Description 8 GbE port 0 for OS I Up to 4 100Base TX 1000Base T cables can be connected to GbE ports High capacity data can be transferred at a high GLAN 0 speed 9 GbE port 1 for OS is _ 10 GbE port 2 for OS E Eia 11 GbE port 3 for OS is _ 12 SAS port Accommodates external Serial Attached SCSI SAS devices such as a tape drive Chapter 2 Hardware Overview 2 15 25 Labels This section explains the labels affixed to the server A label affixed card that can be inserted or extracted is provided near the power supply unit at the right side at the rear of the server see TABLE 2 6 The information on the label might differ from that shown on the affixed labels m The model number serial number and hardware version all of w
87. ion 13 3 Mounting a Fan Unit on page 13 5 m Section 13 4 Reassembling the Server on page 13 5 A fan unit is an active hot replacement component To guarantee the redundant configuration only one fan unit must be replaced at a time 13 1 FIGURE 13 1 indicates the location of the fan unit FIGURE 13 1 Fan Unit Locations Location number Component 1 Fan unit FAN_A 0 2 Fan unit FAN_A 1 13 2 SPARC Enterprise M3000 Server Service Manual October 2008 13 1 Accessing a Fan Unit Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions See Section 1 1 ESD Precautions on page 1 1 1 From the XSCF shell prompt use the replacefru command to disable the fan unit to be removed XSCF gt replacefru The replacefru command is a menu driven interactive command The replacefru continues operating while the fan unit is removed Then later it performs the fan unit test For details see Section 4 4 1 FRU Removal and Replacement on page 4 8 Caution To prevent the equipment rack from tipping over you must deploy the antitilt feature if applicable before you slide the server out of the equipment rack Note When the cable management arm is not mounted remove the cable ties that fix the PCI cable to the rear of the server and slide the server out 2 Slide the server out from
88. laced in cold replacement mode The 5 4 SPARC Enterprise M3000 Server Service Manual October 2008 5 3 1 Before the air duct is removed the top cover must be removed For details see Section 5 2 Removing and Attaching the Top Cover on page 5 3 Accessing the Air Duct 5 0 2 Removing the Air Duct m Hold the air duct and lift it upwards FIGURE 5 3 Removing the Air Duct FI 5 5 Chapter 5 Internal Components Access 5 3 3 Attaching the Air Duct m Set the tab at the front of the air duct in place and then lower the air duct FIGURE 5 4 Prevent cables from interfering each other FIGURE 5 4 Attaching the Air Duct 5 6 SPARC Enterprise M3000 Server Service Manual October 2008 Removing and Attaching the Fan Cover 5 4 Removing the Fan Cover 5 4 1 the server must be pulled out from the 7 Caution Before the fan cover is removed equipment rack For the procedure for pulling the server out from the equipment rack see Section 5 1 Sliding the Server Into and Out of the Equipment Rack on page 5 1 ight of the fan cover Loosen one screw at the r 1 2 Raise the right end of the fan cover and remove it FIGURE 5 5 22222222222220 22222222222222 22222220222220 DODDODDDDODDDDE 22222222222229 22222222222220 Y DODDDDDDDDDDDDE 222222222220229 2222202222202229 2D0000900000000 2220202022222200 22222222292209 2222222022222200 22222222222220 222222
89. lacement 13 5 13 6 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 1 4 Fan Backplane Replacement This chapter explains how to replace the fan backplane m Section 14 1 Accessing the Fan Backplane on page 14 2 m Section 14 2 Removing the Fan Backplane on page 14 5 m Section 14 3 Mounting the Fan Backplane on page 14 6 m Section 14 4 Reassembling the Server on page 14 6 The fan backplane is a cold replacement component The entire server must be powered off and the power cords must be disconnected to replace the fan backplane 14 1 FIGURE 14 1 shows the location of the fan backplane FIGURE 14 1 Fan Backplane Location Location number Component 1 Fan backplane FANBP_B 14 1 Accessing the Fan Backplane disconnected All the power cords must be disconnected to completely cut the power Caution There is a risk of electrical failure if the power cords are not to the server 14 2 SPARC Enterprise M3000 Server Service Manual October 2008 Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions See Section 1 1 ESD Precautions on page 1 1 1 Power off the server This procedure includes the steps of turning the mode switch on the operator panel to the Service position verifying that the POWER LED is off and disconnecting the power cord For details see
90. llowing commands U U U Y Y TJ sing the showhardconf Command on page 3 11 sing the showlogs Command on page 3 12 sing the showstatus Command on page 3 13 sing the fmdump Command on page 3 14 sing the fmdump Command on page 3 14 sing the fmstat Command on page 3 17 SPARC Enterprise M3000 Server Service Manual October 2008 3 5 1 Using the showhardconf Command The showhardconf command displays information on each FRU The following information is displayed m Current configuration and status m Number of mounted units m Domain information m Name properties of the PCle card XSCF gt showhardconf SPARC Enterprise M3000 Serial IKK0813023 Operator_Panel_Switch Locked Power Supply System Single SCF ID XSCF 0 System Power On System Phase Cabinet Power On Domain 0 Domain Status OpenBoot Execution Completed MBU_A Status Normal Ver 0101h Serial PP0829045F_ FRU Part Number CA07082 D902 A1 541 3302 01 CPU Status Normal Freq 2 520 GHz Type 32 Core 4 Strand 2 Memory_Size 8 GB MEM 0A Status Normal Code ce0000000000000001M3 93T2950EZA CE6 4145 473b3c23 Type 1A Size 1 GB MEM 0B Status Normal Code 7 7 fe00000000004aEBE1ORD4ATFA 5C E 3020 223b2918 Type 1A Size 1 GB MEM 1A Status Normal Code 7 7 fe00000000004aEBE1ORD4ATFA 5C E 3020 223b28af Type 1A Size 1 GB MEM 1B Status Normal Code 7 7 fe00000000004aEBE1ORD
91. m the socket 3 Place the DIMM on a conductive mat FIGURE 7 2 Removing the DIMMs 7 4 SPARC Enterprise M3000 Server Service Manual October 2008 7 3 Installing the DIMMs 1 Push the DIMM evenly into the DIMM socket 2 Push the DIMM eject levers inward to secure the DIMM in position When mounting DIMMs align the indentation with the corresponding connector part 7 4 Reassembling the Server 1 Attach the air duct For details see Section 5 3 3 Attaching the Air Duct on page 5 6 2 Attach the top cover For details see Section 5 2 2 Attaching the Top Cover on page 5 4 3 Slide the server into the equipment rack and secure the stabilizer bar For details see Section 5 1 2 Sliding the Server into the Equipment Rack on page 5 3 4 Power on the server This procedure includes the steps of reconnecting the power cord checking the LED status and turning the mode switch on the operator panel to the Locked position For details see Section 4 5 3 1 Power on by Using the XSCF Command on page 4 13 Note If automatic startup of the Solaris OS is specified use the sendbreak d domain_id command of the XSCF shell to display the ok prompt after the display console banner is displayed but before the system starts booting the Solaris OS 5 Check the hardware This procedure includes the steps for running a program to verify that all the components have been remounted and
92. mand 4 11 45 12 Power off by Using the Operator Panel 4 12 Contents vii 4 5 2 FRU Removal and Replacement 4 12 4 5 3 Powering on the Server 4 13 4 5 3 1 Power on by Using the XSCF Command 4 13 4 5 3 2 Power on by Using the Operator Panel 4 13 4 5 4 Verifying the Hardware Operation 4 15 5 Internal Components Access 5 1 5 1 Sliding the Server Into and Out of the Equipment Rack 5 1 5 1 1 Sliding the Server Out from the Equipment Rack 5 1 5 1 2 Sliding the Server into the Equipment Rack 5 3 5 2 Removing and Attaching the Top Cover 5 3 5 2 1 Removing the Top Cover 5 3 5 22 Attaching the Top Cover 5 4 5 3 Removing and Attaching the Air Duct 5 4 5 3 1 Accessing the Air Duct 5 5 5 3 2 Removing the Air Duct 5 5 5 3 3 Attaching the Air Duct 5 6 5 4 Removing and Attaching the Fan Cover 5 7 5 41 Removing the Fan Cover 5 7 5 42 Attaching the Fan Cover 5 8 6 Motherboard Unit Replacement 6 1 6 1 Accessing the Motherboard Unit 6 4 6 2 Removing the Motherboard Unit 6 7 6 3 Mounting the Motherboard Unit 6 8 6 4 Reassembling the Server 6 9 7 Replacement and Installation of Memory 7 1 7 1 Accessing the DIMMs 7 3 7 2 Removing the DIMMs 7 4 viii SPARC Enterprise M3000 Server Service Manual October 2008 10 11 12 7 3 Installing the DIMMs 7 5 74 Reassembling the Server 7 5 Replacement and Installation of PCIe Cards 8 1 8 1 Accessing a PCIe Card 8 3 82 Removing a PCIe Card 8 4 8 3 Mounting a PCIe Card 8 4 8 4 Reassembling th
93. mber CA07082 D901 A1 CPU Status Normal Freq 2 520 GHz Type 32 Core 4 Strand 2 Memory_Size 8 GB PCI PCI PCI PCI WN H MEM 0A Sta Code Type 1 MEM 0B Sta Code Type 1 MEM 1A Sta Code Type 1 MEM 1B Sta Code Type 1 MEM 2A Sta Code Type 1 MEM 2B Sta Code Type 1 MEM 3A Sta Code Type 1 MEM 3B Sta Code Type Name Name Name Name tus Normal A Size 1 GB tus Normal 7 7 e00000000004aEBE10RD4AUFA 5C A Size 1 GB tus Normal A Size 1 GB tus Normal A Size 1 GB tus Normal A Size 1 GB tus Normal A Size 1 GB tus Normal 7 7 e00000000004aEBE10RD4AUFA 5C A Size 1 GB tus Normal 7 7 e00000000004aEBE10RD4AUFA 5C 1A Size 1 GB _ Property fibre channel Card Type _ Property fibre channel Card Type _Property pci Card_Type Other _Property pci Card_Type Other ti 7 7 e00000000004aEBE1 0RD4AJFA 5C 1 T 7 e00000000004aEBE10RD4AJFA 5C 7T 7 e00000000004aEBE1 0RD4AJFA 5C 7T 7 e00000000004aEBE1 0RD4AJFA 5C t Operator_Panel_Switch Locked ce0000000000000001M3 93T2950EZA CE6 Gl Gl Gl Gl Gp E Domain 0 Domain_Status OpenBoot Execution Completed 1 541 3302 01 4145 473b3c23 3020 223b2918 3020 223b28af
94. n number Limits the number of output lines Limits the amount of data displayed and displays processes consuming many resources s key Enables the sorting of list contents by key Useful keys are cpu default time and size parameter v Verbose mode Displays additional parameters The following example shows output for the prstat command prstat n 5 s size PID USERNAME SIZE RSS STATE PRI NICE TIME CPU PROCESS NLWP 100463 root 66M 61M sleep 59 O 0 01 03 0 0 fmd 19 100006 root 11M 9392K sleep 59 0 0 00 09 0 0 svc configd 16 100004 root 10M 8832K sleep 59 0 0 00 04 0 0 svc startd 14 100061 root 9440K 6624K sleep 59 0 0 00 01 0 0 snmpd 1 100132 root 8616K 5368K sleep 59 O 0 00 04 0 0 nscd 35 Total 52 processes 188 lwps load averages 0 00 0 00 0 00 3 28 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 4 FRU Replacement Preparation This chapter explains the method of preparing for the safe replacement of FRUs m Section 4 1 Tools Required for Maintenance on page 4 1 m Section 4 2 FRU Replacement and Installation Methods on page 4 2 m Section 4 3 Active Replacement Active Addition on page 4 5 m Section 4 4 Hot Replacement Hot Addition on page 4 7 m Section 4 5 Cold Replacement Cold Addition on page 4 11 4 1 Tools Required for Maintenance The actual maintenance work described in Chapter 5 to Chapter 15 requires maintenance software to
95. n page 15 5 m Section 15 4 Reassembling the Server on page 15 5 The operator panel is a cold replacement component The entire server must be powered off and the power cords must be disconnected to replace the operator panel Caution If the motherboard unit and operator panel are replaced at the same time the server may fail to operate correctly Before starting the replacement of the next FRU execute the showhardconf or showstatus command to verify that the component replaced earlier is operating correctly 15 1 FIGURE 15 1 shows the location of the operator panel FIGURE 15 1 Operator Panel Location Location number Component 1 Operator panel OPNL 15 2 SPARC Enterprise M3000 Server Service Manual October 2008 15 1 Accessing the Operator Panel Caution There is a risk of electrical failure if the power cords are not disconnected All the power cords must be disconnected to completely cut the power to the server Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions See Section 1 1 ESD Precautions on page 1 1 1 Power off the server This procedure includes the steps of turning the mode switch on the operator panel to the Service position verifying that the POWER LED is off and disconnecting the power cord For details see Section 4 5 1 Powering off the Server on page 4 11
96. n the driver for the device is corrupt or missing D Similar to the output of no option but device Lists the drivers needed or used by the driver names are listed operating system to enable the device p Similar to the output of no option yet is Provides a brief list of the devices abbreviated V Displays the version and date of the Useful for a quick check of the firmware OpenBoot PROM firmware version The following example shows output for the prtconf command prtconf System Configuration Memory size 7616 Megabytes System Peripherals SUNW SPARC Enterprise scsi_vhci instance 0 packages SUNW builtin drivers disk label obp tftp ufs file system chosen openprom client services virtual memory pseudo console 3 22 Sun Microsystems SUNW probe error handler sun4u Software Nodes driver not attached driver not attached driver not attached deblocker driver not attached driver not attached terminal emulator driver not attached driver not attached driver not attached driver not attached driver not attached driver not attached options instance 0 aliases driver not attached memory driver not attached driver not attached instance 0 SPARC Enterprise M3000 Server Service Manual October 2008 The prtconf output continued nvram driver not attached pseudo mc instance 0 cmp driver not attached core driver not attached cpu
97. nal interfaces Appendix E explains the UPS controller UPC that controls the uninterruptible power supply UPS unit Abbreviations Index xiv SPARC Enterprise M3000 Server Service Manual October 2008 Related Documentation The documents listed as online are available at http docs sun com app docs For late breaking information about hardware software or documentation for the Sun SPARC Enterprise M3000 server refer to the Sun SPARC Enterprise M3000 Server Product Notes Application Overview Planning Safety Compliance Getting Started Installation Software Administration Software Administration Glossary Hardware Software Product Notes Title Sun SPARC Enterprise M3000 Server Overview Guide Sun SPARC Enterprise M3000 Site Planning Guide Sun Fire M3000 Server Safety and Compliance Sun SPARC Enterprise M3000 Server Getting Started Guide Sun SPARC Enterprise M3000 Server Installation Guide Sun SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers Administration Guide Sun SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers XSCF User s Guide Sun SPARC Enterprise M3000 M4000 M5000 M8000 M9000 Servers Glossary Sun SPARC Enterprise M3000 Server Product Notes Format PDF HTML PDF HTML Hard Copy PDF HTML Printed PDF PDF PDF HTML PDF HTML PDF HTML PDF HTML Location Online Online Shipping kit Online Shipping kit Online
98. nce target In service mode break signals can be suppressed If the key position is switched to Service the server will boot into service mode the next time it reboots Service is selected by default at the initial power on 2 8 SPARC Enterprise M3000 Server Service Manual October 2008 TABLE 2 4 State Display by Combination of LEDs on the Operator Panel Name Description POWER XSCF STANDBY CHECK A Off Off Off AC power is not being supplied Off Off On AC power has been turned on Off Blinking Off The XSCF unit is being initialized Off Blinking On An error occurred in the XSCF unit Off On Off The XSCF unit is in the standby state The server is waiting for power on of the air conditioning facilities in the computer room On On Off Warm up standby processing is in progress power is turned on after the end of processing The power on sequence is in progress The server is in operation Blinking On Off The power off sequence is in progress The fan units are stopped after the end of processing READY LED is referred to when the XSCF unit status is indicated Chapter 2 Hardware Overview 2 9 2 3 LED Functions of Components This section explains the LEDs of each component When replacing a FRU check in advance the states of LEDs Normal system state can be confirmed by checking the operator panel If an error occurs in an individual hardware component in the server the LEDs of the FRU con
99. nd disconnect the cable Power off Method amp DSSSSSBB OS Or 3 2 Failure Diagnostic Method When an error occurs a message is displayed on the maintenance monitor in many cases Use the flowchart in FIGURE 3 2 to find the correct methods for diagnosing failures SPARC Enterprise M3000 Server Service Manual October 2008 FIGURE 3 2 Diagnostic Method Flowchart error message OS panic or performance Is the power OK or AC OK LED off The XSCF mail function sent an E mail its connection Check the power supply unit and YES and XSCF console Check var adm messages in the Solaris OS FMA message YES Vv Execute fmadm to display fault information an the message NO Check whether an error message is displayed on the OS console The XSCF console displays an error message 4 Execute showlogs or fmadm in the XSCF to display fault information y ID be used YES y Enter the message ID in http sun com msg to refer to fault information Make a memo of the displayed fault information Contact your service engineer Chapter 3 Troubleshooting 3 3 3 3 Checking the Server and System Configuration The operating conditions must remain the same before and after maintenance If an error occurs in the server save the system confi
100. ne hand supporting the bottom of the power supply remove the power supply unit Power cord 6 Place the power supply unit on a conductive mat 12 4 SPARC Enterprise M3000 Server Service Manual October 2008 123 Mounting the Power Supply Unit Caution Do not forcibly push the power supply unit into the slot Forcibly pushing the component in a slot may damage the component or the server 1 Align the top of the unit with the top of the slot Doing so prevents the bottom of the slot from cutting the power cord at the bottom of the unit 2 Set the handle so that it is perpendicular to the server and then push the power supply unit into the slot 3 Insert the power supply unit until it reaches the predetermined position and then return the handle to its original position 4 Secure the power cord with the AC cable clamp 12 4 Reassembling the Server m From the XSCF shell prompt use the showhardconf command to verify that the new power supply unit has been installed XSCF gt showhardconf For details see Section 4 4 2 Verifying the Hardware Operation on page 4 10 Chapter 12 Power Supply Unit Replacement 12 5 12 6 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 1 3 Fan Unit Replacement This chapter explains how to replace a fan unit m Section 13 1 Accessing a Fan Unit on page 13 3 m Section 13 2 Removing a Fan Unit on page 13 3 m Sect
101. network host Depending on how the ping command is configured troublesome network links or nodes can be identified from the displayed output The destination host is specified in the variable hostname 3 6 5 1 Options TABLE 3 10 lists the options of the ping command and how those options can help troubleshooting TABLE 3 10 Options for ping Option Description How it can help hostname The probe packet is sent to hostname and Verifies that a host is active on the network returned g hostname Forcibly routes the probe packet through a By sending the probe packet through different specified gateway routes to the target host individual routes can be tested for quality i interface Specifies through which interface to send and Enables a simple check of secondary network receive the probe packet interfaces Chapter 3 Troubleshooting 3 25 TABLE 3 10 Options for ping Continued Option Description How it can help n Replaces host names with IP addresses and displays them s Continues to repeat ping at intervals of 1 second Pressing CTRL C stops the execution After it is stopped statistics are displayed svR Displays the route the probe packet followed in 1 second intervals Used when an IP address is more useful than a host name Helps identify intermittent or long duration network events By piping ping output to a file overnight activity can be viewed all at once Indicates the probe packet route
102. ng m Degraded The unit is operating The unit is partly faulty or degraded and some error has been detected Although a faulty state is displayed for the unit it is operating normally m Deconfigured There is no problem with the unit itself but it is degraded due to a configuration problem environmental problem or the degradation of another unit m Maintenance Maintenance is being performed replacefru 8 or addfru 8 is being executed XSCF gt showstatus FANBP_B Status Normal a FAN_A 0 Status Faulted XSCF gt Chapter 3 Troubleshooting 3 13 3 5 4 Using the fm ump Command The fmdump command displays the contents of the log managed by the module called Fault Manager This example assumes that only one error exists fmdump TIME UUID SUNW MSG ID Nov 02 10 04 15 4911 Oee65618 2218 4997 c0dc b5c410ed8ec2 SUN4 8000 0Y 3 5 4 1 fmdump V Command To get more detailed information you can use the e option as shown in the following example Nov 100 FRU fmdump V u 0ee65618 2218 4997 c0dc b5c410ed8ec2 TIMI E UUID SUNW MSG ID 02 10 04 15 4911 0ee65618 2218 4997 c0dc b5c410ed8ec2 SUN4 8000 0Y fault io fire asic hc product id SUNW A70 motherboard 0 rsrc hc motherboard 0 hostbridge 0 pciexrc 0 3 14 The output method using the V option displays at least three additional lines m The first line is the same information shown for console messages above includ
103. ng Precautions Caution The server is easily damaged by static electricity To prevent damage to printed circuit boards wear a wrist strap and connect it to the server prior to starting maintenance Caution Do not bend the motherboard unit MBU or the components mounted on circuit boards might be damaged To prevent the motherboard unit from being bent observe the following precautions m Hold the motherboard unit by the handle where the board stiffener is located m When removing the motherboard unit from the packaging keep the motherboard unit horizontal until you lay it on the cushioned conductive mat m Connectors and components on the motherboard unit have thin pins that bend easily Therefore do not place the motherboard unit on a hard surface m Be careful not to damage the small parts located on both sides of the motherboard unit Caution The heat sinks can be damaged by incorrect handling Do not touch the heat sinks while replacing or removing motherboard units If a heat sink is loose or broken obtain a replacement motherboard unit When storing or carrying a motherboard unit ensure that the heat sinks have sufficient protection SPARC Enterprise M3000 Server Service Manual October 2008 Caution When removing a cable such as the LAN cable if your fingers do not reach the latch lock of the connecter use a flat head screwdriver to push the latch to disconnect the cable If you forcibly inser
104. nit and secure it with the two screws 2 Mount the DIMMs For details see Section 7 3 Installing the DIMMs on page 7 5 3 Reconnect all the cables to their original connection destinations on the motherboard unit 4 Attach the air duct For details see Section 5 3 3 Attaching the Air Duct on page 5 6 5 Mount the PCle cards For details see Section 8 3 Mounting a PCle Card on page 8 4 6 Attach the top cover For details see Section 5 2 2 Attaching the Top Cover on page 5 4 7 Slide the server into the equipment rack and secure the stabilizer bar For details see Section 5 1 2 Sliding the Server into the Equipment Rack on page 5 3 8 Mount the power supply unit correctly 9 Power on the server This procedure includes the steps of reconnecting the power cord checking the LED status and turning the mode switch on the operator panel to the Locked position For details see Section 4 5 3 1 Power on by Using the XSCF Command on page 4 13 Note If automatic startup of the Solaris OS is specified use the sendbreak d domain_id command of the XSCF shell to display the ok prompt after the display console banner is displayed but before the system starts booting the Solaris OS 10 Check the hardware This procedure includes the steps of running a program to verify that all the components have been remounted and starting the Solaris OS For details see Section 4 5 4 Verif
105. on E 4 SPARC Enterprise M3000 Server Service Manual October 2008 E 4 E 4 1 E 4 2 Power Supply Conditions TABLE E 2 and TABLE E 3 show the power supply conditions of the UPC interface Input Circuit TABLEE 2 Input Power Supply Conditions Signal name Input conditions BPS UALM No voltage relay contact BTL Contact rating DC 12 V 10 mA or greater 0 5 maximum ACOFF Use of a gold plated contact or reed relay is recommended Limit the signal line chattering period to 1 ms Output Circuit TABLEE 3 Output Power Supply Conditions Signal name Output conditions ER Output voltage VOH DC 3 1 V minimum VOL DC 0 0 4 V maximum Output current IOH 4 MA maximum IOL 4 mA maximum Appendix E UPS Controller E 5 E 5 UPS Cable The UPS cable specifications are as follows m Connector shape D SUB9 pin male receptacle side female DEU 9PF F0 m Terminal array FIGURE E 3 shows the pin signals of the UPC port and UPS cable Do not use any pin that is not defined Pin 2 3 4 and 8 in FIGURE E 3 The pins on the cable side are as follows FIGURE E 3 Correspondence between the UPC Ports and the UPS Cable Pins UPC port side UPS cable side Pin Signal name number 9 BPS UALM 9 77 SG Ci BTL BTL a ACOFF BPS UALM ACOFF Note Do not connect to the ER signal pin If a UPS cable is required it must be prepared separately For details contact your sales representa
106. onent 1 CD RW DVD RW drive unit DVDU 11 1 Accessing the CD RW DVD RW Drive Unit Caution There is a risk of electrical failure if the power cords are not disconnected All the power cords must be disconnected to completely cut the power to the server 11 2 SPARC Enterprise M3000 Server Service Manual October 2008 Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions See Section 1 1 ESD Precautions on page 1 1 1 Power off the server This procedure includes the steps of turning the mode switch on the operator panel to the Service position verifying that the POWER LED is off and disconnecting the power cord For details see Section 4 5 1 Powering off the Server on page 4 11 2 Slide the server out from the equipment rack For details see Section 5 1 1 Sliding the Server Out from the Equipment Rack on page 5 1 Caution To prevent the equipment rack from tipping over you must deploy the antitilt feature if applicable before you slide the server out of the equipment rack Note When the cable management arm is not mounted remove the cable ties that fix the PCI cable to the rear of the server and slide the server out 3 Remove the top cover For details see Section 5 2 1 Removing the Top Cover on page 5 3 11 2 Removing the CD RW DVD RW Drive Unit 1
107. op Cover Caution All internal components can be replaced in cold replacement mode The server must be powered off and the power cord must be disconnected from the AC power supply Before the top cover is removed the server must be pulled out from the equipment rack For the procedure for pulling the server out from the equipment rack see Section 5 1 Sliding the Server Into and Out of the Equipment Rack on page 5 1 Removing the Top Cover Caution All internal components can be replaced in cold replacement mode The server must be powered off and the power cord must be disconnected from the AC power supply 1 Loosen the three screws at the top rear of the server Chapter 5 Internal Components Access 5 3 2 To remove the top cover slide it toward the rear side and raise it FIGURE 5 2 FIGURE 5 2 Removing the Top Cover 9 22 Attaching the Top Cover 1 Align the top cover 2 Tighten the three screws at the top rear of the server to secure the top cover in the predetermined position 3 Pushthe server back into the eguipment rack For details see Section 5 1 2 Sliding the Server into the Eguipment Rack on page 5 3 90 Removing and Attaching the Air Duct server must be powered off and the power cord must be disconnected from the AC power supply Before the top cover is removed the server must be pulled out from the eguipment rack Caution All internal components can be rep
108. placing the Hard Disk Drive Backplane This chapter explains how to replace the hard disk drive backplane m Section 10 1 Accessing the Hard Disk Drive Backplane on page 10 2 m Section 10 2 Removing the Hard Disk Drive Backplane on page 10 3 m Section 10 3 Mounting the Hard Disk Drive Backplane on page 10 5 m Section 10 4 Reassembling the Server on page 10 6 The hard disk drive backplane is a cold replacement component The entire server must be powered off and the power cords must be disconnected to replace the hard disk drive backplane 10 1 FIGURE 10 1 shows the location of the hard disk drive backplane FIGURE 10 1 Hard Disk Drive Backplane Location Location number Component 1 Hard disk drive backplane HDDBP 0 10 1 Accessing the Hard Disk Drive Backplane disconnected All the power cords must be disconnected to completely cut the power Caution There is a risk of electrical failure if the power cords are not to the server 10 2 SPARC Enterprise M3000 Server Service Manual October 2008 Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions Section 1 1 ESD Precautions on page 1 1 1 Power off the server This procedure includes the steps of turning the mode switch on the operator panel to the Service position verifying that the POWER LED is off and disconnect
109. r Panel Overview The system administrator or service engineer checks the operating status of the server with LEDs or operates the power supply with the power switch FIGURE 2 4 shows the location of the operator panel FIGURE 2 4 Operator Panel Location Location number Component POWER LED XSCF STANDBY LED CHECK LED Power button a AeA ODO N e Mode switch key switch Chapter 2 Hardware Overview 2 5 222 Switches on the Operator Panel TABLE 2 1 depicts the functions of the switches on the operator panel The switches on the operator panel include the mode switch for setting the operation mode and the power switch for turning on and off the server TABLE 2 1 Switches Operator Panel Switch Name Description of function Mode This switch is used to set the operation mode for the server l Switch Insert the special key that is under the customer s control to switch between modes Locked Normal operation mode A e The system can be powered on with the power button but it cannot be powered off with the power button The key can be pulled out at this key position Service Mode for maintenance Y The system can be powered on and off with the power button e The key cannot be pulled out at this key position e To stop and maintain the server set the mode to Service Because remote power control and automatic power control for the server can be disabled in Service mode unintended power on durin
110. rademarks of SPARC International Inc in the U S and other countries Products bearing SPARC trademarks are based upon architecture developed by Sun Microsystems Inc SPARC64 is a trademark of SPARC International Inc used under license by Fujitsu Microelectronics Inc and Fujitsu Limited The OPEN LOOK and Sun Graphical User Interface was developed by Sun Microsystems Inc for its users and licensees Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry Sun holds a non exclusive license from Xerox to the Xerox Graphical User Interface which license also covers Sun s licensees who implement OPEN LOOK GUIs and otherwise comply with Sun s written license agreements United States Government Rights Commercial use U S Government users are subject to the standard government user license agreements of Sun Microsystems Inc and Fujitsu Limited and the applicable provisions of the FAR and its supplements Disclaimer The only warranties granted by Fujitsu Limited Sun Microsystems Inc or any affiliate of either of them in connection with this document or any product or technology described herein are those expressly set forth in the license agreement pursuant to which the product or technology is provided EXCEPT AS EXPRESSLY SET FORTH IN SUCH AGREEMENT FUJITSU LIMITED SUN MICROSYSTEMS INC AND THEIR AFFILIATES MAKE NO REPRESENTATIONS OR
111. s Yes d Chapter 9 Hard disk drive backplane HDDBP Top Yes No No Chapter 10 CD RW DVD RW drive unit Front top Yes No No Chapter 11 DVDU Power supply unit PSU Rear Yes Yes Yes t Chapter 12 Fan unit FAN_A Top Yes Yes t Yes t Chapter 13 Fan backplane FANBP_B Top Yes No No Chapter 14 Operator panel OPNL Front top Yes No No Chapter 15 The FRU is operated directly without using XSCF commands The FRU is operated with XSCF commands d m The hard disk drive has a redundant configuration only when disk mirroring software is used m f a hard disk drive is an unmirrored boot device it must be replaced by using the cold replacement procedure However if a boot device can be disconnected by means of a Solaris OS function or disk mirroring software function active replacement can also be performed The procedure for disconnecting a hard disk drive varies depending on the software being used For details see the man uals for the relevant software Chapter 4 FRU Replacement Preparation 4 3 TABLE 4 3 lists the access location and applicable installation methods for each FRU TABLE 4 3 FRU Access Locations and Installation Methods Access Active Where to find the FRU location Cold addition Hot addition addition procedure Motherboard unit Top No No No MBU_A Memory DIMM Top Yes No No Chapter 7 PCIe card PCIe Top Yes No No Chapter 8 Hard disk drive HDD Front Yes Yes Yes Chapter 9 Hard disk drive backplane HDDBP
112. s B 1 B 1 Confirmation of DIMM Information B 1 B 2 Memory Mounting Conditions B 3 C FRU List C 1 C 1 Server Overview C 1 C2 Motherboard Unit C 2 C 21 Memory DIMM C 2 C 2 2 PCleSlot C 3 x SPARC Enterprise M3000 Server Service Manual October 2008 C 3 C 4 C 5 C 2 3 CPU C4 C 2 4 XSCF Unit C 4 Drive C 5 C 3 1 Hard Disk Drive C 5 C 3 2 CD RW DVD RW Drive Unit DVDU C 6 Power Supply Unit C 6 Fan Unit C 7 External Interface Specifications D 1 D 1 D 2 D 3 D 4 D 5 E 1 E 2 E 3 E 4 E5 E 6 Serial Port D 2 UPC Port D 2 USB Port D 3 SAS port D 3 Connection Diagram for Serial Cable D 4 UPS Controller E 1 Overview E 1 Signal Cable E 2 Configuration of Signal Lines E 3 Power Supply Conditions E 5 E 4 1 Input Circuit E 5 E 4 2 Output Circuit E 5 UPS Cable E 6 Connections E 7 Abbreviations Abbreviations 1 Index Index 1 Contents xi xii SPARC Enterprise M3000 Server Service Manual October 2008 Preface The Sun SPARC Enterprise M3000 Server Service Manual describes how to service the Sun SPARC Enterprise M3000 server References herein to the SPARC Enterprise M3000 server and the M3000 server are references to the Sun SPARC Enterprise M3000 server This document is written for maintenance providers who have received formal service training or customers who have received training under a self maintenance contract A single engineer service model is used for servicing the SPARC
113. se M3000 Server Service Manual October 2008 TABLE 3 3 Predictive Self Healing Messages Continued Output displayed Description Nov 1 16 30 20 dt88 292 Refer to WEB SITE Where to find specific information and http sun com msg SUN4 8000 0Y for more actions for this error information Nov 1 16 30 20 dt88 292 AUTO RESPONSE One or AUTO RESPONSE What if anything the system did more device instances may be disabled to alleviate any follow on problems Nov 1 16 30 20 dt88 292 IMPACT Loss of services IMPACT A description of what is considered to be the provided by the device instances associated with this impact of the fault fault Nov 1 16 30 20 dt88 292 REC ACTION Schedule a REC ACTION A brief description of the corrective repair procedure to replace the affected device Use action the system administrator should take Nov 1 16 30 20 dt88 292 fmdump v u EVENT_ID to identify the device or contact Sun for support 3 4 2 Monitoring Output To understand error conditions collect monitoring output information For the collection of the information use the commands shown in TABLE 3 4 TABLE 3 4 XSCF Commands for Checking Monitoring Output Command Operand Description showlogs XSCF console Displays the console of a domain coma monitor Logs messages that are displayed in the message window panic Logs output to the console during a panic ipl Collects console data generated during the period of the power on of a
114. see the SPARC Enterprise Equipment Rack Mounting Guide Sliding the Server Out from the Equipment Rack Caution To prevent the equipment rack from tipping over you must deploy the antitilt feature if applicable before you slide the server out of the equipment rack Note If cable management arms are not attached remove the cable ties securing the PCI cables to the rear of the server and slide the server out 5 1 Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions See Section 1 1 ESD Precautions on page 1 1 1 If the equipment rack is supplied with a stabilizer be sure to install it 2 Slide the server out as far as possible When the server is drawn out completely the server is automatically locked in the predetermined position 3 Loosen the two screws at the front of the server FIGURE 5 1 FIGURE 5 1 Pulling the Server Out from an Equipment Rack 4 Slide the server out 5 2 SPARC Enterprise M3000 Server Service Manual October 2008 5 1 2 Sliding the Server into the Equipment Rack 1 Push the server back into the equipment rack 2 Tighten the two screws at the front of the server to secure it in the equipment rack FIGURE 5 1 3 Return the stabilizer of the equipment rack to its original position 9 2 52 1 Removing and Attaching the T
115. t Pin number Signal name Input output Description 1 RST Output Request to Send 2 DTR Output Data Terminal Ready 12345678 3 TXD Output Send Data GND Ground 5 GND Ground 6 RXD Input Receive Data 7 DSR Input Data Set Ready 8 CTS Input Clear to Send D 2 UPC Port TABLE D 2 UPC Port Pin assignment Pin number Signal name Input output Description 1 ER tes Sa 54321 2 NC Not connected 3 NC Not connected s 4 NC Not connected 9876 5 SG gt Ground 6 BPS Input UPS hardware error signal 7 BTL Input Battery end of discharge warning signal 8 NC Not connected 9 ACOFF Input Power failure detection signal D 2 SPARC Enterprise M3000 Server Service Manual October 2008 D 3 USB Port TABLE D 3 USB Port Pin assignment Pin number Signal name Input output Description 1 VBUS Output Power supply 2 DATA Input output Data 3 DATA Input output Data 4 GND Ground D 4 SAS port The SAS port connects the server to an external device such as a tape drive which has a SASinterface For information on which devices can be connected contact a service engineer Note that one or two lanes can be used with this port Appendix D External Interface Specifications D 3 D 5 Connection Diagram for Serial Cable FIGURE D 1 Connection Diagram for Serial Cable RJ45 D sub 9pin ARTS __ _ SCOTS L 1CD 2 DTR 6 DSR 7 DSR 4 DTR 3TXD 2RXD 4 GN
116. t your fingers into the service clearance the LAN port of the motherboard unit of PCI Express PCle cards may be damaged Chapter 1 Safety Precautions for Maintenance 1 5 1 6 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 2 Hardware Overview This chapter explains the names of components and also explains the LEDs on the operator panel and rear panel m Section 2 1 Name of Each Part on page 2 1 m Section 2 2 Operator Panel on page 2 4 m Section 2 3 LED Functions of Components on page 2 10 m Section 2 4 External Interface Port on Rear Panel on page 2 12 m Section 2 5 Labels on page 2 16 2 1 Name of Each Part This section explains the names of parts mounted on the SPARC Enterprise M3000 server Among these parts those which can be replaced in the field by a certified field engineer are called Field Replaceable Units FRU For information on the actual replacement expansion procedure for FRUs see Chapter 6 to Chapter 15 The server consists of a chassis in which various components are mounted top cover to protect the mounted components front panel and rear panel An operator panel is located on the front panel and ports used to connect external interfaces are located on the rear panel From the LEDs on the operator panel and rear panel error and other status information can be checked For details see Section 2 2 Operator Panel on page 2 4 to Se
117. taining the hardware component which caused the error will indicate the error location However some FRUs such as DIMMs do not have LEDs To check the state of a FRU that has no LEDs use an XSCF shell command such as showhardconf in the maintenance terminal For details see TABLE 3 1 TABLE 2 5 describes the component LEDs and their functions TABLE 2 5 Component LEDs and Their Functions Component Name Color Description Motherboard unit POWER Indicates whether the MBU is operating MBU On green Indicates that the motherboard is operating The motherboard nn D cannot be removed from the server while the POWER LED is 00 on AO Blinking Indicates that the MBU is being incorporated into the system green or being disconnected from the system Off Indicates that the MBU is stopped The MBU can be disconnected and replaced CHECK Indicates the motherboard unit status On amber Indicates that an error occurred in the MBU A Off Indicates that the MBU is in the normal state 2 10 SPARC Enterprise M3000 Server Service Manual October 2008 TABLE 2 5 Component LEDs and Their Functions Continued Component Name Color Description Hard disk drive Indicates that the hard disk drive can be removed However HDD this LED is not used 7 CHECK On amber Indicates that an error occurred in the HDD However this LED stays on for several minutes until init
118. ted All the power cords must be disconnected to completely cut the power to the server Caution To ensure that you and bystanders are not exposed to harm and to prevent damage to the system observe the ESD safety precautions See Section 1 1 ESD Precautions on page 1 1 1 Power off the server This procedure includes the steps of turning the mode switch on the operator panel to the Service position verifying that the POWER LED is off and disconnecting the power cord For details see Section 4 5 1 Powering off the Server on page 4 11 2 Slide the server out from the equipment rack For details see Section 5 1 1 Sliding the Server Out from the Equipment Rack on page 5 1 Caution To prevent the equipment rack from tipping over you must deploy the antitilt feature if applicable before you slide the server out of the equipment rack Note When the cable management arm is not mounted remove the cable ties that fix the PCI cable to the rear of the server and slide the server out 3 Remove the top cover For details see Section 5 2 1 Removing the Top Cover on page 5 3 4 Remove the air duct from the server For details see Section 5 3 2 Removing the Air Duct on page 5 5 Chapter 7 Replacement and Installation of Memory 7 3 7 2 Removing the DIMMs 1 Pull the DIMM eject levers outward to release the DIMM 2 Pull the DIMM upwards to remove the DIMM fro
119. the SPARC Enterprise M3000 Server Installation Guide Caution Do not make mechanical or electrical modifications We are not responsible for regulatory compliance of modified servers Equipment Rack Safety Precautions m The equipment racks must be anchored to the floor ceiling or to adjacent frames m Some equipment racks are supplied with a stabilizer feature which supports the weight of the server when it is extended on its slide rails This prevents the equipment from toppling over during installation or maintenance m In the following cases a safety evaluation must be conducted by the service engineer prior to installation or maintenance work When no stabilizers are attached and the equipment rack is not anchored to the floor ensure safety by confirming that the server does not fall over when it is pulled out from the slide rails Chapter 1 Safety Precautions for Maintenance 1 3 1 4 When the equipment rack is mounted on a raised floor ensure that the raised floor has sufficient strength to withstand the weight upon it when the server is extended on its slide rails Fix the equipment rack through the raised floor to the concrete floor below it using a proprietary mounting kit for this purpose Caution If more than one server is installed in an equipment rack maintain the servers one at a time For details of equipment racks see the SPARC Enterprise Equipment Rack Mounting Guide Component Handli
120. the equipment rack For details see Section 5 1 1 Sliding the Server Out from the Equipment Rack on page 5 1 3 Remove the fan cover For details see Section 5 4 1 Removing the Fan Cover on page 5 7 13 2 Removing a Fan Unit 1 While pressing the latch 1 in FIGURE 13 2 lift the fan unit 2 in FIGURE 13 2 Chapter 13 Fan Unit Replacement 13 3 2 Remove the fan unit from the server and place it on a conductive mat gt E FIGURE 13 2 Removing a Fan Unit SPARC Enterprise M3000 Server Service Manual October 2008 13 4 13 3 Mounting a Fan Unit Caution Do not forcibly mount a fan unit in a slot Forcibly pushing a fan unit in a slot may damage the fan unit or the server 1 Adjust the position of the fan unit so that its latch is on the right when you are facing the front of the server 2 Insert the fan unit in the slot 13 4 Reassembling the Server 1 Attach the fan cover For details see Section 5 4 2 Attaching the Fan Cover on page 5 8 2 Slide the server into the equipment rack and secure the stabilizer bar For details see Section 5 1 2 Sliding the Server into the Equipment Rack on page 5 3 3 From the XSCF shell prompt use the showhardconf command to verify that the new fan unit has been installed XSCF gt showhardconf For details see Section 4 4 2 Verifying the Hardware Operation on page 4 10 Chapter 13 Fan Unit Rep
121. tion is being performed through the part LAN port rl Off Indicates that communication is not being performed through FES the LAN port LINK On amber Indicates that the communication speed of the LAN port is 1 SPEED Gbps On green Indicates that the communication speed of the LAN port is 100 Mbps Off Indicates that the communication speed of the LAN port is 10 Mbps 2 4 External Interface Port on Rear Panel This section shows the location of the external interface ports located on the server rear panel and explains their functions 2 12 SPARC Enterprise M3000 Server Service Manual October 2008 FIGURE 2 5 External Interface Port Locations 32209 ae 02880807 985 i 7 10 Chapter 2 Hardware Overview 2 13 TABLE 2 6 External Interface Port Functions Location number Component Description Used to connect the server to a peripheral device 1 RCI port 2 USB port for XSCF 3 Serial port for XSCF 4 LAN port 1 for XSCF 5 LAN port 0 for XSCF 6 UPC port 1 7 UPC port 0 having a PCI connector to enable power interlocking and error monitoring Exclusive for maintenance personnel Cannot be connected to general purpose USB devices Connects to the XSCF unit through serial connection to set up and manage the server Accommodates a 100Base TX LAN cable to set up the server and display status e XSCF shell command line interface CLI e XSCF Web browser user
122. tive E 6 SPARC Enterprise M3000 Server Service Manual October 2008 E 6 Connections This section explains the connection between the UPC port and UPS m UPC 0 connects to UPS 0 and UPC 1 connects to UPS 1 m Single power feed uses only UPC 0 m Dual power feed uses UPC 0 and UPC 1 Appendix E UPS Controller E 7 E 8 SPARC Enterprise M3000 Server Service Manual October 2008 Abbreviations ASIC AT ATAPI BUI CH CMP CMT CB CLKU CLI CMU CH CE CPU Application specific integrated circuit Advanced technology AT attachment packet interface Browser based user interface Channel Chip multi processor Chip multi threading Circuit breaker Clock control unit Command line interface CMU channel Correctable error Central processing unit DCL DDC DE DID DIMM DRAM E Ecache ECC F FANBP FMA FRU GBps GHz GUI Domain component list DC to DC converter Diagnosis engine Domain ID Dual inline memory module Dynamic random access memory External cache Error correction code Fan backplane Fault management architecture Field replaceable unit Gigabyte per second Gigahertz Graphical user interface Abbreviations 1 HDD HDDBP I2C bus ISA IOBP L LCD LED LSB LSI MAC MBC MEM MBU NTP NVRAM O OPNL OS p PCle POST Abbreviations 2 Hard disk drive Hard disk drive backplane Inter integrated circuit bus Instruction set ar
123. ut The pid pcpu Provides only most important information pmem and comm options display process ID Knowing the percentage of resource percent CPU consumption percent memory consumption helps identify processes that are consumption and the relevant executable file affecting performance and might be hung respectively 0 The following example shows output for the ps command ps PID TTY TIME CMD 101042 pts 3 0 00 ps 101025 pts 3 0 00 sh When using sort with the r option the column headings are output so that the value in the first column is equal to zero Using the prstat Command The prstat utility repeatedly examines all the active processes in the system and reports statistics based on the selected output mode and sort order The prstat command provides output similar to the ps command Chapter 3 Troubleshooting 3 27 3 6 7 1 Options TABLE 3 12 lists the options of the prstat command and how those options can help troubleshooting TABLE 3 12 Options for prstat Option Description How it can help No option Displays a list of the processes sorted in Output identifies the process ID user ID used descending order of consumption amount of amount of memory state CPU consumption CPU resources The list is limited to the height and command name of the terminal window and the total number of processes Output is automatically updated every 5 seconds Pressing CTRL C stops the execution
124. ving the Fan Backplane 1 Remove the signal cable for the operator panel 1 in FIGURE 14 3 The cable is on the right side of the fan backplane 2 Remove the signal cable connector for the motherboard unit 2 in FIGURE 14 3 The cable connector is on the rear side of the fan backplane 3 Remove the four screws securing the fan backplane 3 in FIGURE 14 3 FIGURE 14 3 Removing the Fan Backplane 4 Remove the fan backplane from the server and place it on a conductive mat Chapter 14 Fan Backplane Replacement 14 5 143 Mounting the Fan Backplane 1 Place the fan backplane on the bottom of the chassis Adjust the fan backplane so that it is level and it does not catch on any cables Tighten the four screws to secure the fan backplane in position Connect the signal cable for the operator panel The cable is on the right side of the fan backplane Mount the signal cable connector for the motherboard unit The cable connector is to be mounted on the rear side of the fan backplane 14 4 Reassembling the Server 1 Mount the fan cage Set the button of the fan cage at the A part in FIGURE 14 3 and insert the fan cage projections in the tabs 1 in FIGURE 14 1 Mount the fan units For details see Section 13 3 Mounting a Fan Unit on page 13 5 Attach the air duct to the server For details see Section 5 3 3 Attaching the Air Duct on page 5 6 Attach the top cover For details
125. y be displayed Chapter 3 Troubleshooting 3 15 3 5 5 2 3 5 5 3 fmadm repair Command When the fmadm faulty command displays a fault the fmadm repair command must be executed to clear the FRU information in the domain after replacement of the motherboard unit that has encountered the error If the fmadm repair command is not executed the error message is not cleared If the fmadm faulty command displays a fault clearing the FMA resource cache on the operating system side causes no problem Data in the cache does not need to match the hardware fault information held by the XSCF fmadm repair STATERESOURCE UUID degraded dev pci le 600000 0ee65618 2218 4997 c0Odc b5c410ed8ec2 fmadm config Command The fmadm config command output displays the version number and current status of the diagnosis engine that is being used by the server Whether the latest engine is being used can be determined by consulting the SunSolve web site fmadm config MODULI cpumem diagnosis cpumem retire disk transport eft event transport fabric xlate fmd self diagnosis io retire snmp trapgen sysevent transport syslog msgs zfs diagnosis zfs retire VERSION STATUS DESCRIPTION 1 6 active CPU Memory Diagnosis 1 1 active CPU Memory Retire Agent 1 0 active Disk Transport Agent 1 16 active eft diagnosis engine 2 0 active Event Transport Module 1 0 active Fabric Ereport Translater 1 0 active Fault
126. ying the Hardware Operation on page 4 15 Chapter 6 Motherboard Unit Replacement 6 9 6 10 SPARC Enterprise M3000 Server Service Manual October 2008 CHAPTER 7 Replacement and Installation of Memory This chapter explains how to replace and install memory DIMMs m Section 7 1 Accessing the DIMMs on page 7 3 m Section 7 2 Removing the DIMMs on page 7 4 m Section 7 3 Installing the DIMMs on page 7 5 m Section 7 4 Reassembling the Server on page 7 5 DIMMs are cold replacement components This means that the entire server must be powered off and the power cords must be disconnected to replace the DIMMs For information on the mounting conditions of DIMMs see Appendix B The same methods as those for replacement are used for installation of DIMMs 7 1 Locations of DIMMs and Memory Slots vdt N KK E Na memory slot group A memory slot group A memory slot group B 1 1 7 MEM 00B memory slot group B MEM 01A memory slot group A MEM 01B memory slot group B MEM 02A memory slot group A MEM 02B memory slot group B MEM 03A MEM 00A MEM 03B Component FIGURE 7 1 shows the locations of DIMMs and memory slots FIGURE 7 1 Location number SPARC Enterprise M3000 Server Service Manual October 2008 7 2 Accessing the DIMMs Caution There is a risk of electrical failure if the power cords are not disconnec
Download Pdf Manuals
Related Search
Related Contents
Samsung WB210 User Manual Owner`s Manual Guide du Propriétaire Manual del Usuario R\311GIMEN DE CONTRATO DE TRABAJO Manual de usuario 取扱説明書 AD-6350 Copyright © All rights reserved.
Failed to retrieve file