Home
IBM RS/6000 SP User's Manual
Contents
1. Old style 2 01 m frame New style 2 01 m frame wa wa wa we W3 v8 W7 w3 W7 wa sie H10 we W2 s16 H10 we I vi v6 F ay F HH w5 HNI _ Ws IA Hon w 14 H9 p Ha wi S14 H9 Cn HH I o v7 N 45 u HL N OHO a s12 H8 E O Ho s12 __H8 OH U OHO H v2 e HH H OHH ale Hd HOA So S10 H7 bo IIH H OH s10 H7 HO H Hogi Ar ve HE HH H H HE f HY H qHH Ba s08 H6 aE HE D d HH s08 H6 HAC HH va HAS Im AH H Da sos HS N sog H5 so a v5 N l s04 H4 k N s04 H4 v4 N so2 H3 N so2 H3 N i Power N Power H1 H1 E1 E2 r N ai E1 a Ss ED Gas __ 3 a E4 E4 Figure 2 10 Frame cable routing paths in rear of RS 6000 SP multi switch frame F C 2030 1 2 01 m frame 2 12 RS 6000 SP SP Switch Service Guide Switch data cables SPS data cables describes the attachment locations and routing for the internal SPS Switch data cables Table 2 2 SPS Switch data cable chart Cable Part Plug from Plug to Number Location Location 46H9710 E00 S00 BH J7 E00 NO1 BH PA 46H9711 E00 S00 BH J8 E00 NO2 BH PA 46H9712 E00 S00 BH J26 E00 N03 BH PA 46H9713 E00 S00 BH J25 E00 N04 BH PA 46H9714 E00 S00 BH J9 E00 NO05 BH PA 46H9715 E00 S00 B
2. 1 25 m frame New style 2 01 m frame Old style 2 01 m frame 16 H10 15 v 16 H10 15 y v8 vi O v 14 H9 13 D r 14 H9 13 F 2 H8 11 v g 12 H8 11 VE 5 is Hv2 HH v 10 Hr 9 Oo y 10 H7 9 HH 7 3 HH ve HH _ 8 H 7 M 8 He 7 Y HH 8 HE 7 v gE v 6 H5 58 v 6 H5 5 KR H5 5 H 4 H4 3 v A Ha 3 v Am 3 ws NI v 283 Mi 2m f yay 2 H3 1 N Switch i i Switch A Switch N g N Power Power j Power H1 H1 2 H1 N Ei E E1 g E1 E2 E3 E3 Zee Figure 2 8 Frame cabling routing path in rear of RS 6000 SP frame 2 01 m frame Note For a multi switch frame F C 2030 1 refer to Eigure 2 7 2 10 RS 6000 SP SP Switch Service Guide able 2 1 shows external cable routing in a RS 6000 SP frame populated with 16 processor nodes Refer Table 2 1 External cable routing 9 to see the routing paths Cable routing in a multi switch frame F C 2030 1 and Figure 2 10 on page 2 12 show back views of the RS 6000 SP multi switch frame with frame extension showing the horizontal and vertical paths of cable routing from connector to connector The depths has been amplified on the drawing Note W1 consists of vertical raceways V1 V4 W5 consists of vertical raceways V5 V8 Chapter 2 Locations Frame Horizontal Slot Number Cable Budget Entra
3. J Switch Asm S16 1 Switch Asm S14 Switch Asm 12 I SwitchAsmS10 i Switch Asm S8 I Switch Asm S6 0 Switch Asm S4 Switch Asm S2 MainPower L Switch with A LED Loo H H H LeftSkit Right Skirt J 2 A B C 48 V Power Modules Front Cover and Skirt and Air Filter Removed Figure 2 2 Front view of multi switch frame locations Figure 2 3 on page 2 5 shows a front view of the Model 3AX 49 inch frame 2 4 RS 6000 SP SP Switch Service Guide slot 7 Wide Processor Node 5 slots 5 and 6 High Processor Node SP frame locations 1 25 m and 1 4 m frames front view El Bi i Thin Processor Node 8 Thin Processor Node 7 slot 8 5 nt slots 1 4 l Switch assembly slot 17 ale 48 V power module C optional D locations Air filter removed Figure 2 3 Front view of 49 inch frame locations See notes below Figure notes 1 7 Wide processor nodes take up an entire shelf position two thin processor node slots They are identified by the odd numbered position In a F C 2030 1 frame switch assemblies take up an entire shelf partition They are identified by the even numbered position Processor node slots are numbered up to N8 The single phase SEPBU pow
4. Front chassis cable Green Yellow P2 LEDs Figure 1 1 SP Switch high level diagram 1 2 RS 6000 SP SP Switch Service Guide Switch description and problem determination MAP 0590 There are two LEDs on the front of each SP Switch For quick reference their definitions are as follows m Environment Yellow Off No environmental problems detected by switch supervisor card On Warning of environmental condition out of nominal range Preventative Maintenance should be scheduled for this switch Flashing Serious environmental condition detected power shut off m Power Green Off No 48 V dc power available at the SP Switch Flashing Power available at the SP Switch but switch logic is Off On Power available at the SP Switch and logic is On Note Refer to from service position Step 0590 001 Read the following warning and then follow steps to ensure continuity of customer s jobs in the queue 9 for placing or removing the SP Switch into or Attention Servicing a processor node or SP Switch will interrupt customer usage of the processor node and the remainder of the switch network If the switch feature must be replaced in a multi frame system refer to Zi for information on isolating the SP Switch Attention It is possible that the customer has modified switch cabling from standar
5. SRN Service Request Number source Failing component s Description D 762 100 Problem detected with device or through device data corrupted You may 762 199SPS adapter need to reinstall ssp css code software system or I O planar 762 200 D external clock Problem detected with external through SPS adapter clock 762 299 wrap plug 762 2A0 D SPS adapter Problem detected with internal through clock 762 2A9 762 300 D SPS adapter Problem detected with POS through system or I O planar registers 762 399 762 400 D SPS adapter TBIC test failed through software 762 499 762 500 D SPS adapter SRAM test failed through 762 599 762 600 D SPS adapter Microprocessor test failed through 762 699 762 700 D SPS adapter Interrupt test failed through 762 799 762 800 D SPS adapter FIFO test failed through 762 899 762 900 D SPS adapter DMA engine test failed through VO planar 762 999 data cable 762 Ax3 D SPS adapter Card wrap test failed See note terminator 762 Ax4 D SPS data cable Cable wrap test failed See note wrap plug 763 1xx D SPS MX adapter Problem detected with a device See note software or device data corrupted system or I O planar Note You may need to reinstall ssp css software 1 24 RS 6000 SP SP Switch Service Guide SP Switch function MAP 0620 Table 1 9 Service Request Number SRN table for SP Switch adapters continued SRN Serv
6. High voltage present Perform Lockout safety procedures to remove primary power to the frame and high voltage transformer if present SPSFDO09 DANGER High voltage present at test points Use high voltage test probes SPSFDO10 DANGER High energy present Do not short 48V to frame or 48VRtn Shorting will result in system outage and possible physical injury SPSFD011 DANGER If a unique power module fails all LEDs will be off The high voltage LED will be off even though the high voltage is still present SPSFDO12 X _RS 6000 SP SP Switch Service Guide DANGER The remaining steps of the procedure contain measurements that are taken with power on Remember that hazardous voltages are present SPSFDO13 DANGER The frame main circuit breaker and the controller must not be switched on again now Before disconnecting the power cables from the power receptacles ensure that the customer s branch distribution circuit breakers customer power source circuit breakers are Off and tagged with DO NOT OPERATE tags S229 0237 Refer to Lockout safety procedures in Bsn Spl before proceeding SPSFDO14 DANGER Before connecting ac power cabless to electrical outlets ensure that e The customer s branch distribution circuit breakers customer power source circuit breakers are off and tagged with DO NOT OPERATE tags S229 0237 or national language equi
7. w Step 0600 005 After placing the switch into the service position you performed some basic inspections and found that the switch supervisor control cable S00 SP P102 appears to be okay 1 Disconnect S00 PC P4 and S00 PC P4 at the power supply 2 Using a digital multimeter measure resistance between the appropriate pins 3 Compare results with values in able 1 3l Table 1 3 Switch connector resistance values Measure from To GND Acceptable range Voltage positive lead negative lead in ohms 5 V Pin 1 Pin 2 black 1K 5M 12 V Pin 3 Pin 2 black 1K 5M 5 V Pin 5 Pin 2 black 2 20 Pin 7 Pin 2 black 2 20 4 Is the measured resistance in the acceptable range e If yes go to If no go to Step 0600 0064 Step 0600 006 The resistance value you measured in Table 1 3 was not within an acceptable range 1 Replace the inner chassis cable 2 Repeat resistance measurement from Step 0600 005 3 Is the measured resistance in the acceptable range e If yes go to If no go to Step 0600 0071 Step 0600 007 You disconnected SO0 CL P5 at the switch clock card but the measured resistance was still outside the acceptable range 1 Replace the front chassis cable 2 Repeat resistance measurement from Step 0600 005 d to verify fix 1 6 _RS 6000 SP SP Switch Service Guide SP Switch environment MAP 0600 3 Is the measured resistance now within the
8. b Refer to Starting a service call MAP 0100 in for information on using the pg command 8 Problems are indicated at the end of a text line by the following sequence ErrorMessage Examples of error messages 1 uninitialized 3 R faulty link from VOP 9 Does var adm SPlogs css out top file report any problems If yes use the SPS Error Conditions table Table 1 7 on page 1 19 to continue service e If no go to Step 0620 0021 Step 0620 002 No functional problems found 1 Have you replaced any components or fixed problems e If yes a You have resolved the switch problem b Go to End of call MAP 0650 in e If no go to Step 0620 0031 Step 0620 003 You did not find any functional problems and you did not fix or replace anything 1 Did Estart command start okay e If yes call the next level of support e If no a Consult the Diagnosing Switch Problems section of IBM RS 6000 SP PSSP Diagnosis Guide b Return to 1 18 _RS 6000 SP SP Switch Service Guide Table 1 7 SP Switch error conditions SP Switch function MAP 0620 Error Message condition Description and action 2 Initialized Description Initialization detected a wrapped port where a SPS processor node or dependent node was expected this may result from isolation procedures or else a disconnected cable Action 1 If this is an unexpected condition check cabling against
9. 0 position 2 Remove power cable J1 and supervisor cable J2 from the rear of the SP Switch 3 Remove the front panel assembly 4 Replace power supply PS2 5 Take the switch out of the service position refer to 6 Step 0600 031 You have a PSFuseGood problem that did not go away when you removed power supply PS2 1 Put the circuit breaker in the Off 0 position 2 Remove power cable J1 and supervisor cable J2 from the rear of the SP Switch 3 Remove the front panel assembly 4 Replug power supply PS2 and replace the switch supervisor card 5 Reinstall the SPS front panel assembly being careful to align the guide pins on the P1 connector to the inner chassis 6 Replug the power cable J1 and supervisor cable J2 at the rear of the assembly 7 Put the circuit breaker in the On 1 position 8 Does the failure condition still exist e If yes a Replace the SP switch inner chassis assembly b Go to e If no go to SP Switch power MAP 0610 Purpose of this MAP This MAP provides diagnostic information for problems related to the SP Switch power supply Note Refer to removing it from the service position 9 for placing the switch into the service position or Chapter 1 Maintenance Analysis Procedures MAPs 1 13 SP Switch power MAP 0610 Step 0610 001 A message in ea ee indicated that you have a power problem and Table 1 1 on page 1 4 or directed you to
10. 2 Determine which connection is the other end of the switch data cable checked in the step above e Refer to Appendix A in 3 Check the switch data cable at the other SP Switch for any obvious problem such as a loose cable connection or swapped cables 4 Do the switch data cables and any data cable interposers appear to be okay If yes go to Step 0620 0381 e If no a Fix the obvious cable connection problem b Go to to verify fix Step 0620 038 The switch data cables and the data cable interposers appear to be okay 1 Swap the data cable with another one at SOO BH J3 S00 BH J4 or SOO BH J5 2 To provide a good clock swap the switch data cable at the switch assembly with another one 3 Reset the clock setting to that recorded in FSten 0620 0361 4 Run advanced diagnostics on css0 on the test processor node and its associated switch port e Do not perform cable wrap test e Refer to Running diagnostics in a processor node in 1 34 RS 6000 SP SP Switch Service Guide SP Switch function MAP 0620 5 Look for an SRN indicating a clock problem such as External clock being listed as one of the failing componenis 6 Do the css0 diagnostics fail with indication of a clock problem e If yes a Problem is the clock selection in this SP Switch b Return to e If no go to EStep 0620 0391 Step 0620 039 The css0 diagnostics did not fail This indicates th
11. 7 Inthe maintenance shell set the boot list Run diagnostics diag Choose the Service Aids panel Choose the Display Alter Bootlist panel Choose Normal Mode Choose Alter Current Bootlist f Set the bootlist the way it was before the node was serviced 8 Close the console window 9 On the control workstation set the node to boot from disk For example spbootins r disk 2 12 1 10 On the control workstation use Perspectives to power off the node and then power it back on The node will now boot from the device that you specified in step 7 with the correct time oaoop Installing firmware updates on SP nodes Firmware updates for example IPL ROS updates for SP nodes are available at hitp www rs6000 ibm com support micro download html Alternatively you can search AIXTOOLS for the latest versions of the firmware updates for example look for P2SC_IPL on AIXTOOLS for the latest version of IPL ROS on SP Nodes Follow the instructions in the README file within the package Installing adapter microcode packages Certain adapters are shipped with an adapter firmware diskette For factory configured systems the microcode is installed on the SP nodes However for field installations the adapter firmware must be installed This adapter firmware must be installed on the SP nodes along with the adapter The following procedure outlines the adapter microcode installation Updates are periodically made to micro
12. e In raised floor installations mechanically safe moldings should be installed around floor cutouts Extreme caution should be used when moving frames during installation or removal because of the proximity of floor cutouts to casters SPSFCOT15 CAUTION When using step ladder or step stool be sure that the work surface is level and the step ladder or step stool is in good working order SPSFCO16 CAUTION Portable ladders present a serious safety hazard if not used properly Follow these general guidelines e Make sure the ladder is firm and steady and has no defective rungs or braces e Work only on a level surface e Never use a metal ladder near electrical power lines e Never overreach Instead move the ladder Be as careful on a short ladder as on a 30 foot extension ladder False security can lead to carelessness and falls which can cause painful injuries SPSFCO017 CAUTION All IBM laser modules are designed so that there is never any human access to laser radiation above a class 1 level during normal operation user maintenance or prescribed service conditions Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than class 1 power levels For this reason never look into the end of an optical fiber cable or open receptacle Only trained service personnel should perform the inspection or repair of optical fiber cable assemblies and receptacles S
13. 6 The problem is Device status Device not Description Possible hardware problem 1 responding SPS Link status Link as been removed from network other stage faulty 2 Link status Wrap plug is installed Description Initialization detected a wrap plug or cable rather SPS than the expected cabling A for next problem to verify fix 1 19 Chapter 1 Maintenance Analysis Procedures MAPS SP Switch function MAP 0620 Table 1 7 SP Switch error conditions continued Error Message condition Description and action 3 Device status Device has been Description Possible hardware problem SPS removed from network because of a bad signature Action Go to Link status Not operational 4 Device status Device has been Description Switch network not wired as specified in switch SPS removed from network faulty topology or problem with connection between switch and device Link status Link has been removed Note You may get this error number for a jack which could from network or miswired faulty connect to a node but instead has a wrap plug In this case this message is a warning only and can be ignored Action 1 Check this connection against the actual hardware cabling 2 Ifthe cabling does not match correct the cabling problem then go to step 5 below 3 Check the processor node s hostname and switch node number against the configuration using the command s
14. Due to weight of each thin node under 18 Kg 40 Ibs use care when removing and replacing thin nodes above shoulder height SPSFC005 CAUTION The wide node weight may exceed 32 Kg 70 5 Ibs SPSFCO006 CAUTION Do not open more than one wide node or switch assembly drawer at a time SPSFC007 CAUTION Make sure the stability foot and wheel chocks are installed on the frame These are required to maintain frame balance and position during service operations SPSFCO008 CAUTION Outer edges of chassis may be sharp Care must be taken when removing and installing chassis SPSFC009 CAUTION The ground strip may have sharp edges SPSFCO10 CAUTION Do not remove wide nodes or switch assemblies from the mounting slides Caution must be observed when working with mounting slides to prevent pinched fingers or accidental release of the unit SPSFCO11 CAUTION Do not remove the drawer case mounting screws at the bottom of both sides SPSFCO12 CAUTION Once the latch is released push the drawer closed Do not pull as the drawer may disengage from the rails creating a safety hazard SPSFCO13 CAUTION Due to the weight of each wide node use care when sliding and closing wide processor nodes above shoulder height SPSFC014 Xii RS 6000 SP SP Switch Service Guide CAUTION e When moving frames into position team members should work together Using one person on each corner of the frame can prevent strain
15. SP Switch high level diagram 1 2 SPS assembly placing into service position 3 9 SPS assembly removing 4 2 SPS assembly replacing 4 2 SPS assembly replacing from service position 3 9 SPS fan control cable removing 4 3 SPS fan control cable replacing 4 4 SPS fans removing 4 3 SPS fans replacing 4 3 SPS front chassis cable 1 8 SPS partitioning 3 8 SPS power card removing 4 6 SPS power card replacing 4 7 SPS resources removing 3 7 SPS resources restoring 3 7 SPS supervisor card removing 4 4 SPS supervisor card replacing 4 5 SPS supervisor power cable removing 4 5 SPS supervisor power cable replacing 4 6 SPS Switch data cables 2 13 SPS switch inner chassis cable 1 8 SPS 8 Switch data cables 2 13 SPS SPS 8 feature 5 2 5 4 5 6 SPS SPS 8 switch feature 5 4 SPS SPS 8 switch feature 5 2 5 6 SRN Service Request Number table for SP Switch adapters 1 24 static sensitive devices 4 1 supervisor verification frame 3 5 node 3 4 switch 3 5 switch adapters SP Switch Service Request Number SRN table 1 24 switch assembly from active configuration removing 3 7 switch assembly to active configuration restoring 3 8 switch assembly removing 4 2 switch assembly replacing 4 2 switch cables feature 5 8 switch clock source determining the correct 3 6 switch clock source selecting the 3 6 switch clocks selecting appropriate 3 6 Switch data cables SPS 2 13 Switch data cables SPS 8 2 13 switch fans removing 4 3 s
16. 3 4 3 4 4 4 5 4 5 4 6 4 6 4 7 Attention Components in the frame are susceptible to damage from static discharge Always use an ESD wristband when working inside frame covers See more details Do not touch the pins or circuitry on these components This chapter describes the removal and replacement of RS 6000 SP product specific Field Replaceable Unit FRU components Handling static sensitive devices Attention Adapters planars disk drives supervisor cards and memory cards are sensitive to static electricity discharge These devices are wrapped in antistatic bags or containers to prevent this damage Perform the following procedures to prevent damage to these devices 1 Do not remove the device from the antistatic bag or container until you are ready to install the device in the system unit 2 You must wear an ESD wristband while installing or removing any static sensitive devices 3 With the device still in its antistatic bag touch it to a metal frame of the system 4 Grasp cards and boards by the edges Hold drives by the frame Avoid touching the solder joints and pins 5 Handle the devices carefully in order to prevent permanent damage E m mn Fe Antistatic bag Figure 4 1 Handling an anti static device Copyright IBM Corp 1999 2002 4 1 Procedures for switch assemblies CAUTI
17. 40X S for multi switch frame C for control workstation Z for another frame such as a server e 2nd and 3rd characters are the frame number 00 for any all frames designates location inside any all frames Copyright IBM Corp 1999 2002 2 1 01 99 for frames 1 99 specific to that frame Notes 1 E01 designates RS 6000 SP physical frame 1 2 LOO designates any all RS 6000 SP logical frames 3 S00 designates any all RS 6000 SP multi switch frames 4 For locations inside a frame the Frame WWW and or Major Assembly XXX strings may be omitted making the format YY ZZZ maor assembly XXX 1st character is the major assembly type all three characters if the assembly occurs only once in a frame N for processor node assembly S for switch assembly PDU for power distribution unit assembly ADC for ac dc Converter assembly FRA for frame e 2nd and 3rd characters are the major assembly number 00 for any all major assemblies designates location inside any all major assemblies 01 99 for major assembly 1 99 specific to that major assembly Sub assembly YY 1st and 2nd characters are the assembly designation inside the major assembly This string may be omitted in some cases Refer to the lists of two character designations associated with each major assembly throughout this chapter Example SC denotes a switch card Connection location ZZZZ e 1st character is the connect
18. Chassis Cable P4 Power Card 2x1 x P6 Power Card 2x2 Front Chassis Cable P7 P8 CB P9 LED Step 0600 012 You performed the recommended action in Hable 1 4 Component replaced or reseated 1 2 3 4 5 Reconnect all cables inside the SP Switch Install front panel assembly and reinstall front retaining screws see P4 Fan 3 P5 Fan4 P6 Fan 5 2x2 2x2 2x2 Connect only power J1 and supervisor J2 cables at the rear ofthe SP Switch Put the SP Switch s circuit breaker into the On 1 1 8 _RS 6000 SP SP Switch Service Guide position SP Switch environment MAP 0600 6 Check the Environmental yellow LED for an ON or FLASHING condition 7 Is the Environmental yellow LED ON or FLASHING If the Environmental LED is on or flashing a Put the SP Switch s circuit breaker into the Off 0 position j and continue service with the next highest priority e If the Environmental LED is not on or flashing a You have resolved the problem Olto verify fix Step 0600 013 Perspectives indicated a shutdown temperature condition and Hable 1 2 on page 1 4 directed you to this step Temperature is out of specified range however no serious electrical current or fan speed problems have been detected 1 Check for airflow blockage at air intakes and exhaust of the SP Switch and system frame Also check air temperatu
19. If necessary follow the first 2 steps under Fencing Nodes above Ensure that the node is selected Click on On and then click on Do Command This will power on the node and run power on diagnostics Click on Unfence and then click on Do Command to place the node into operation in the partition 3 8 RS 6000 SP SP Switch Service Guide r Efence of primary and primary backup nodes By design Efence of primary and primary backup nodes is not allowed If you attempt to fence either of these nodes you will get the following responses Efence 0028 147 Node number designates the Primary Node Efence 0028 166 Node number designates the Primary Backup Node You should assign a new primary or primary backup node and initiate a restart to be able to remove these nodes from the network for service Service position procedures Note When preparing to place processor node s and or switch assembly s into service position ensure that the customer has removed the processor node s and or switch assembly s from the active configuration Placing a switch assembly into service position 1 Set circuit breaker in the Off 0 position 2 Remove power J1 and supervisor J2 cables from rear of switch assembly 3 Remove the four screws from the front of the switch assembly 4 Remove the SPS front panel assembly from the inner chassis by pulling at the side of the front panel assembly 5 Ifthe inner
20. Model Frames 1 2 3 30x 1 4 L01 S00 i LO1 S00 1 LO1 S00 2 LO2 S00 1 LO2 S00 i LO2 S00 2 L03 S00 1 L03 S00 2 LO3 S00 i LO4 S00 1 L04 S00 2 LO4 S00 3 30x 5 LO1 S00 i LO1 S00 1 LO1 S00 2 LO2 S00 1 LO2 S00 i LO2 S00 3 L03 S00 1 LO3 S00 3 L03 S00 i L04 S00 1 L04 S00 2 L04 S00 3 L05 S00 2 L05 S00 2 L05 S00 3 40x 4 8 S01 S02 i S01 S06 i S01 S10 i All others 1 All others 2 All others 3 Note On an 8 port switch the clock always equals 0 See samples in etc SP Removing and restoring switch resources This procedure can be performed to allow customer to use a switch feature while extended service actions are performed on an individual frame of a multi frame system with the switch feature DO NOT PERFORM this procedure unless the required service operation will take the switch out of the switch configuration for a minimum of two hours for example a part must be ordered or a switch assembly or frame must be repeatedly powered on off and or the customer specifically requests it Care should be taken to understand the consequences on any partitions that might be sharing switch resources See Attention This procedure is intended to allow the customer to use the switch feature during extended repair action The customer must stop all parallel jobs prior to starting the repair Once the repair is complete the customer must stop all parallel jobs again to reconfigure the switch to in
21. acceptable range e If yes go to d to verify fix If no go to FStep 0600 0081 Step 0600 008 You replaced the inner chassis cable and the front chassis cable but the measured resistance is still outside of the acceptable range 1 This indicates that there is still a problem 2 Replace the switch supervisor card Q to verify fix Step 0600 009 You have voltage shutdown condition the supervisor control cable appears to be okay and the resistance value you measured in n liable aan page 12A was within an acceptable range Based on these symptoms directed you to this location 1 Reconnect cable S00 PC P4 at switch power supply 1 2 Repeat resistance measurement from 3 Is the measured resistance within the acceptable range e If yes replace switch power supply 1 and the switch supervisor card Olto verify fix e If no go to EStep 0600 0101 Step 0600 010 With cable SOO PC P4 reconnected at the switch power supply 1 the measured resistance values were still outside the acceptable range 1 Reconnect cable SO0 PC P6 at switch power supply 2 2 Repeat resistance measurement from 3 Is the measured resistance within the acceptable range e If yes replace switch power supply 2 Q Olto verify fix e If no the problem is in the switch supervisor card a Go to Step 0600 0117 Priority 3 Step 0600 011 You arrived at this step because you received a fanfail message or you f
22. customer remove the SP Switch from the active configuration and power off the SP Switch b Go to EStep 0610 01071 e If no a Fix any cable connection problems b Return to Step 0610 010 You have an SP type switch that is either tripping the circuit breaker to the Off position or the circuit breaker is on but the Power green LED is not lit 1 Turn off the circuit breaker 2 Unplug cables connected to J1 and J2 at rear of the SP Switch 3 Remove the front panel assembly from the inner chassis 4 Unplug inner chassis cable from the switch power cards SO0 PC P4 S00 PC P6 and the switch supervisor card SOO SP P5 5 Using a digital pennies check for inner chassis cable continuity from the switch tailgate and the connectors in Table 1 5 Inner chassis cable continuity From To S00 BH_J1 pin 5 amp 9 P6 pin 2 S00 BH J1 pin 5 amp 9 P4 pin 2 S00 BH J1 pin 1 amp 6 P3 pin 16 P3 pin 17 p6 pin 1 P3 pin 17 p4 pin 1 6 On the fan assembly with circuit breaker in ul On 1 position check for front chassis cable continuity between the connectors in Table 1 6 Front chassis cable continuity From To P1 pin 17 P1 pin 16 7 Is there continuity e If you have continuity on all cables go to e If you do not have continuity on an inner chassis cable go to e If you do not have continuity on an front chassis cable go to FStep 0410 0121 Step 0610 011 An inner chassis ca
23. icon is displayed next to the frame icon only if a switch is installed in the frame Click the Notebook icon on the tool bar e When the Notebook window opens make certain that the Switch Status tab is selected The Node failure attribute displays the status of the switch supervisor e No displayed in a green box indicates that the switch supervisor has not failed and the supervisor is responding to communication from the frame supervisor oap Chapter 3 Service procedures 3 5 e Yes displayed in a red box indicates that the switch supervisor has failed and it is not responding to the frame supervisor Note Clicking Help in the Notebook window s lower right corner displays attribute descriptions Selecting appropriate switch clocks The following procedure describes how to select a clock source for a switch assembly followed by an explanation on how to determine which clock source to use on each assembly Attention Changing switch clocks will interrupt use of the switch feature therefore switch clocks should be reselected only due to a component failure or scheduled service of part of the machine Even momentary interruption of the switch clocks which occurs when reselecting switch clocks might require recovery step s to bring all the processor nodes on line In this case use of all processor nodes in the frame or frame s might be interrupted during the recovery procedure Selecting the switch clock sourc
24. initialize the switch using the Estart command Cables connected to the switch assembly and processor nodes which were restored in this procedure should appear in the out top file without any error messages If switch re initialization was successful the customer can start running parallel jobs again Return to MAPS to continue service action s Viewing switch partitions Open the System Monitor window and click on the View option to display any partitions With switch partitioning the nodes are grouped under a partition name Partitioning allows the SP to have different software levels installed and to isolate groups of nodes from each other The Estart command fault service and applications see only a single partition rending nodes 3 4 Click on View to select one of the partitions then click on SP to display an options menu Click on Global Controls to display the frame s and node s in this partition Click on the node you want to fence then click on the Shutdown option Now click on the Do Command option to display options Click on Fence and then click on Shutdown to shutdown the node Note Selecting Enable Autojoin during this sequence will automatically place the node back into the operations of the partition on successful power on of the Node It is suggested that Autojoin not be selected at any time when performing a service action u neing nodes
25. makes them obsolete Copyright IBM Corp 1999 2002 xvii xviii RS 6000 SP SP Switch Service Guide Chapter 1 Maintenance Analysis Procedures MAPS This chapter provides information for identifying problems and guides you to the most likely failed Field Replaceable Unit FRU The MAPS then refer you to the FRU Removal Replacement procedures for the corrective action SP Switch MAPs SP Switch description and problem determination MAP 0590 Purpose of this MAP This MAP describes the physical characteristics of each switch type and provides a table Table 1 1 on containing diagnostic information Each switch network has the following components e Switch adapter cards one per processor node e SP Switch e Switch internal data cables e Switch power cable e Switch wrap plugs male 77G0818 and female 46H9688 for SPS Refer to Eigure 1 2 on page 1 4 for views of the wrap plug e Switch external data cables multi frame only Attention 1 Switch data plug jack connector pins are easily bent Check for bent pins on male plugs or bent pin guides on female jacks if a cable is difficult to plug Problems with bent pins or pin guides can propagate to new plugs jacks if not corrected first 2 All connected SP Switches must be running from the same master clock If the SP Switches have not been set properly all processor nodes in a logical frame will be uninitialized Refer to G for these procedures Refer to Figu
26. no a The processor node has a power problem b Go to Processor node diagnostics and descriptions MAP 0130 in IRS 6000 SP Syster to fix problem Step 0620 011 The Power green LED is lit indicating the processor node is powered on 1 Run advanced diagnostics in service mode on device css0 on this processor node and its associated switch port Notes a Advanced cable wrap tests will not run successfully for 15 and 20 meter SPS data cables If 15 or 20 meter data cables are used swap or change the suspected cable b Advanced css0 diagnostics will require a male and a female wrap plug to perform the card wrap and cable wrap tests When testing the SPS adapter cable part number 46H9699 will also be required to perform card diagnostics This cable is part of the SPS feature bill of materials c If unable to run diagnostics reseat or replace the card 2 Do you get a Service Request Number SRN from cssO diagnostics Chapter 1 Maintenance Analysis Procedures MAPs 1 23 SP Switch function MAP 0620 Table 1 9 Service Request Number SRN table for SP Switch adapters If yes a An SRN was obtained from diagnostics b Use the following SRN table Table 1 9 to continue service c After using able 1 91 go to If no a The diagnostics did not detect a problem therefore the problem is in SP Switch b Go to EStep 0620 041 on page 1 35
27. screws at the rear of the switch assembly 5 Remove four screws from front of switch assembly and slide switch assembly out of the front of the frame 6 Remove the front chassis and then remove the inner chassis from the outer chassis sleeve 7 Return to the procedure that directed you here Replacing the switch assembly Perform these procedures to replace the switch assembly 1 Replace the inner chassis into the outer chassis sleeve and then replace the front chassis 2 If wrap plugs are installed they can be plugged on the new switch 3 Slide switch through the front of the frame and reinstall four screws that hold the switch to the frame 4 If this is a switch drawer in F C 2030 1 and you are reinstalling the switch assembly reinstall the two screws at the rear of the switch assembly 5 Reattach all cables to rear of switch assembly 6 Turn the power switch on the switch assembly to On 1 7 Refer to 8 Return to the Edle that directed you neta 4 2 RS 6000 SP SP Switch Service Guide Removing the switch fans Note Refer to devices 9 to place the switch assembly into the service position Remove the top cover of the switch assembly by removing the screws Locate the fan Push shock mounts toward center of fan Pull fan out of the fan bracket Keep the shock mounts for mounting the new fan Disconnect fan cable from supervisor control cable DPN Replacing the switch fans Perform t
28. switch connection have customer update the switch configuration file appropriately to show wrap plugs before continuing e For code level 1 02 and higher use the Etopology command 3 Make sure processor nodes that were put in Service mode are returned to Normal mode Make sure all processor nodes are IPLed completely Have customer verify that the switch code is running on all processor nodes From the control workstation of the primary processor node type Estart followed by ENTER Check var adm SPlogs css out top file for problems regarding the jacks indicated Does out top file show a problem on any jacks indicated e If yes a Problem is in the SP Switch which is still reporting a problem b Go to Step 0620 041 on page 1 35 to service or replace the switch assembly e If no a If this is a switch to switch connection remove both wrap plugs from the SP Switches b Replace the frame to frame switch data cable that was disconnected before j to verify fix NO oS Step 0620 024 Problem reported with all switch data cables from one or more frames This may be caused by a clocking or grounding problem between frames Note The master clock switch indicates the switch which is supplying the master clock for the system the master clock will have the clock input multiplexor set to internal clock 0 A problem switch indicates any switch which is experiencing problems 1 2 Do th
29. the configuration file var adm SPlogs css out top 2 If the processor node or dependent node can be reconnected remove the wrap plug and connect the data cable between the switch assembly and the processor node or dependent node 3 If the processor node or dependent node is to be permanently removed have the customer update the switch topology 4 Link status Operational Description Link status is operational ps Z for next problem or go to to verify fix 0 Uninitialized Description Switch adapter has not been initialized Processor SPS node may not recognize adapter due to hardware failure or bad software configuration Action 1 If there are any other errors in the var adm SPlogs css out top file address those errors first Note If the problem is on all nodes in the frame check for any hardware problems with the SP Switch Also check for switch to switch problems on all jacks of the SP Switch 2 If all the nodes are uninitialized except for the primary node verify that the primary node is connected to the correct port 3 Have customer verify that the correct switch software is installed and running on this node rc switch or css_restart_node starts the fault_service_Worm_RTG_SP daemon and the fs_monitor daemon 4 Log into processor node and enter Iscfg grep css 5 If you get no result try swapping in another switch adapter Repeat step 3 If you get a result the problem is next problem or go to verify fix
30. the diskette Record the package name results for example escon cuu This will be useful if you decide to store other adapter microcode in the same directory e bffcreate t tmp microcode d dev fd0 all This will copy the data to the designated directory and update a table of contents file toc 3 NFS Export that directory to the nodes exportfs i tmp microcode 4 Either use the dsh command to control one or more nodes directly from the control workstation or telnet to each individual node Commands in following steps would be executes as in the example but without the dsh prefix Note Refer to BM RS 6000 SP Administration Guide for help on using dsh 5 dsh a umount mnt 6 dsh a mount lt control wks gt tmp microcode mnt Ti dsh a installp qacXd mnt all The all can be replaced by the individual microcode package as recorded earlier 8 dsh a umount mnt 9 exportfs u tmp microcode Chapter 3 Service procedures 3 11 To complete the microcode update it is usually necessary to remove and then replace the device from the configuration The most reliable method to do this is to reboot the node Some adapters can actually require a power off cycle to complete the microcode update Others can be updated simply by running cfgmgr Note During microcode download for SSA adapters there is a possibility that the download process could result in an error When an unrecoverable error loss of po
31. the problem Olto verify fix Step 0600 016 You found an obvious airflow blockage or abnormally high temperature source near air intakes 1 Have customer remove the switch from the active configuration and power off the SP Switch Chapter 1 Maintenance Analysis Procedures MAPs 1 9 SP Switch environment MAP 0600 2 Remove blockage 3 Ifrequired put the switch into service position refer to i pag 4 With all cables replugged and Environmental yellow LED OFF power on the SP Switch 5 Go to ESten 0600 017 to verify fix Step 0600 017 You have repaired or replaced a component 1 If necessary reinstall SP Switch cover 2 If required take the switch out of the service position refer to 3 If necessary reconnect all cables at the rear of the SP Switch 4 Put the SP Switch s circuit breaker into the On 1 position Step 0600 018 Perspectives gave you a Shutdown or Failure message and Table 1 2 on page 1 4 directed you to this step 1 One or more of the following conditions exist P33Low P33Hignh Pvolt33Range e PS1Fail e PS2Fail PSFuseGood Have the customer remove the SP Switch from the active configuration and power off the SP Switch Switch off the circuit breaker Unplug cables connected to J1 and J2 at rear of the SP Switch Remove the front panel assembly from the inner chassis Put the s
32. 0 0 on page 8 f Replace the switch clock cable or switch data cable that was just reseated in step 2 above Continue service at 3 Data cable Reseat switch data cable at switch adapter of this processor node 3 of 7 b Run advanced diagnostics on the device css0 and its associated switch port c If diagnostics fail with the previous SRN replace switch data cable d Continue service at 4 SPS Adapter a Replace the switch adapter card on this processor SP System Attachment node 4 of 7 Adapter b Reconnect all cables to the processor node c Continue service at Note Before disconnecting cables or performing service actions on an SP Attached Server perform Decouplin and coupling code for SP Attached Servers in Baao e e ge 5 System or I O planar a Run advanced diagnostics on the Base System or I O planar If a problem is detected use SRN to 5 of 7 service b Replace the system or I O planar on this processor node Make sure to reinstall all parts and cables c Reconnect all cables to the processor node d Continue service at 6 Wrap plug or Check the wrap plug or terminator again to make sure Terminator that it is not at fault 6 of 7 7 All components replaced Call next level of support 7 of 7 Chapter 1 Maintenance Analysis Procedures MAPs 1 27 SP Switch function MAP 0620 Step 0620 013 An SRN listed in indicated that you have a switch clock problem and Priority 2 in directed you to thi
33. 0655 11P0656 1624766 26H7389 46H9305 77G0961 Description SPS Switch assembly reference only SPS 8 Switch assembly reference only Inner chassis assembly SPS includes all assembly in shaded area Inner chassis assembly SPS 8 includes all assembly in shaded area Screw 6 mm Cable inner chassis Card switch supervisor Card power Chapter 5 Parts catalog 5 5 SPS SPS 8 Switch assembly feature view 3 TE E 5 6 RS 6000 SP SP Switch Service Guide Table 5 3 SPS SPS 8 Switch assembly feature view 3 Assembly Part number index e 26H7391 32G1547 46H9778 77G1079 46H9778 46H9779 51H9428 81F7977 26H7395 26H7255 31G9309 67G4985 oO NO I fF WWW WwW WwW PP Units Description SPS switch assembly reference only SPS 8 switch assembly reference only Cable front chassis Screw hex M4 x 5 Cup guide 2 01 m order kit Pin guide 2 01 m order kit Cup guide new style Pin guide new style Kit cup and pin guide new style Mount fan shock Fan assembly Chassis weld front Circuit breaker Card LED Chapter 5 Parts catalog 5 7 Switch cables feature Table 5 4 Switch cables feature Assembly Part number index 11J6091 11J6092 11J6093 11J6094 11J6095 11J6096 11J6097 11J6098 11J6099 11J6100 11J6101 11J6102 11J6103 11J6104 11J6105 11J6106 11P0006 1143975 11J3976 1143977 1143978 1143979 1143980 1
34. 15 16 13 14 11 12 9 10 D Thin Processor Node 8 Thin Processor Node 7 _ slot 8 slot 7 5 Wide Processor Node 5 _ slots 5 and 6 1 High Processor Node 1 _ slots 1 4 i j 0 Switch assembly Main power switch with LED i Fl ils slot 17 Left skirt Sj Sj Right skirt 48 V power module A B C D locations Front cover skirt and air filter removed Figure 2 1 Front view of frame locations See notes below Figure notes 1 Wide processor nodes take up an entire shelf position two thin processor node slots They are identified by the odd numbered position 2 In a F C 2030 1 frame switch assemblies take up an entire shelf partition They are identified by the even numbered position 3 Processor node slots are numbered up to N16 4 A High node or SMP High node takes up 2 shelf positions slots It is identified by the least odd number position of the occupied slots 5 Frames equipped with the SP Redundant Power Supply must have four power modules books installed in the SEPBU Chapter 2 Locations 2 3 Figure 2 3 shows a front view of the RS 6000 SP multi switch frame SP Switch Frame Locations Front View
35. 1J3981 1143982 11P0006 46G5686 46H9698 46H9699 46H9700 46H9701 Units AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR AR Description SP Switch data cables SPS Cable Switch data node 01 1345 mm Cable Switch data node 02 1040 mm Cable Switch data node 03 1320 mm Cable Switch data node 04 1400 mm Cable Switch data node 05 1675 mm Cable Switch data node 06 1370 mm Cable Switch data node 07 1700 mm Cable Switch data node 08 1780 mm Cable Switch data node 09 1780 mm Cable Switch data node 10 1855 mm Cable Switch data node 11 2110 mm Cable Switch data node 12 1805 mm Cable Switch data node 13 2110 mm Cable Switch data node 14 2165 mm Cable Switch data node 15 2165 mm Cable Switch data node 16 2310 mm Cable Switch data node xx 2615 mm SP Switch Data Cables SPS 8 Cable Switch data node 01 1030 mm Cable Switch data node 02 1640 mm Cable Switch data node 03 1800 mm Cable Switch data node 04 2200 mm Cable Switch data node 05 2210 mm Cable Switch data node 06 2340 mm Cable Switch data node 07 2060 mm Cable Switch data node 08 2260 mm Cable Switch data node xx 2615 mm SP Switch External Cables SPS SPS 8 Cable Switch power Cable S S external 5 m Cable S S external 10 m Cable S S ex
36. 1J6100 1196101 11J6102 11J6103 11J6104 11J6105 11J6106 11P0006 11P0655 11P0656 1624766 1624776 17H5026 2113932 26H7198 26H7212 26H7255 26H7389 26H7391 26H7395 31G9309 32G1547 46G5686 46H9220 46H9305 46H9698 46H9699 46H9700 46H9701 46H9778 46H9779 49 inch frame locations 2 4 Copyright IBM Corp 1999 2002 5 8 5 8 5 11 5 11 5 11 5 11 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 8 5 5 5 5 5 5 5 11 5 11 5 11 5 11 5 3 5 3 5 7 5 5 5 7 5 3 5 7 5 7 5 7 5 3 5 8 5 3 5 3 5 5 5 8 5 8 5 8 5 8 5 7 5 7 A active configuration removing a switch assembly from the 3 7 active configuration restoring a switch assembly to 3 8 adapter microcode packages installing 3 10 adapters SP switch Service Request Number SRN table 1 24 assembly naming standard 2 2 audience of this book xv authentication kerberos 3 1 C cables switch 5 8 clock source determining the correct switch 3 6 clock source selecting the switch 3 6 component connector details 2 9 components failing SRNs Service Request Numbers 1 24 connector location naming standard 2 2 css snap script 3 3 D determining the correct switch clock source 3 6 E ESD procedures 3 1 requirements 3 1 external cable routing 2 11 F failing components SRNs Service Request Numbers 1 24 feature switch cables 5 8 female wrap plug 1 4 fencing nodes 3 8 files overview serv
37. Any reference to an IBM product program or service is not intended to state or imply that only that IBM product program or service may be used Any functionally equivalent product program or service that does not infringe any IBM intellectual property right may be used instead However it is the user s responsibility to evaluate and verify the operation of any non IBM product program or service IBM may have patents or pending patent applications covering subject matter described in this document The furnishing of this document does not give you any license to these patents You can send license inquiries in writing to IBM Director of Licensing IBM Corporation North Castle Drive Armonk NY 10504 1785 U S A The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION AS IS WITHOUT WARRANTY OF ANY KIND EITHER EXPRESS OR IMPLIED INCLUDING BUT NOT LIMITED TO THE IMPLIED WARRANTIES OF NON INFRINGEMENT MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE Some states do not allow disclaimer of express or implied warranties in certain transactions therefore this statement may not apply to you This information could include technical inaccuracies or typographical errors Changes are periodically made to the information herein these changes will be incorporated in new editions of the
38. Class A product In a domestic environment this product may cause radio interference in which case the user will be required to take adequate measures Radio protection for Germany Dieses Ger t ist berechtigt in bereinstimmung mit Dem deutschen EMVG vom 9 Nov 92 das EG Konformit tszeichen zu f hren Der Aussteller der Konformit tserkl rung ist die IBM Germany Dieses Ger t erf llt die Bedingungen der EN 55022 Klasse A F r diese von Ger ten gilt folgende Bestimmung nach dem EMVG Ger te d rfen an Orten f r die sie nicht ausreichend entst rt sind nur mit besonderer Genehmigung des Bundesministers f r Post und Telekommunikation oder des Bundesamtes f r Post und Telekommunikation betrieben werden Die Genehmigung wird erteilt wenn keine elektromagnetischen St rungen zu erwarten sind Auszug aus dem EMVG vom 9 Nov 92 Para 3 Abs 4 Hinweis Dieses Genehmigungsverfahren ist von der Deutschen Bundespost noch nicht ver ffentlicht worden Notices A 3 A 4 RS 6000 SP SP Switch Service Guide Index Numerics 0375867 5 11 04H9469 5 11 08J5557 5 11 11J3975 5 8 11J3976 5 8 11J3977 5 8 11J3978 5 8 11J3979 5 8 11J3980 5 8 51H9428 54G2877 54G2947 54G3281 67G4985 77G0818 77G0961 77G1079 81F7977 84X4841 93G1065 5 7 5 11 5 11 5 11 5 7 5 3 5 5 5 7 5 3 5 11 11J3981 11J3982 11J5189 11J5191 11J5193 11J5195 11J6091 11J6092 11J6093 11J6094 11J6095 11J6096 11J6097 11J6098 11J6099 1
39. EStep 0600 021 Step 0600 021 With connectors SOO PC P4 SOO PC P6 disconnected the measured resistance between pins 12A and 12B is outside the specified range 1 Remove power supply card PS1 refer to 2 Repeat the resistance measurement from 3 Is the measured resistance in the acceptable range e If yes e If no go to Step 0600 0221 Step 0600 022 Connectors S00 PC P4 and S00 PC P6 are disconnected and you removed power supply card PS1 but the measured resistance between pins 12A and 12B is still outside the specified range 1 If this is the first time through this step a Remove power supply card PS2 Q and repeat the resistance measurement 2 If you have already been through this step a Replace the switch assembly Go to Step 0600 023 You arrived at this location from i a PSxFail or PSxFuseGood problem 1 If you have a PSFuseGood problem go to 2 Ifyou have a PS1Fail problem go to 3 If you have a PS2Fail problem go to Step 0600 0241 Step 0600 024 You have a PS2Fail problem 1 Replace power supply card PS2 2 Reinstall the SPS front panel assembly being careful to align the guide pins on the P1 to the inner chassis cup guide 3 Replug the power cable J1 and supervisor cable J2 to the rear of the assembly 4 Put the circuit breaker in the On 1 position 5 Does the PS2Fail condition still exist e If yes go to e If no go to Chapter 1 Maintenance Analysis Procedu
40. FCC emission limits IBM is not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment Unauthorized changes or modifications could void the user s authority to operate the equipment This device complies with Part 15 of the FCC Rules Operation is subject to the following two conditions 1 this device may not cause harmful interference and 2 this device must accept any interference received including interference that may cause undesired operation European Union EU statement This product is in conformity with the protection requirements of EU Council Directive 89 336 EEC on the approximation of the laws of the Member States relating to electromagnetic compatibility The manufacturer cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non recommended modification of the product including the fitting of option cards supplied by third parties Consult with your dealer or sales representative for details on your specific hardware This product has been tested and found to comply with the limits for Class A Information Technology Equipment according to CISPR 22 European Standard EN 55022 The limits for Class A equipment were derived for commercial and industrial environments to provide reasonable protection against interference with licensed communication equipment Atten
41. H J10 E00 N06 BH PA 46H9716 E00 S00 BH J24 E00 N07 BH PA 46H9717 E00 S00 BH J23 E00 N08 BH PA 46H9718 E00 S00 BH J31 E00 N09 BH PA 46H9719 E00 S00 BH J32 E00 N10 BH PA 46H9720 E00 S00 BH J18 E00 N11 BH PA 46H9721 E00 S00 BH J17 E00 N12 BH PA 46H9722 E00 S00 BH J33 E00 N13 BH PA 46H9723 E00 S00 BH J34 E00 N14 BH PA 46H9724 E00 S00 BH J16 E00 N15 BH PA 46H9725 E00 S00 BH J15 E00 N16 BH PA Note PA refers to connector on SPS adapter Note For external frame to frame cable locations see SPS B data cables describes the attachment locations and routing for the internal SPS 8 Switch data cables Table 2 3 SPS 8 Switch data cable chart Cable Part Plug from Plug to Number Location Location 46G9690 E00 S00 BH J7 E00 NO1 BH PA 46G9691 E00 S00 BH J8 E00 NO2 BH PA 46G9692 E00 S00 BH J26 E00 N03 BH PA 46G9693 E00 S00 BH J25 E00 N04 BH PA 46G9694 E00 S00 BH J9 E00 NO5 BH PA 46G9695 E00 S00 BH J10 E00 NO6 BH PA 46G9696 E00 S00 BH J24 E00 NO7 BH PA 46G9697 E00 S00 BH J23 E00 N08 BH PA Note PA refers to connector on SPS adapter Chapter 2 Locations 2 13 2 14 RS 6000 SP SP Switch Service Guide Chapter 3 Service procedures Personal ESD requirements Tools and files overview Using the css snap script Switch supervisor self test Verification tests using Perspectives Node supervisor verification Frame supervisor verification Switch supervisor verification Selecting appropr
42. K a koa oOo ro e w o 29 Cable routing ee 2 2D Cable routing in a multi switch frame FIC 2030 1 peor ae ae He a es Shoe Ae ae eee Switch data cables 22 a a a a a a a a a a 218 SPS data cables 2 a 4 s u a u u We doa oa Woe oa ao a a wa oa Ae aoa e el SPS 8 data cables 0a a a a a a aa a a a 218 Naming standard for RS 6000 SP components The purpose of this section is to define a naming standard for all components in the RS 6000 SP system This standard provides a consistent logical naming convention system necessary for documentation including details assembly drawings schematics manufacturing documents service documents and customer publications Format structure The RS 6000 SP system is structured in a modular fashion with different levels of assembly which can be independently described These levels are 1 System level 2 Frame level 3 Major assembly level e g processor node 4 Sub Assembly level e g cards fan assembly The format structure is used to individually identify any connection location at any level in the assembly The main use of this format is to describe connector cabling and schematic locations shown in tables and diagrams throughout this manual Example of format structure Format FRAME WWW MAJOR ASSEMBLY XXX SUBASSEMBLY YY CONNECTOR NUMBER ZZZZ ar WWW 1st character is the frame type E for RS 6000 SP frame L for logical RS 6000 SP frame used for models 30X and
43. ON The unit weight exceeds 18 Kg 40 Ibs and requires two service personnel to lift SPSFCOO2 Attention Components in the frame are susceptible to damage from static discharge Always use an ESD wristband when working inside frame covers See more details Do not touch the pins or circuitry on these components Note There are different levels of the switch assembly hardware The following lists indicate serviceable FRUs for each level of switch assembly SPS Fans circuit breaker LED card switch supervisor card switch power card s fan assembly cable supervisor power cable complete assembly SPS 8 Fans circuit breaker LED card switch supervisor card switch power card s fan assembly cable supervisor power cable complete assembly Removing the switch assembly Perform these procedures to remove the switch assembly 1 Ensure that the switch is offline shutdown from the control workstation and powered off from the control workstation Attention Removing power from one switch assembly may affect other switch assemblies and processor nodes attached to it 2 Turn the power switch on the switch assembly to Off 0 3 Detach all cables from rear of switch assembly noting where external frame cables are attached If wrap plugs are installed remove them and save for the new switch 4 If this is a switch drawer in multi switch frame F C 2030 1 and you are replacing the switch assembly remove the two
44. PSFCO18 Laser safety information The RS 6000 SP might contain certain communication adaptors such as ESCON or FDDI which are fiber optic based and use lasers Laser Compliance All lasers are certified in the U S to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products Outside the U S they are certified to be in compliance with the IEC 825 first edition 1984 as aclass 1 laser product Consult the label on each part for laser certification numbers and approval information Environmental notices Product recycling and disposal This product contains materials such as circuit boards cables electromagnetic compatibility gaskets and connectors which might contain lead and copper beryllium alloys that require special handling and disposal at end of life Before this unit is disposed of these materials must be removed and recycled or discarded according to applicable regulations IBM offers product return programs in several countries You can find country specific instructions at www ibm com ibm environment products prp phtml Safety and environmental notices Xili This product might contain nickel cadmium or lithium batteries in communication adapters The batteries must be recycled or disposed of properly Recycling facilities might not be available in your area In the United States IBM has established a collection process for reuse recycling or proper disposal of used sealed lead acid nickel cadmium
45. RA G1 Chapter 2 Locations 2 7 Switch assembly locations Ji J2 Supervisor Bus Card wer u Switch Planar oO gt a AIF Barrle Power Supervisor Power Supply 2 Card Supply 1 P5 P6 P4 Fan 5 Fan 4 Fan 3 Fan 2 Fan 1 AAN CB amp LED Card Figure 2 5 SPS SPS 8 assembly 2 8 RS 6000 SP SP Switch Service Guide Connector details Figure 2 d shows RS 6000 SP component connector details 2x1 power 3 j 2 1 oo 3x3 power 3x1 power 3x4 power 4 j 3 2 1 3x5 power 4x1 power A1 A2 A3 4 8x1 power 3 pin power Key 123 23 24 25 00000 Y zs 0000 9 pin D shell 2x25 Berg Key 123 4 111213 VIIBODOODODOD Wa amp WB ODODOOOODOOBS 25 pin D shell 2x13 Berg 123 23 24 25 4321 4x1 DASD power 50 pin SCSI Figure 2 6 RS 6000 SP connector deta
46. RS 6000 SP E SS SFE SP Switch Service Guide GA22 7443 04 RS 6000 SP E SS SFE SP Switch Service Guide GA22 7443 04 Note Before using this information and the product it supports read the information in bage bl and MaCs on nage Aci Fifth Edition April 2002 This book replaces GA22 7443 03 IBM welcomes your comments A form for readers comments may be provided at the back of this publication or you may address your comments to the following address International Business Machines Corporation Department 55JA Mail Station P384 2455 South Road Poughkeepsie NY 12601 5400 United States of America FAX United States amp Canada 1 845 432 9405 FAX Other Countries Your International Access Code 1 845 432 9405 IBMLink United States customers only IBMUSM10 MHVRCFS Internet e mail mhvrcfs us ibm com If you would like a reply be sure to include your name address telephone number or FAX number Make sure to include the following in your comment or note e Title and order number of this book e Page number or topic related to your comment When you send information to IBM you grant IBM a nonexclusive right to use or distribute the information in any way it believes appropriate without incurring any obligation to you Copyright International Business Machines Corporation 1999 2002 All rights reserved US Government Users Restricted Rights Use duplication or disclosure restricte
47. Shutdown Voltage Log Shutdown Fan e Log Shutdown Temperature Errpt Warning e Log Warning Environment Yellow LED on or flashing 2 of 3 Power problem e Power Green LED off or flashing e Switch has no power All voltages at zero 3 of 3 Switch function problem Error s in var adm SPlogs css out top see note e Problem s with switch_responds e Problem s returned from errpt a N Worm e Problem s detected by CE customer Notes 1 out top is located in the primary node To find the primary node issue Eprimary 2 Verify that the file has a valid recent time stamp using Is 1 P N 46H9688 D amp P N 77G0818 Female wrap plug Figure 1 2 SP Switch wrap plugs Notes 1 SPS advanced diagnostics use the 10 meter data cable provided by the SPS feature bill of material 2 Advanced cable wrap tests will not run successfully for 15 and 20 meter SPS data cables If 15 or Male wrap plug 20 meter data cables are used swap change the suspected cable 1 4 RS 6000 SP SP Switch Service Guide SP Switch environment MAP 0600 SP Switch environment MAP 0600 Purpose of this MAP This MAP provides diagnostic information for switch problems that are related to the operating environment Note Refer to on page for removing the switch from the
48. agnostics in a processor node in 7 Look for an SRN indicating a clock problem such as External clock being listed as one of the otiia components Note Do not exit from diagnostics at this time as it may be used in the following steps 8 Do the css0 diagnostics fail with an indication of a clock problem e If yes the clock problem is resolved Go to to verify fix e If no go to Step 0620 041 on page 1 35 to replace the problem switch assembly Step 0620 036 The css0 diagnostics failed at Step 0620 033 on page 1 33 and indicated a clock problem 1 Record current clock setting for the switch internal external 2 From the control workstation select the internal clock for this switch assembly 3 Run advanced diagnostics on css0 on the test processor node and its associated switch port e Do not perform cable wrap test e Refer to Running diagnostics in a processor node in 4 Look for an SRN indicating a clock problem such as External clock being listed as one of the Sailing componenis 5 Do the css0 diagnostics fail with indication of a clock problem e If yes go to ae If no go to Step 0620 037 You have run the cssO diagnostics 1 At the SP Switch check the data cable interposer if present and the switch data cable corresponding to the original external clock selection for any obvious problem such as a loose cable connection or swapped cables
49. an control cable 4 4 Removing the SP Switch supervisor card 4 5 Removing the SP Switch inner chassis cable 4 6 Removing the SP Switch power supply cards Copyright IBM Corp 1999 2002 2 12 4 7 vi _RS 6000 SP SP Switch Service Guide Tables 2 D oo oo oo I N 2 PAO oe aN Switch problem diagnostics Environmental messages for switches Switch connector resistance values Fan failure diagnostics Inner chassis cable continuity Front chassis cable continuity SP Switch error conditions Problem messages from the out top file eb Service Request Number SRN table for SP Switch adapters Switch problem priority listing External cable routing SPS Switch data cable chart SPS 8 Switch data cable chart Service procedure tools Setup output files Tuning output files f Setting switch clock sources ah SPS SPS 8 Switch assembly feature view 1 SPS SPS 8 Switch assembly feature view 2 SPS SPS 8 Switch assembly a view 3 Switch cables feature un an E Multi switch frame F C 2030 1 Copyright IBM Corp 1999 2002 vii viii RS 6000 SP SP Switch Service Guide Safety and environmental notices For general information concerning safety refer to Electrical Safety for IBM Customer Engineers S229 8124 For a copy of the publication contact your IBM account representative or the IBM branch office serving your locality Safety notices The followin
50. and nickel metal hydride batteries and battery packs from IBM equipment For information on proper disposal of batteries in this product please contact IBM at 1 800 426 4333 For information on disposal of batteries outside the United States contact your local waste disposal or recycling facility XiV _RS 6000 SP SP Switch Service Guide About this book This book is part of the RS 6000 SP hardware service library and applies to the RS 6000 SP Switch Use this book to assist you in performing the following tasks e Identify field replaceable unit FRU locations e Isolate RS 6000 SP failures using Maintenance Analysis Procedures MAPS e Perform diagnostic service procedures e Perform removal and replacement procedures e Identify FRUs and their corresponding part numbers If you are attempting to isolate an SP s Siem failure use zie Maintenance esi Procedures MAPs beginning with the Start MAP in complete RS 6000 SP hardware service library see Who should use this book This book is intended for RS 6000 SP product trained service personnel Related information The following books make up the complete RS 6000 SP hardware service library RS 6000 SP Safety Information GA22 7467 Safety notices in English and translated into other national languages which are compiled from all the book in the library elocatiorl GA22 7441 Installation and relocation procedures maintenance agreement and qualification procedures f
51. at the problem is not in this SP Switch 1 Return all cables to their original positions 2 Return to Step 0620 040 The clock settings on the problem switch appear to be okay and the problem switch is mounted in a frame that also contains processor nodes The ground straps appeared to be okay so you set up a test processor node and went through the cssO advanced diagnostics The diagnostics passed on the first run through so you disconnected the switch data cable and ran the diagnostics a second time On the second run the diagnostics failed This indicated that the problem was not in the SP Switch 1 Make certain that you replaced the switch data cable corresponding to the selected clock at both SP Switches 2 Run advanced diagnostics on cssO on the test processor node and its associated switch port e Do not perform cable wrap test e Refer to Running diagnostics in a processor node in 3 Look for an SRN indicating a clock problem such as External clock being listed as one of the failing components 4 Do the css0 diagnostics fail with indication of a clock problem e If yes go to Step 0620 0411 Replace the master clock SP Switch e If no a Interframe clock problem resolved 6 to verify fix Step 0620 041 You have determined that there is a problem in the SP Switch and that you need to replace the switch assembly 1 Put the SP Switch s circuit breaker into the O
52. ble failed the continuity test 1 Replace inner chassis cable 2 Return to Step 0610 012 An front chassis cable failed the continuity test 1 Ensure that the circuit breaker is in the On 1 position 2 Check for continuity between the tabs of the circuit breaker 3 Is there continuity e If yes 4j to verify the replacement cable 1 16 RS 6000 SP SP Switch Service Guide SP Switch power MAP 0610 a Replace the front chassis cable b Return to 4 to verify the replacement cable e If no a Replace the circuit breaker b Return to 4 to verify the replacement breaker Step 0610 013 You have continuity on all cables 1 If this is the first time through this step replace the supervisor card SP Switch function MAP 0620 Purpose of this MAP This MAP provides diagnostic information for problems related to the SP Switch function Notes 1 Refer to Appendix A in cables Logical frame to physical frame iranclation iniormation can also gt found i in thig spendin service position Step 0620 001 A message in Perspectives indicated that you have a switch function problem and Table 1 1 on page 1 4 directed you to this MAP Note If out top does not match the physical switch configuration run Eannotator Refer to Parallel System Support Programs for AIX Installation and Migration Guide GA22 7347 1 To complete the following procedures you will need to have primary node r
53. cable to your tasks OOOOU0 Please tell us how we can improve this book Thank you for your responses May we contact you Yes No Dissatisfied Dissatisfied Very Dissatisfied Very Dissatisfied When you send comments to IBM you grant IBM a nonexclusive right to use or distribute your comments in any way it believes appropriate without incurring any obligation to you Name Address Company or Organization Phone No Readers Comments We d Like to Hear from You GA22 7443 04 Fold and Tape Please do not staple Fold and Tape IBM Corporation Department 55JA Mail Station P384 2455 South Road Poughkeepsie NY 12601 5400 PLACE POSTAGE STAMP HERE Fold and Tape GA22 7443 04 Please do not staple Fold and Tape Cut or Fold Along Line Cut or Fold Along Line GA22 7443 04
54. ce Guide SP Switch function MAP 0620 Step 0620 033 All ground straps connecting the frames make adequate contact at both ends 1 Disconnect clock source data cable at SOO BH J3 J5 J7 or J9 then reconnect to SP Switch This is done to eliminate clocking noise from cable 2 Find a processor node in the problem frame which is usable for service This will be used as the test processor node 3 Run advanced diagnostics on cssO on the test processor node and its associated switch port Do not perform cable wrap test e Refer to Running diagnostics in a processor node in 4 Look for an SRN indicating a clock problem such as External clock being listed as one of the failing components Note Do not exit from diagnostics at this time as it may be used in the following steps 5 Do the css0 diagnostics fail with an indication of a clock problem e If yes go to e If the diagnostics did not fail with an indication of a clock problem a If this the first time at this step go to Step 0620 0341 b Ifthis the second time through this step go to Step 0620 034 The first time you went through Step 0620 033 the diagnostics did not fail with an indication of a clock problem 1 Disconnect the switch data cable corresponding to the selected clock at the SP Switch 2 Run advanced diagnostics on cssO on the test processor node and its associated switch port e Do not perf
55. ces are now available for customer use If switch re initialization was successful the customer can start running parallel jobs again Return to MAPS to continue service action s Restoring a switch assembly to the active configuration OaPPon gt 10 11 Identify the switch assembly which is to be restored to the active configuration Have customer stop all current parallel jobs and suspend all parallel jobs on the job queue Make sure that the circuit breaker at the front of the switch assembly in the Off 0 position Connect all cables to the switch assembly Pay attention to the labels on the cables Put the circuit breaker at the front of the switch assembly in the On 1 position Use switch front panel or the Eclock command to select the appropriate clock input for this switch assembly For SSP code level 1 02 and higher use the Eclock r command If the primary processor node was changed during the isolation procedure the customer can change the primary processor node back to the original selection however this is NOT required The primary node is set by the Eprimary command Refer to BM RS 6000 SP Administration Guide for more information If the master clock was changed during the isolation procedure the customer can select the clocks on all other switches however this is not suggested The previous clock selection for this switch assembly was recorded for possible use See The customer can re
56. chassis or supervisor power cable need to be replaced a Detach all cables from the rear of the switch assembly noting where the external frame cables and wrap plugs are attached b Remove the inner chassis from the outer chassis sleeve by pushing on the inner chassis from the rear of the switch assembly Replacing a switch assembly from service position 1 If the inner chassis or supervisor power cable needed to be replaced a Replace the inner chassis into the outer chassis sleeve b Reattach all cables to the rear of the switch assembly 2 Reinstall the SPS front panel assembly into the inner chassis being careful to align the guide pins of the P1 connector to the inner chassis weld Then apply light even pressure at the sides of the front panel assembly 3 Reinstall the four screws into the front of the switch assembly 4 Reattach power J1 and supervisor J2 cables to the rear of switch assembly 5 Set circuit breaker in the On 1 position Resetting the clock and bootlist after servicing a node When servicing a node the node becomes disconnected from its power source for a period of time Since nodes normally do not have a real battery the NVRAM will loose its memory when disconnected from power for about 10 minutes Sometimes less This will cause the date to be reset to January 1 1970 and the bootlist to be cleared This can cause some problems with booting It is highly recommended to reset the clock and bootl
57. clude the resource again If the service action is expected to be complete in a short period of time for example two hours or less this additional interruption of all parallel jobs will probably cost the customer more time than was saved by use of the switch feature during that short period Removing a switch assembly from the active configuration 1 Identify the switch which is to be removed from the active configuration for an extended period of time Displa ihe clock selection for this switch assembly and record it for later use See FSalecting 2 Ifthe primary processor node usually in Frame 1 is connected to the switch identified in step 1 the customer must select an available processor node to be the new primary processor node The primary node is set by the Eprimary command Refer to BM RS 6000 SP Administration Guide for more information 3 If the master clock selected for the switch is from the switch assem identified in Step 1 the customer must select a new master clock Refer to 4 Have customer stop all current parallel jobs and suspend all parallel abe on the job queue 5 Put circuit breaker at the front of switch assembly in the Off 0 position Chapter 3 Service procedures 3 7 The customer can re initialize the switch using the Estart command The frame and processor nodes which were removed in this procedure will appear in the out top file with error messages however the remainder of the switch resour
58. code and your service representative can search AIXTOOLS for the latest version of Adapter Microcode 3 10 RS 6000 SP SP Switch Service Guide The following 3 adapters require functional microcode to be installed Adapter Package ESCON Control Unit Adapters Feature 2756 ESCON BLKMUX S 370 Control Unit Feature 2755 BLKMUX FDDI Adapters Features 2723 2724 2725 2726 FDDI These adapters might need updating to the latest level in their FLASH EPROM Adapter Package SSA Adapters Features 6214 6216 6217 7133 Drives SSAFLASH SCSI Adapters Features 2412 2415 2416 ECA192 Note The ECA192 instructions differ from the above and are included with the ECA192 Package Note This procedure is similar to that used for performing software updates PTF s to SP nodes You can Refer to Performing Software Maintenance in Parallel System Support Programs for AIX Installation and Migration Guide GA22 7347 for a general idea of how to perform the installation Locate the diskette either shipped with your adapter or obtained from the TOOLS disk 2 Copy the adapter microcode to a temporary directory on the control workstation a Insert the diskette in the control workstation diskette drive b Log on as root c Select a name in a temporary directory to store the microcode image such as tmp microcode or _ tmp escon d bffcreate 1 d dev fd0 This will list the contents of
59. curred while running the SPS Adapter diagnostics Note An x in an SRN indicates a don t care digit Step 0620 012 An SRN was obtained and Table 1 9 on page 1 24 was used to determine the list of possible failing components 1 From the list of possible failing components use the following prioritized table Table 1 10 on to perform service 1 26 RS 6000 SP SP Switch Service Guide Table 1 10 Switch problem priority listing SP Switch function MAP 0620 Priority Failing component Action 1 Software a Have customer verify that the software is configured and operating correctly for this processor 1 of 7 node system b If no problem is found continue with next highest priority item in the list for this SRN c Otherwise power off this processor node and continue service at 3 2 External clock a Check to see if a switch clock cable is connected at switch adapter of this processor node 2 of 7 b If clock cable exists reseat switch clock cable otherwise reseat switch data cable c Run advanced diagnostics on the device css0 and its associated switch port d If diagnostics fail with the previous SRN check at least one other processor node for indication of a clock problem If necessary run advanced diagnostics on this other node using the device cssO and its associated switch port e If more than one node has a clock problem go to en 062
60. d automatically from the fault service daemon when certain serious errors are detected It can also be issued from the command line by service personnel when a switch or adapter related problem is indicated Switch supervisor self test The following procedures will help you perform self test on the switch supervisor cards Upon completion of this test return to the procedure that sent you here 1 Power off switch assembly from the circuit breaker on the front of the unit 2 Detach supervisor harness from connector at back of the unit Detaching the supervisor harness removes the 12 volt power from the supervisor card 3 Reinsert the supervisor harness to perform the supervisor card self test 4 Check green and yellow LEDs on front of the unit This self test should indicate one of the following conditions for the processor node m Self test Conditions Pass sequence Both LEDs light about 10 seconds Green LED stays lit while yellow LED goes off about two seconds Green LED stays lit while yellow LED flashes node address Both LEDs turn off about two seconds Both LEDs light about one second oaoop Fail conditions e Green and Yellow LEDs never light e Yellow LED flashes wrong address Verification tests using Perspectives This section gives you the basic information needed to check the supervisor cards on a single node frame or switch Node supervisor verification From the Hardware Perspectives windo
61. d by GSA ADP Schedule Contract with IBM Corp Contents Figures Tables Safety and environmental notices Safety notices Danger notices Caution notices j Laser safety information Environmental notices Product recycling and disposal About this book Who should use this book Related information How to send your comments Summary of changes GA22 7443 04 GA22 7443 03 GA22 7443 02 GA22 7443 01 GA22 7443 00 Chapter 1 Maintenance Analysis Procedures MAPS SP Switch MAPs SP Switch description and problem determination MAP 0590 SP Switch environment MAP 0600 SP Switch power MAP 0610 SP Switch function MAP 0620 Chapter 2 Locations Naming standard for RS 6000 SP components Format structure Location diagrams of the RS 6000 SP components Front and rear views of RS 6000 SP frame Frame locations Switch assembly locations Connector details Cable routing f Cable routing in a multi switch frame FIC 2030 1 Switch data cables bu te dn a a Chapter 3 Service procedures Personal ESD requirements Tools and files overview Using the css snap script Switch supervisor self test Verification tests using Perspectives Node supervisor verification Frame supervisor verification Switch supervisor verification Selecting appropriate switch clocks Copyright IBM Corp 1999 2002 Selecting the switch clock source Determining the correct switch clock s
62. d configurations therefore be careful about relying on node and frame information Any connector jack numbers will be correct regardless of the customer s cable configuration so you may trace the cabling from the jack number if necessary 1 Have customer complete all active parallel jobs or jobs using switch interface for example tape B U ADSM then remove ie switch feature from the active configuration Refer to RETTE 10 li for these procedures 2 Check for system monitor errors indicating environmental problems with the switch feature These can be viewed by issuing the appropriate command from the control workstation errpt a N sphwlog pg For SSP code levels 1 02 and aE e Refer to the Starting a service call MAP 0100 in information on the pg command Note In a frame with processor nodes entries for the switch will refer to node17 or slot17 In a multi switch frame switches will be listed as even slot addresses 3 Determine the SP Switch type see page I 4 Refer to Table 1 1 on page 1 4 for a prioritized list of conditions Find the first condition that describes your problem then perform the associated action Chapter 1 Maintenance Analysis Procedures MAPs 1 3 Switch description and problem determination MAP 0590 Table 1 1 Switch problem diagnostics Priority Message or condition Action 1 1 of 3 Environmental problems Errpt Failure e Log
63. dition 4 Processor nodes and SP Switches receive 48 volt power from one of five power harnesses The sets are as follows PDU BH P1 Frame slots 1 2 3 4 PDU BH P2 Frame slots 5 6 7 8 PDU BH P3 Frame slots 9 10 11 12 PDU BH P4 Frame slots 13 14 15 16 PDU BH P5 Frame slot 17 5 Isthe Power green LED on other processor nodes or SP Switches On or Flashing e If yes go to Step 0610 0081 e If no go to Main power MAP 0450 in Step 0610 008 You found that the Power green LED on other processor nodes or switch assemblies is On or Flashing 1 If there is only one SP Switch on the dc power harness skip the next item and answer Yes to the question below 2 Check all other SP Switches on the same dc power harness as the failing SP Switch and look for the same symptom e Look for the circuit breaker on but the Power green LED is not lit 3 Is this the only SP Switch showing this symptom e If yes go to Sten 0610 0091 e If no you have a problem with 48 V dc power distribution a Go to Open in 48 V de distribution MAP 0560 in Step 0610 009 Only one SP Switch is showing the circuit breaker on but the Power green LED is not lit 1 Check cable connections at the frame power unit see and at the SP Switch bulkhead BH J1 for good connections 2 Are the connections good Chapter 1 Maintenance Analysis Procedures MAPs 1 15 SP Switch power MAP 0610 e If yes a Have the
64. e 1 Manual method for any SP system From an AIX window enter spmon G m i framex spmon G m 1 framex spmon G m 2 framex spmon G m 3 framex 2 Topology file method a From an AIX window enter Eclock The display will tell you the current clock topology b To select an alternate clock topology enter Eclock a filename Refer to IBM RS 6000 SP Command and Technical Reference for more information on the correct file name to use Determining the correct switch clock source All switch assemblies in the system must run off the same clock source The following procedure describes how to determine the correct clock settings for each switch assembly based on standard cabling configurations 1 Determine the number of logical frames in the system 2 Determine which clock choice to use for the system 3 Locate the appropriate box in able 3 4 on page a using the number of logical frames and the master clock choice 4 Based on information in this box set the switch clock source for each logical frame Lxx represents the logical frame number Sxx represents a multi switch frame i or 0 1 2 or 3 indicates the appropriate clock setting for that logical frame iorO Internal Clock SPS 3 6 RS 6000 SP SP Switch Service Guide 1 Input 1 BH J3 for SPS 2 Input 2 BH J4 for SPS 3 Input 3 BH J5 for SPS Table 3 4 Setting switch clock sources Number of Logical Master Clock Choice
65. e clock settings on the problem SP Switches appear to be okay e If yes go to Step 0620 0251 e If no a Change the clock selections on the appropriate switches refer to to verify fix Step 0620 025 The clock settings on the problem SP Switches appear to be okay 1 Is the problem switch in a frame with processor nodes 2 e If yes go to e If no go to Sten 0620 0261 Step 0620 026 Problem switch is in a switch frame contains switches only no processor nodes 1 From Perspectives on the control workstation open the Switch Front Panel window for this switch 2 Check the clock selection to see if it is set to Internal or to one of the following external clocks e For SP switch types Jack3 Jack4 or Jack5 3 Is the clock selection set to one of the External clocks 1 30 RS 6000 SP SP Switch Service Guide SP Switch function MAP 0620 e If yes go to Step 0620 0271 e If no a The problem switch is providing the master clock for the system b Have customer use the Eclock command to select a different master clock for the system c Go to A to verify fix Step 0620 027 The clock selection is set to one of the External clocks 1 Select a different external clock for this SP Switch 2 From the control workstation of the primary node reinitialize the switch by entering Estart 3 Check var adm SPlogs css out top file to see if problems are still reported on all data cable
66. er is open the service representative and all people in the area must be ESD safe If power is switched on or if removing or un any FRU always use the ESD kit part 93F2649 Put on the ESD wrist strap Attach the ESD cable to the wrist strap Attach the ESD mat to the wrist strap if required Attach the insulated clip to the ESD cable arwn gt grounding point on the frame Tools and files overview Attach the insulated clip to the frame holes labeled ESD If the frame holes are not available use a The following three tables provide an overview of the tools setup files and tuning files used in the service procedures Copyright IBM Corp 1999 2002 Table 3 1 Service procedure tools Utility see note Runs on Description fault_service_Worm_RTG All nodes Monitors the switch for faults It restarts the switch if a fault is detected fs_monitor All nodes Monitors the adapter for interrupts that have not been serviced This is required because of a hardware oversight in MSMU in which there is a window in time that causes it to hold interrupts and then forget to introduce them rc switch All nodes Starts the daemons and IP configuration tools Estart Primary or Tunes the switch and puts it into run phase Also kicks Control Node off the route table generator and distributes the routes to the nodes by placing them in the etc SP directory on the control node Estart_sw Primary or T
67. er unit must have a power module in position D right most slot For N 1 operation a power module may be installed in position C next to slot D There are no skirts on the 49 inch frame A High node or SMP High node takes up 2 shelf positions slots It is identified by the least odd number position of the occupied slots The switch assembly is not available in the 1 4 m frame shows a rear view of the RS 6000 SP frame locations Chapter 2 Locations 2 5 SP frame locations rear view Processor node 8 slot 8 Processor node 7 slot 7 RF shunt assembly PCI card Frame supervisor card Right skirt Left skirt Testpoints AC power cable Rear power LED Figure 2 4 Rear view of frame locations Note See notes under Figure 2 1 on page 2 3 for processor node switch assembly numbering Frame locations shows a front view of the RS 6000 SP frame locations with numbered processor nodes and the three phase SEPBU Frame FRA This list shows the designations specifically for the RS 6000 SP frame G1 Right hand rear ground G2 Left hand rear ground G3 PDU ac ground G4 PDU dc ground G5 Input cable ground 2 6 RS 6000 SP SP Switch Service Guide G6 G7 G8 SW LD FC RC Front door ground Rear door ground Ground Power on switch LED card Front cover Rear cover Example 01 F
68. evice Use one hand when possible to connect or disconnect signal cables to prevent a possible shock from touching two surfaces with different electrical potentials During an electrical storm do not connect cables for display stations printers telephones or station protectors for communications lines SPSFDO02 DANGER In the U S Canada and Japan this product has a 4 wire power cable with a 4 prong plug Use this power cable with a correctly grounded power receptacle to prevent possible electric shock SPSFDO003 Copyright IBM Corp 1999 2002 ix DANGER Before you connect the power cable of this product to ac power verify that the power receptacle is correctly grounded and has the correct voltage SPSFD004 DANGER During an electrical storm do not connect or disconnect any cable that has a conductive outer surface or a conductive connector SPSFDO05 DANGER Switch off power and unplug the machine power cable from the power receptacle before removing or installing any part that is connected to primary power SPSFD006 DANGER To prevent possible electrical shock during machine installation relocation or reconfiguration connect the primary power cable only after connecting all electrical signal cables SPSFD007 DANGER High voltage present Perform Lockout safety procedures to remove primary power to the frame SPSFDO008 DANGER
69. ff 0 position 2 Disconnect all switch data cables also clock cables and data cable interposers if present from the bulkhead of the switch assembly Replace the switch inner chassis Replace the switch planar Reinstall all cables inside the switch assembly Reinstall the top cover Reinstall the switch assembly Go to ESten 0620 042 Step 0620 042 You have replaced switch components and need to complete a final check before verifying the fix 1 Be careful to reconnect all cables according to the appropriate wiring chart or labels and put wrap plugs on any unused data ports SOO BH J3 through S00 BH J34 SONAR w Chapter 1 Maintenance Analysis Procedures MAPs 1 35 SP Switch function MAP 0620 2 Putthe SP Switch s circuit breaker into the On 1 position 3 Goto to verify fix Step 0620 043 You have replaced switch components and need to verify that the problem has been fixed 1 Make sure any processor node s that was put in SERVICE mode is returned to NORMAL mode Make sure all processor node s are IPLed completely in NORMAL mode If any processor nodes that lost switch clocks before or during service due to unplugging node data clock cable loss of clock at the SP Switch connected to processor nodes or so on you will need to include them in the following command From the control workstation enter dsh w nodelist usr lpp ssp css css_restart_node OR if all processor nodes have lo
70. g is a list of all safety notices in English only pertaining to SP hardware maintenance tasks from this and other RS 6000 SP hardware publications Translations of each of the safety notices into other languages are included in RS 000 SP Safety Informallarl DANGER notices warn you of conditions or procedures that can result in death or severe personal injury CAUTION notices warn you of conditions or procedures that can cause personal injury that is neither lethal nor extremely hazardous Each notice contains a reference number SPSFXXXX which you can use to help find a specific notice in other languages Danger notices DANGER Do not attempt to open the covers of the power supply Power supplies are not serviceable and are to be replaced as a unit SPSFDOOT DANGER An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of the system or the devices that attach to the system It is the responsibility of the customer to ensure that the outlet is correctly wired and grounded to prevent an electrical shock Before installing or removing signal cables ensure that the power cables for the system unit and all attached devices are unplugged When adding or removing any additional devices to or from the system ensure that the power cables for those devices are unplugged before the signal cables are connected If possible disconnect all power cables from the existing system before you add a d
71. he directory for these utilities is var adm SPlogs css Table 3 3 Tuning output files File see note Location Description daemon stdout Primary Keeps a detailed account of the tuning process initiated by the Estart command It includes data from every tuning operation since the current daemon on the primary node was initiated daemon results Primary A record of how many nodes were initialized 3 2 RS 6000 SP SP Switch Service Guide Table 3 3 Tuning output files continued File see note Location Description daemon stderr Primary A record of which nodes were not initialized out top Primary Reports errors from the last tuning procedure It begins as a copy of the topology file and errors are indicated to the right of each entry out top old Primary A copy of out top from the previous run router log Primary The router log file generated by the route table generator after initialization router log x All nodes The route information for a particular node x Note Unless otherwise noted the directory for these utilities is var adm SPlogs css Using the css snap script The css snap script collects log files created by switch support code device driver worm fault service diagnostic tests into a single package Attention css snap uses a number of undocumented utilities to collect information Some of these like read_regs and the tbXdump routines can be destr
72. hese procedures to replace a fan in an SP Switch assembly 1 Transfer shock mounts from the old fan to the new fan 2 Using another fan as an example route the fan cable and orient the fan with airflow indicator pointing toward the rear of the switch assembly 3 Mount fan in the fan bracket by pulling the shock mounts through the holes in the fan bracket 4 Push shock mounts outward to lock in place 5 Reconnect fan cable to the supervisor control cable 6 Reinstall top cover of switch assembl 7 9 to remove the switch Figure 4 2 Removing the SP Switch fans Removing the switch front chassis cable Note Refer to fH devices ndling before removing or installing ESD sensitive Chapter 4 FRU removals and replacements 4 3 Perform these procedures to remove the fan contra cable from an SP Switch assembly 9 to place the switch assembly into the service position 2 Unplug connectors P7 P8 and P9 Remove cable by unhooking retaining material along raceway and unplugging the Fan connectors P2 P3 P4 P5 and P6 Using fingertip pressure remove connector P1 by removing screws Retain these screws for new cable installation 3 Remove cable by unhooking retaining material along raceway Switch front chassis cable a Air flow Figure 4 3 Removing the SP Switch fan control cable Replacing the switch front chassis cable Perform these procedures to replace the fan control cable in a
73. his is the primary node command to start the RS 6000 Control Node SP High Performance Switch Estart_sw is called by Estart It is useful to note that the time out variable LIMIT for the Estart process is located in this file This may have to adjusted for systems that take longer to tune than in other systems Eprimary All nodes Sets a node as the primary node Eprotocol All nodes Stores switch protocol information in the System Data Repository for a node or a series of nodes Also retrieves a topology files out of the SDR Eclock Primary or Controls the clock source for each switch board within Control Node an SP cluster ifconfig All nodes This sets up the IP interface to the switch Note Unless otherwise noted the directory for these utilities is usr lpp ssp css Table 3 2 Setup output files File see note Location Description rc switch log All nodes Logs all information on the last run of rc switch It will indicate if this is a primary or secondary node the associated switch chip information and IP address information rc switch log previous All nodes A copy the previous run of rc switch expected top no_comments All nodes Used by rc switch to more easily parse out chip connection information fs_daemon_print file All nodes A log of the daemon fs_monitor log All nodes A log of the monitor daemon css snap All nodes Log files created by the switch support code Note T
74. his point you must run the advanced diagnostics in service mode on the device css0 and its associated switch port 1 Did you get an SRN from the diagnostics e If yes go to e If no go to 1 28 _RS 6000 SP SP Switch Service Guide SP Switch function MAP 0620 Step 0620 019 Since the diagnostics did not return an SRN no problem was detected 1 Have you just reseated repaired or replaced a component e If yes go to A to verify fix e If no a Problem is in the SP Switch b Go to ESten 0620 041 on page 1 35 to service or replace switch assembly Step 0620 020 The advanced diagnostics returned an SRN indicating that the diagnostics detected a problem 1 Is this SRN the same as the previous SRN e If yes go to LStep 0620 011 on page 1 23 to perform service on the next highest priority component in the list for this SRN e If no go to Step 0620 021 An error message in the var adm SPlogs css out top file indicated that you either have an external switch connection problem or a problem with a wrapped port Priority 3 in directed you to this step 1 Check the specified jacks at the rear of the SP Switches for an obvious problem such as a loose connection 2 Pay special attention to cables that may have been moved recently 6 to service the new SRN 3 Do connections appear to be okay e If yes go to e If no a Fix the obvious connection problem to verify fix Step 0620 022 The
75. iate switch clocks Selecting the switch clock source j Determining the correct switch clock source Removing and restoring switch resources Removing a switch assembly from the active configuration i Restoring a switch assembly to the active configuration Viewing switch partitions Fencing nodes Unfencing nodes Service position procedures Placing a switch assembly into service gt position Replacing a switch assembly from service position Resetting the clock and bootlist after servicing a node Installing firmware updates on SP nodes Installing adapter microcode packages Running diagnostics on a switch port Personal ESD requirements 3 1 3 1 3 4 3 4 3 5 3 5 3 6 3 6 3 6 et 3 7 3 8 3 8 3 8 3 8 3 9 3 9 3 9 3 9 3 10 3 10 3 12 The processor uses FRUs that are known to be sensitive to electrostatic discharge ESD To prevent ESD damage to FRUs or to prevent system failures observe the following procedures e Keep the FRU in its original static dissipative shipping container until the FRU is ready to be installed in the system Move the static dissipative container near the location where the FRU is to be installed within ESD wrist strap distance Ifthe FRU must be put down for any reason first place it in its static dissipative container or place it on the static dissipative mat e Open only the covers that are necessary to complete the task Any time a cov
76. ic location of the text you are commenting on for example a page number or table number e Fill out one of the forms at the back of this book and return it by mail by fax or by giving it to an IBM representative XVi _RS 6000 SP SP Switch Service Guide Summary of changes GA22 7443 04 This edition contains replaces GA22 7443 03 and any update versions made to that level and makes them obsolete This edition contains minor changes and fixes to softcopy cross book links GA22 7443 03 This edition contains replaces GA22 7443 02 and any update versions made to that level and makes them obsolete This edition contains minor changes and fixes to softcopy cross book links GA22 7443 02 This edition contains replaces GA22 7443 01 and any update versions made to that level and makes them obsolete This edition contains minor changes and corrections throughout the book GA22 7443 01 This edition replaces GA22 7443 00 and makes it obsolete Added cross book links for reference links between this publication and the other RS 6000 SP hardware publications These links assist navigating between documents in the softcopy environment when using the Adobe Acrobat Reader GA22 7443 00 First edition of the restructured RS 6000 SP hardware service library This publication along with the other SP service publications see xvi replaces The Maintenance Information Manuals Volumes 1 4 GA22 7375 GA22 7376 GA22 7377 and GA22 7378 and
77. ice 3 1 firmware updates on SP nodes installing 3 10 format structure 2 1 frame cable routing path in rear of frame 2 9 2 11 X 1 frame locations 2 3 2 5 2 6 frame naming standard 2 1 frame supervisor verification 3 5 front chassis cable SPS 1 8 front view of 49 inch frame locations 2 4 front view of frame locations 2 3 front view of multi switch frame locations 2 4 H handling static sensitive devices 4 1 installing adapter microcode packages 3 10 installing firmware updates on SP nodes 3 10 K kerberos authentication 3 1 L location diagrams of the RS 6000 SP components component connector details 2 9 external cable routing 2 11 frame 2 6 frame cable routing path in rear of frame 2 9 2 11 front view of 49 inch frame locations 2 4 front view of frame locations 2 3 front view of multi switch frame locations 2 4 rear view of frame locations 2 5 locations cable plug locations 2 1 connector details 2 1 location diagrams of RS 6000 SP components 2 1 major assembly naming standard 2 2 male wrap plug 1 4 microcode packages installing adapter 3 10 multi switch frame F C 2030 1 5 11 multi switch frame locations 2 4 N naming standard assembly 2 2 connector location 2 2 for RS 6000 SP components 2 1 format structure 2 1 frame 2 1 major assembly 2 2 node supervisor verification 3 4 nodes fencing 3 8 nodes unfencing 3 8 X 2 _RS 6000 SP SP Switch Service Guide P partitioning SPS 3 8 partiti
78. ice Request Number source Failing component s Description 763 200 D ext clock Problem detected with the through SPS MX adapter external clock SP switch 763 299 wrap plug except 763 282 763 282 D switch cable Adapter is good but switch cable is not attached 763 2A0 D SPS MX adapter Problem detected with the through internal clock 763 2A9 763 3xx D SPS MX adapter Problem detected with POS See note system or I O planar registers 763 4xx D SPS MX adapter TBIC test failed See note software 763 5xx D SPS MX adapter SRAM test failed See note 763 6xx D SPS MX adapter Microprocessor test failed See note 763 7xx D SPS MX adapter Interrupt test failed See note 763 8xx D SPS MX adapter FIFO test failed See note 763 9xx D SPS MX adapter DMA engine test failed See note switch cable system or I O planar 763 Ax3 D SPS MX adapter Card wrap test failed See note terminator 763 Ax4 D switch cable Cable wrap test failed See note wrap plug 763 xx2 D SPS MX adapter Bad adapter except SRN See note 763 282 763 xx3 D SPS MX Adapter Bad adapter 763 xx4 D SPS MX Adapter Bad switch cable Switch Cable 764 1xx D SP System Attachment Adapter Problem detected with a device See note software or device data corrupted system or I O planar Note You may need to reinstall ssp css software 764 200 D ext clock Problem detected with the through SP System Attachment Adapter external clock SP switch 764 299 wrap plug excep
79. ils as seen at receiving ends not at cable ends Cable routing and Eigure 2 8 on page 2 10 show back views of the RS 6000 SP frame showing the horizontal and vertical paths of cable routing from connector to connector with the depth amplified on the drawing Chapter 2 Locations 2 9 Note When attaching exterior and interior cables to a POWER3 SMP High Node allow for enough cable for a 2 foot service loop for node movement into service position 1 93 m frame _ 16 Hio 15 t Mpy 14 H8 13 1 36 m frame I 12 H8 11 y 1 My am am Y 10 H 9 i l AIR lp R ry 8 He 7 v 8 H6 7 v I gie My If TE v 6 H5 5 6 H5 5 Y r a ll a Ny 4 H4 3 al My 4 H4 3 Jy yr 2 H3 i 4 2 H 4 i Switch l i Switch i Power N T o ae E ai Figure 2 7 Frame cabling routing path in rear of RS 6000 SP frame 1 93 m frame
80. ion type P for plug cable side J for jack card component side G for chassis ground connection e 2nd 3rd and 4th characters are number identifiers Leading zeroes may be omitted Example P102 is plug 102 Examples for using complete levels of nomenclature To describe the jack 23 on the switch assembly bulkhead in the second RS 6000 SP frame in a four frame configuration designate as EQ2 S01 BH J23 To describe plug 1 on the power card of the any switch assembly of any RS 6000 SP frame in any size system configuration designate as E00 S00 PC P1 or just PC P1 Location diagrams of the RS 6000 SP components See Eigure 2 1 on page 2 3 Figure 2 2 on page 2 4 and Eigure 2 4 on page 2 4 in the pages that follow for views of the RS 6000 SP frame locations Refer to the diagrams included in this section for specific views and cabling of the main component sections in the RS 6000 SP frame 2 2 RS 6000 SP SP Switch Service Guide Front and rear views of RS 6000 SP frame Figure 2 1 shows a front view of the RS 6000 SP frame locations the assembly designations for the RS 6000 SP frame j describes SP frame locations 2 01 m and 1 93 m frames front view
81. ist before booting the node This is done as follows 1 Before powering down the node to be serviced display the current bootlist a Run diagnostics diag b Choose the Service Aids panel c Choose the Display Alter Bootlist panel Chapter 3 Service procedures 3 9 d Choose Normal Mode e Choose Display Current Bootlist This will display the current bootlist 2 Power down the node service it and hook it back into the frame 3 On the control workstation run spbootins to set the node to boot in maintenance mode For example if it is node 12 of frame 2 enter spbootins r maintenance 2 12 1 4 On the control workstation netboot the node From the SP Perspectives Launch Pad select Hardware Perspectives Click on the processor node or nodes you are going to boot from a network Click on Actions button on the tool bar Verify the nodes selected then click on the Apply button IPL from network device begins oaoop Note If Packets Received always shows 00000 there is a network or configuration problem 5 When this boots a console window will pop up on your display Follow the prompts a Start Maintenance Mode for System Recovery b Access a Root Volume Group c Continue d Choose correct disk from the list e Access this volume group and start a shell 6 In the maintenance shell set the date command For example to set the date to August 3 1995 do date 0803123095
82. itch function MAP 0620 Al for views 1 Make sure a wrap plug is properly installed on the connector Refer to of the switch wrap plugs 2 Have customer check and update if necessary the switch configuration file appropriately before continuing e For code level 1 02 and higher use the Etopology command 3 Did you just correct a problem with a wrap plug or correct the switch configuration e If yes go to e If no go to Step 0620 041 on page 1 39 to service or replace the switch assembly Step 0620 008 You have a switch cable connected to the indicated jack on the SP Switch 1 Check cable connection at SP Switch and processor node 2 Does the cable appear to be fully connected at the SP Switch and the processor node e If yes go to e If no a Fix the cable connection problem Step 0620 009 The cable appears to be fully connected at both the SP Switch and at the processor node 1 From the front of the RS 6000 SP frame or from the control workstation check the Power green LED on the processor node to make sure it is lit 2 Is the Power green LED lit indicating the processor node is powered on e If yes go to e If no go to Step 0620 0104 Step 0620 010 The processor must be powered on for the switch to recognize the port 1 Power on this processor node and check the Power green LED 2 Does the Power green LED stay lit indicating the processor node is powered on e If yes go to e If
83. ither a Perspectives window or by looking at the SP Switch you determined that the Power green LED is Off 1 Check the SP Switch s circuit breaker 2 Put this circuit breaker in the On 1 position if it is not already in that position 3 Does the circuit breaker go trip to the Off 0 position e If yes go to e If no go to 1 14 RS 6000 SP SP Switch Service Guide SP Switch power MAP 0610 Step 0610 005 When you put the circuit breaker into the On 1 position the circuit breaker tripped into the Off 0 position 1 Have the customer remove the SP Switch from the active configuration and power off the SP Switch 2 Putthe switch into service position refer to Step 0610 006 When you put the circuit breaker into the On 1 position the circuit breaker stayed in the On 1 position 1 From the control workstation or SP Switch check Power green LED for this switch 2 Is the Power green LED lit e If the Power green LED is not lit go to Step 0610 0071 e If the Power green LED is lit a You have resolved the SP Switch problem Zi and refer to Step 0610 007 The Power green LED is not lit 1 Check processor nodes or SP Switches on any other dc power harness for any that are powered on 2 Ensure that the other processor node or SP Switch has its circuit breaker in the On 1 position 3 Check the Power green LED for an On or Flashing con
84. n locate the frame which was sourcing the switch clock 2 Locate a test processor node in this frame which can be used for service 3 Disconnect the switch data cable for this processor node at the tailgate of the SP Switch 4 Using the switch cabling list locate the switch data cable at the frame containing the processor node One of the following cables sources the clock to the failing SP Switch e S00 BH J3 e S00 BH J5 e S00 BH J7 Chapter 1 Maintenance Analysis Procedures MAPs 1 31 SP Switch function MAP 0620 e SO00 BH J9 5 Disconnect the switch data cable at the tailgate of the frame containing the processor node 6 Connect the end of the processor node data cable to the jack 7 Run advanced diagnostics on css0 on the test processor node and its associated switch port Do not perform the cable wrap test e Refer to Running diagnostics in a processor node in 8 Look for an SRN indicating a clock problem such as External clock 9 Do the css0 diagnostics fail with an indication of a clock problem e If yes go to Step 0620 0311 e If no go to ESten 0620 0301 Step 0620 030 The css0 diagnostics do not fail with an indication of a clock problem This indicates that the problem is either in a cable or in the reported SP Switch 1 Replace switch data cable that originally supplied the switch clock to the failing SP Switch 2 Set the switch clock selection on this SP Switch to the
85. n SP Switch assembly 1 Using fingertip pressure attach connector P1 with screws retained in the removal procedure Route cable along raceway hooking retaining material where needed and plugging Fan connectors P2 P3 P4 P5 and o Plug connectors P7 P8 and 9 to remove the switch assembly an the service position Removing the switch supervisor card Note Refer to Handling 8 before removing or installing ESD sensitive devices Attention The hexhead retaining screws require a 4 mm socket Perform these procedures to remove the supervisor card from an SP Switch assembly 9 to place the switch assembly into the service position 2 Unplug connector P5 3 Rotate card thumb locks outward to unseat card 4 Remove supervisor card 4 4 RS 6000 SP SP Switch Service Guide Replacing the switch supervisor card Perform these procedures to replace the supervisor card in an SP Switch assembly 1 Insert supervisor card 2 Rotate card thumb locks inward to seat card 3 Plug connector P5 4 A to remove the switch Switch Supervisor Card Figure 4 4 Removing the SP Switch supervisor card Removing the switch inner chassis cable Note Refer to H before removing or installing ESD sensitive devices Perform these procedures to remove the supervisor power cable from an SP Switch assembly B 9 to place the switch assembly into the service position 2 Remove screws from connector P1 at the rear
86. nce New Frame Entrance Vertical Routing Routing Old Node millimeters inches Style Old Style Old Style Style 1 1800 71 E3 E1 V4 H3 2 1500 59 E3 E1 V4 H3 3 1680 66 E3 E2 V5 H4 4 1980 78 E3 E2 V5 H4 5 2160 85 E3 E1 V3 H5 6 1850 73 E3 E1 V3 H5 7 2030 80 E3 E2 V6 H6 8 2340 92 E3 E2 V6 H6 9 2510 99 E3 E1 v2 H7 10 2210 87 E3 El V2 H7 11 2390 94 E3 E2 V7 H8 12 2690 106 E3 E2 V7 H8 13 2870 113 E3 El V1 H9 14 2570 101 E3 E1 V1 H9 15 2740 108 E3 E2 V8 H10 16 3050 120 E3 E2 V8 H10 2 11 1 93 m frame 16 H10 15 v qll ei e i i 1 36 m frame i 12 He 1 l nl TE v _ to Hr 9 In TE In hg 8 H6 7 y i 8 H6 7 v l qj all Wy l ns s 6 H 5 M r um Way r F E 7 1 i AT I b 4 M 3 T l rn 4 3 y L244 2 1 1 p Switch i l Switch Powe Power L Oo 2 Figure 2 9 Frame cable routing paths in rear of RS 6000 SP multi switch frame F C 2030 1 1 93 m frame
87. ocessor node 2 of 5 as primary node epnnection e If this is a dependent node connection to ae go to ee node e el in 3 e Wrapped ports Ato check the frame to frame wrapped port or external swich connection 3 of 5 e switch to switch e Switch connections 4 Node connections not on same e If this is a standard node go to ESten 0620 0051 to 4 of 5 switch as primary node check secondary processor node connection e If this is a dependent node connection to Mee go to Se node a sea in 5 All problems in var adm SPlogs css out top file have 5 of 5 been addressed Step 0620 005 The var adm SPlogs css out top file indicates a problem with a Primary node or Secondary node connection 1 Open frame rear cover and check the cable connection from the indicated SP Switch jack to the processor node 2 Is there a cable connected to the indicated jack on the SP Switch e If yes go to If no go to EStep 0620 0061 Step 0620 006 No cable connected at the indicated jack on the SP Switch refer to the tables in 1 Should there be a cable connecting the indicated jack to a processor node e If yes a Connect a switch data cable from this jack to the proper processor node If no go to FStep 0620 0071 Step 0620 007 A switch cable is not required on the indicated jack 1 22 RS 6000 SP SP Switch Service Guide SP Sw
88. of adding the pane to the current window or to a new window If desired enter a new pane title Click OK to open the pane and close the dialog box e You may repeat these steps to add additional frame panes In the Frame and Switch pane click the icon of the frame you want to verify Click the Notebook icon on the tool bar e When the Notebook window opens make certain that the Frame Status tab is selected The Controller responds attribute displays the status of the frame supervisor e OK displayed in a green box indicates that the frame supervisor is responding e No response displayed in a red box indicates that the frame supervisor is not responding oap Note Clicking Help in the Notebook window s lower right corner displays attribute descriptions Switch supervisor verification From the Hardware Perspectives window 1 Unless you have saved display settings tne Hardware Perspective does not open with a switch pane displayed To open a switch pane a Click the Add Pane icon on the tool bar e The Add Pane dialog box opens From the Pane Type pull down select Frames and Switches Select your choice of adding the pane to the current window or to a new window If desired enter a new pane title Click OK to open the pane and close the dialog box e You may repeat these steps to add additional switch panes In the Frame and Switch pane click the icon of the switch you want to verify e A switch
89. of the switch assembly and retain them for new cable installation Remove cable by unhooking retaining material along raceway Unplug connectors P3 P4 P5 and P6 Chapter 4 FRU removals and replacements 4 5 Inner Chassis Cable Figure 4 5 Removing the SP Switch inner chassis cable Replacing the switch inner chassis cable Perform these procedures to replace the supervisor power cable from an SP Switch assembly 1 Plug connectors P3 P4 P5 and P6 Route cable along the raceway hooking retaining material where needed Attach P1 connector to the rear of the switch assembly with screws retained in the removal procedure 9 to remove the switch assembly from the service position Removing the switch power cards before removing or installing ESD sensitive devices Attention The hexhead retaining screws require a 4 mm socket Perform these procedures to remove the switch power cards from an SP Switch assembly g e 9 to place the switch assembly into the service position 2 Unplug connector P4 PS1 or P6 PS2 3 Rotate card thumb locks outward to unseat card 4 6 RS 6000 SP SP Switch Service Guide 4 Remove power supply card Replacing the switch power cards Perform these procedures to replace the switch power cards in an SP Switch assembly 1 Insert power supply card 2 Rotate card thumb locks inward to seat card 3 Plug connector P4 PS1 or P6 PS2 4 A to remove the switch Figu
90. oning switch 3 8 Perspectives verification tests using 3 4 placing a SPS assembly into service position 3 9 Procedures ESD 3 1 purpose of book xv task procedures overview xv R rear view of frame locations 2 5 removing 4 1 switch assembly 4 2 switch fans 4 3 switch front chassis cable 4 3 switch inner chassis cable 4 5 switch power card 4 6 switch supervisor card 4 4 the RS 6000 SP components 4 1 removing a switch assembly from the active configuration 3 7 removing switch resources 3 7 replacing 4 1 switch assembly 4 2 switch fans 4 3 switch front chassis cable 4 4 switch inner chassis cable 4 6 switch power card 4 7 switch supervisor card 4 5 the RS 6000 SP components 4 1 replacing a SPS assembly from service position 3 9 Requirements ESD 3 1 restoring a switch assembly to the active configuration 3 8 restoring switch resources 3 7 S selecting appropriate switch clocks 3 6 selecting the switch clock source 3 6 service position procedures 3 9 service procedures placing a SPS assembly into service position 3 9 removing switch resources 3 7 replacing a SPS assembly from service position 3 9 restoring switch resources 3 7 selecting appropriate switch clocks 3 6 service position procedures 3 9 verification and isolation procedures kerberos authentication 3 1 Service Request Number SRN table for SP Switch adapters 1 24 SP Switch adapters Service Request Number SRN table 1 24 SP Switch environment MAP flowcharted 1 5
91. oot access Customer may enter root password when required 2 If either of the following two conditions is true Estart has not been run by the customer e The CE has already replaced components that might affect the switch then from the control workstation of the primary processor node enter Estart You should receive a message indicating successful initialization at the primary node If you receive any other message consult the Diagnosing High Performance Switch Problems section of BM Parallel System Support Programs for AIX Diagnosis Guide 3 Determine the primary node number From an AIX window on the control workstation enter Eprimary 4 Determine the host name for this node by checking reliable hostname for the node number with the command Chapter 1 Maintenance Analysis Procedures MAPs 1 17 SP Switch function MAP 0620 splstdata n pg 5 From the system file server log into primary processor node as root using the telnet command telnet PrimaryNodename 6 Check errpt a N Worm pg for any switch related problems If any errors are listed use the error information with this MAP to help isolate the problem 7 Check the functional state of the switch in the out top file by issuing the following command on the primary processor node pg var adm SPlogs css out top Notes a Verify that the timestamp on the out top file is current neue Is l var adm SPlogs css out top
92. original correct setting 3 From the control workstation of the primary node reinitialize the switch by entering Estart 4 Check var adm SPlogs css out top file to see if problems are still reported on all data cables connected to this SP Switch 5 Are there still problems reported on all data cables connected to this SP Switch e If yes a You have isolated the clock problem to the reported SP Switch b Return to e If no a Clock problem resolved for this SP Switch b Go to Step 0620 031 The css0 diagnostics fail with an indication of a clock problem This indicates that there is a problem with switch clock source SP Switch 1 Replace clock card in the clock source SP Switch 2 Make sure to reconnect all cables connected to the clock card 3 Set the switch clock selection on this SP Switch to the original correct setting 4 Return to Step 0620 032 The clock settings on the problem switch appear to be okay and the problem switch is mounted in a frame that also contains processor nodes 1 Check the ground straps connected to the problem frames e Ground straps are connected at bolts near the cable escapes inside the bottom rear of each frame e For a view of a ground strap connection refer to 2 Do all ground straps connecting the frames make adequate contact at both ends e If yes go to e If no a Fix or replace ground strap between the frames 1 32 RS 6000 SP SP Switch Servi
93. orm cable wrap test e Refer to Running diagnostics in a processor node in 3 Look for an SRN indicating a clock problem such as External clock being listed as one of the failing components j to verify fix Note Do not exit from diagnostics at this time as it may be used in the following steps 4 Do the css0 diagnostics fail with an indication of a clock problem e If yes a Problem is not in the SP Switch b Replace the switch data cable corresponding to the selected clock reconnecting it at both SP Switches e If no go to Step 0620 035 The css0 diagnostics did not fail with an indication of a clock problem This indicates that the problem is related to the clock selection in this SP Switch 1 Replace the clock card in this SP Switch e Make sure to connect all cables connected to the clock card 2 Reinstall only the two switch data cables which are not selected and the switch data cable to the test processor node e Do not connect the remaining switch data cables at this time 3 Reconnect cables S00 BH P1 and SO0 BH P2 at the rear of the SP Switch 4 Put the SP Switch s circuit breaker into the On 1 position Chapter 1 Maintenance Analysis Procedures MAPs 1 33 SP Switch function MAP 0620 5 6 Run advanced dj diagneeties on css0 on the test processor node and its associated switch port e Do not perform cable wrap test e Refer to Running di
94. ound a problem with the switch supervisor control cable S00 SP P102 1 One or more of the following conditions exist e Warning Fan fanwarning1 fanwarning2 fanwarning5 e Shutdown Fan fanfail1 fanfail2 fanfail5 Have customer remove the SP Switch from the active configuration and power off the SP Switch Set switch circuit breaker to the Off position Unplug cables connected to J1 and J2 at the rear of the SP Switch Remove the front panel assembly from the inner chassis se Hable 1 4 on page 1 8 to reseat or replace components Dapy Chapter 1 Maintenance Analysis Procedures MAPs 1 7 SP Switch environment MAP 0600 Table 1 4 Fan failure diagnostics Priority Component Action 1 Fan 1 2 3 4o0r5 a Check specified fans for blockages or loose cable connections 1 of 5 j b Fix ay obvious problems and continue at Steg c If you do not find any problems continue at Priority 2 2 Fan 1 2 3 4or5 a Replace fans as described in 2 of 5 i b Continue 1e at FStep 0600 0121 3 Switch supervisor card a Replace the card 3 of 5 b Continue at EStep 0600 0121 4 Switch supervisor control cable a Replace the cable Refer to Eigure 1 3 for cable 4 of 5 conngotons b Continue at EStep 0600 0121 5 All replaced Call next level of support 5 of 5 P5 Switch SV Card 2x20 Figure 1 3 SP Switch inner chassis and front chassis cables Inner
95. ource Removing and restoring switch resources Removing a switch assembly from the active configuration i Restoring a switch assembly to the active configuration Viewing switch partitions Service position procedures Placing a switch assembly into service gt position Replacing a switch assembly from service position Resetting the clock and bootlist after servicing a node Installing firmware updates on SP nodes Installing adapter microcode packages Running diagnostics on a switch port Chapter 4 FRU removals and replacements Handling static sensitive devices a Procedures for switch assemblies Removing the switch assembly Replacing the switch assembly Removing the switch fans Replacing the switch fans Removing the switch front chassis cable Replacing the switch front chassis cable Removing the switch supervisor card Replacing the switch supervisor card Removing the switch inner chassis cable Replacing the switch inner chassis cable Removing the switch power cards Replacing the switch power cards Chapter 5 Parts catalog Re SPS SPS 8 Switch assembly feature view 1 SPS SPS 8 Switch assembly feature view 2 SPS SPS 8 Switch assembly an view 3 Switch cables feature re Multi switch frame F C 2030 1 Notices Trademarks f Electronic emissions notices ee Federal Communications Commission FCC statement European Union EU statement United Kingdom telecommunica
96. plstdata s pg SDRGetObjects DependentNode 4 lf this occurs on all nodes of a frame check for a logical to physical frame number mismatch 5 Ifthere is a mismatch have the customer update the configuration or you can correct the cabling as appropriate 6 Send packet from local node failed Description Possible hardware problem 5 SPS 6 Device status Device has been Description AUTOJOIN was not selected for node during SPS removed from network no removal AUTOJOIN Action Link status Link has been removed If there is an obvious problem like node powered off or from network no AUTOJOIN disconnected fix that problem first Then issue Eunfence for this processor node or dependent node 2 If problem was resolved repeat Step 0620 001 onl for next problem or go to Step 0620 043 on to verify fix 3 If PEES was not resolved go to Device status Device has been Description Possible hardware problem 7 removed from network for not SPS responding Action Go to 1 20 Link status Link has been removed from network fenced RS 6000 SP SP Switch Service Guide Table 1 7 SP Switch error conditions continued SP Switch function MAP 0620 Error Message condition Description and action 8 Device status Device has been Description Initialization of this link detected a different switch SPS removed from network because of a node number than the one expected mis
97. publication IBM may make improvements and or changes in the product s and or the program s described in this publication at any time without notice IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you Trademarks The following terms are trademarks of the International Business Machines Corporation in the United States or other countries or both AIX ESCON IBM IBMLink pSeries RS 6000 S 370 SP server Other company product and service names may be the trademarks or service marks of others Copyright IBM Corp 1999 2002 A 1 Electronic emissions notices Federal Communications Commission FCC statement This equipment has been tested and found to comply with the limits for a Class A digital device pursuant to Part 15 ofthe FCC Rules These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment This equipment generates uses and can radiate radio frequency energy and if not installed and used in accordance with the instruction manual may cause harmful interference to radio communications Operation of this equipment in a residential area is likely to cause harmful interference in which case the user will be required to correct the interference at his own expense Properly shielded and grounded cables and connectors must be used in order to meet
98. rame and component identification information GA22 7442 General SP system service procedures the system Start MAP and MAPs and parts catalog for the frames and power subsystems Use this book to begin a diagnostic procedure to isolate a problem to a specific major component of the SP system uida GA22 7444 Service procedures MAPs and parts catalog information specific to the SP Switch2 GA22 7445 Service procedures MAPs and parts catalog information specific to all uniprocessor type nodes parts catalog information specific to this node This book and other RS 6000 SP hardware and software documentation are available both on line and for some books in printed form from the following sources e The Web site at http www ibm com servers eserver pseries library sp_books index html e The Resource Center on the PSSP product media e Printed and CD ROM versions which can be ordered from IBM For more information on these sources and an extensive listing of RS 6000 SP related publications see the bibliography in Copyright IBM Corp 1999 2002 XV How to send your comments Your feedback is important in helping to provide the most accurate and highest quality information If you have any comments about this book or any other RS 6000 SP documentation e Send your comments by e mail to mhvrefs us ibm com Be sure to include the name of the book the order number of the book and if applicable the specif
99. re 1 1 on page 1 4 for a high level view of the RS 6000 SP Switch Switch connection types Standard node Processor nodes in 9076 SP frames are attached to the SP Switches with switch cables Dependent node Switch adapter resides outside of the 9076 SP frame Each dependent node is a single SP Switch Router Adapter in the 9077 Switch Router There can be several dependent nodes in each SP Switch Router Switch to switch Connections between SP Switches SP Switch types SPS All clocks are distributed through data cables External clock inputs are selected from J3 J4 or J5 Copyright IBM Corp 1999 2002 1 1 Switch description and problem determination MAP 0590 FRUs include Fans circuit breaker LED card switch supervisor card switch power card s inner chassis cable front chassis cable complete assembly SPS 8 There are only 8 ports All clocks are distributed through data cables FRUs include Fans circuit breaker LED card switch supervisor card switch power card s inner chassis cable front chassis cable complete assembly On external bulkhead SSL I Ibi lb SS IISS J1 J2 Sa all On planar P1 Supervisor bus card Switch planar 48 V dc 12 V dc Switch power Switch Switch power supply 2 supervisor supply 1 card Inner chassis cable
100. re 4 6 Removing the SP Switch power supply cards Chapter 4 FRU removals and replacements 4 7 4 8 RS 6000 SP SP Switch Service Guide Chapter 5 Parts catalog SPS SPS 8 Switch assembly feature view 1 SPS SPS 8 Switch assembly feature view 2 SPS SPS 8 Switch assembly feature view 3 Switch cables feature 22 20 Multi switch frame F C 2030 1 This chapter presents the Parts Catalog listing all RS 6000 SP Switch parts and FRUs with corresponding figures containing indexed descriptions Copyright IBM Corp 1999 2002 5 2 5 4 5 6 5 8 5 10 SPS SPS 8 Switch assembly feature view 1 5 2 RS 6000 SP SP Switch Service Guide Table 5 1 SPS SPS 8 Switch assembly feature view 1 Assembly Part number index 1 26H7255 77G0961 46H9305 26H7212 26H7395 84X4841 46G5686 46H9730 77G0818 46H9220 oo N O oO A WwW PY Units Description SPS Switch assembly reference only SPS 8 Switch assembly reference only Front chassis assembly Card power Card switch supervisor Screw hex Hd M4 x 6 Fan assembly Nut hex M4 Cable power Chassis outer Wrap SPS not shown Wrap SPS 8 not shown Chapter 5 Parts catalog 5 3 SPS SPS 8 Switch assembly feature view 2 5 4 RS 6000 SP SP Switch Service Guide Table 5 2 SPS SPS 8 Switch assembly feature view 2 Assembly index e a A O N Part number 11P
101. res MAPs 1 11 SP Switch environment MAP 0600 Step 0600 025 You have a PS1Fail problem 1 Replace power supply card PS1 2 Reinstall the SPS front panel assembly being careful to align the guide pins on the P1 to the inner chassis cup guide 3 Replug the power cable J1 and supervisor cable J2 to the rear of the assembly 4 Put the circuit breaker in the On 1 position 5 Does the PS1Fail condition still exist e If yes go to Step 0600 02611 e If no go to Step 0600 026 After replacing the power supply card you still have a PSxFail condition 1 Put the circuit breaker in the Off 0 position 2 Remove the power cable J1 and supervisor cable J2 from the rear of the SP Switch 3 Remove the front panel assembly 4 Replace the supervisor card 5 Reinstall the SPS front panel assembly being careful to align the guide pins on the P1 to the inner chassis cup guide 6 Replug the power cable J1 and supervisor cable J2 to the rear of the assembly 7 Put the circuit breaker in the On 1 position 8 Does the PSxFail condition still exist e If yes a Replace the SP Switch inner chassis assembly Step 0600 027 You have a PSFuseGood problem 1 Remove power supply PS1 refer to 2 Reinstall the SPS front panel assembly bengi careful to align the aide pins on 1 the P1 connector to the inner chassis 3 Replug the power cable J1 and supervisor cable J2 at the rear of
102. res around the frame looking for sources of abnormally high temperatures above 40C or 104F 2 Is there an obvious airflow blockage or abnormally high temperature source near air intakes e If yes go to Step 0600 0167 If no go to Step 0600 014 You have an over temperature condition but you did not find an obvious airflow blockage or abnormally high temperature source near air intakes This indicates that there is a problem in the switch supervisor card 1 Have the customer remove the switch from the active configuration and power off the SP Switch 2 Set the circuit breaker to the Off position Unplug cables connected to J1 and J2 at rear of the SP Switch 3 Remove the front panel assembly from the inner chassis 4 Replace the switch supervisor card 5 6 Perform Switch supe Q Does the card pass self test e If yes go to Step 0600 0151 e f the card does not pass self test check the cable connections to the switch supervisor card If no obvious problems are found call next level of support Step 0600 015 The switch supervisor card passed the self test 1 Check Environmental yellow LED for ON or FLASHING condition 2 Is the Environmental yellow LED ON or FLASHING e If the Environmental LED is on or flashing check the cable connections to the switch supervisor card If no obvious problems are found call next level of support If the Environmental LED is not on or flashing a You have resolved
103. s connected to this SP Switch 4 Have some of the problems reported on data cables connected to this switch assembly been resolved e If yes go to EStep 0620 02911 If no go to Step 0620 028 None of the problems reported on data cables connected to this SP Switch been resolved by selecting a different external clock This indicates a probable problem with switch clock card or switch planar in this SP Switch 1 Replace the clock card in this SP Switch 2 Make sure to reconnect all cables connected to the clock card 3 Reconnect cables SO0 BH J1 S00 BH J2 SOO BH J3 SOO BH J5 and S00 BH J7 at tailgate of this SP Switch 4 Set the switch clock selection on this SP Switch to the original correct setting 5 From the control workstation of the primary node reinitialize the switch by entering Estart 6 Check var adm SPlogs css out top file to see if problems are still reported on all data cables connected to this SP Switch 7 Are there still problems reported on all data cables connected to this SP Switch e If yes a You have a problem with switch planar to replace switch assembly e If no a Clock problem resolved for this SP Switch b Go to Step 0620 029 Some of the problems reported on data cables connected to this SP Switch been resolved by selecting a different external clock This indicates a probable problem with the original external clock source 1 Based on the original external clock selectio
104. s step 1 Check the switch clock selections on the problem switches 2 Do the clock settings on the problem SP Switches appear to be okay e If yes go to Step 0620 0141 e Ifno a Change the clock selections on the appropriate switches refer to Blacks on page ac b Go to Gl to verify fix Step 0620 014 The clock settings on the problem SP Switches appear to be okay 1 Is an external clock selected e If yes go to Step 0620 0181 e If no go to EStep 0620 0151 Step 0620 015 An external clock is not selected This indicates that you have a defective master clock card 1 Is this the only switch in this system e If yes a Replace the switch assembly to verify fix e If this is not the only switch in this system go to EStep 0620 0161 Step 0620 016 The system you are working on either has an external clock or the system uses more than one switch 1 Have the customer select an alternate clock via the Eclock command 2 Issue the Estart command 3 Does the switch run e If yes go to Step 0620 0181 e If no a Replace the switch assembly to verify fix Step 0620 017 An SRN listed in fable 1 9 on page 1 24 indicated that you have a software problem and Priority 1 in directed you to this step 1 From the node front panel on the control workstation put the node in the SERVICE mode 2 Power on this processor node 3 Go to Step 0620 0181 Step 0620 018 At t
105. service position Step 0600 001 You have arrived at this MAP from able 1 1 on page 1 Al 7 1 Perspectives reports Warning Shutdown or Failure message associated with SP Switch 2 Does message indicate Shutdown or Failure e If yes go to EStep 0600 00311 e No the message indicates a Warning go to ESten 0600 00211 Step 0600 002 You received a switch environment Warning 1 Does this same message occur on other SP Switches or on any processor nodes mounted in the same frame as the switch 9 for placing a switch into the service position or e If yes call the next level of support e If no verify that the customer is not experiencing problems with this SP Switch f no problems are being experienced or this is an N 1 fan or power supply failure then no immediate service is required and service can be deferred until a later date If problems are being experienced service can be performed now by returning to ESted pano o01 and treating this message as a Shutdown or Failure message Step 0600 003 A Perspectives message indicates a Shutdown or Failure This means that a serious environmental condition has been detected in the SP Switch Note If service action has just been completed on this SP Switch check for loose cables or shorted conditions in this component 1 Based on the text of the message use the information in Table 1 2 to contin
106. st switch clocks enter dsh a usr lpp ssp css css_restart_node where nodelist is a list of node names separated by commas Example dsh w fr2n03 fr2n04 fr3n01l usr lpp ssp css css_restart_node Attention Issuing this command on processor nodes that did not lose a clock may cause unpredictable results possibly requiring relPL Have customer verify that the switch code is running on all processor nodes OR from the control workstation enter Estart 4 Check var adm SPlogs css out top file to verify that there are no other switch problems 1 36 RS 6000 SP SP Switch Service Guide Chapter 2 Locations Naming standard for RS 6000 SP ONE 2 oe ee ee ee ee ee ee ee Format structure pact amp hog Bom woe amp amp aoe db ok ase g 2 Example of format structure Gu eee ete ee ee ea Re a ee Frame WWW he te el Major assembly XXX s a non 2 Sub assembly YY 2 1 ke ee ee 22 Connection location ZZZZ a es Bue te ee A BR Ge ae et eh eee Examples for using complete levels of nomenclature ge Aa se eR ts Be te a CA ee gy og ae Location diagrams of the RS 6000 SP components 2 2 2 22 Front and rear views of RS 6000 SP frame 2 2 a ee ee ee 28 Frame locations mo n moe poe oa e a ooa d actae w a oi 2 6 Frame FRA x ss sesa be oe woe ee Eee ee ee ee tose bo 28 Switch assembly locations 2 1 ee ee 2 8 Connector details s s zs e k e
107. t 764 282 764 282 D switch cable Adapter is good but switch cable is not attached Chapter 1 Maintenance Analysis Procedures MAPs 1 25 SP Switch function MAP 0620 Table 1 9 Service Request Number SRN table for SP Switch adapters continued SRN Service Request Number source Failing component s Description 764 2A0 D SP System Attachment Adapter Problem detected with the through internal clock 764 2A9 764 3xx D SP System Attachment Adapter Problem detected with POS See note system or I O planar registers 764 4xx D SP System Attachment Adapter TBIC test failed See note software 764 5xx D SP System Attachment Adapter SRAM test failed See note 764 6xx D SP System Attachment Adapter Microprocessor test failed See note 764 7xx D SP System Attachment Adapter Interrupt test failed See note 764 8xx D SP System Attachment Adapter FIFO test failed See note 764 9xx D SP System Attachment Adapter DMA engine test failed See note switch cable system or I O planar 764 Ax3 D SP System Attachment Adapter Card wrap test failed See note terminator 764 Ax4 D switch cable Cable wrap test failed See note wrap plug 764 xx2 D SP System Attachment Adapter Bad adapter except SRN See note 764 282 764 xx3 D SP System Attachment Adapter Bad adapter 764 xx4 D SP System Attachment Adapter Bad switch cable Switch Cable 80x 762 D Software Error oc
108. teps can potentially fail therefore be sure that a diagnostic wrap plug or terminator is not faulty before performing major replacements b Where applicable frame and SP Switch jack numbers are indicated on each line of the out top file Refer to For example L01 S00 BH J18 for more information on the nomenclature identifies Logical frame 1 SOO BH J18 E02 S00 BH J18 identifies Physical frame 2 SOO BH J18 L03 N01 identifies Logical frame 3 Node 1 E02 N04 identifies Physical frame 2 Node 4 01 S02 BH J3 identifies Multi switch frame 1 slot 2 BH J3 Chapter 1 Maintenance Analysis Procedures MAPs 1 21 SP Switch function MAP 0620 Refer to notes at beginning of on cable connections and logical to yia frame translations c Be aware that for logical frames consisting of two physical frames all out top even numbered nodes represent nodes physically located in the expansion frame F C 1010 but at the next lower slot number For example L01 N04 logical frame 1 node 4 could be physically located at E02 NO3 physical frame 2 slot 3 for more information Table 1 8 Problem messages from the out top file Priority Message condition Action 1 On board connections j to replace the switch assembly 1 of 5 2 e Primary node connection e If this is a standard node go to to Node connections on same switch check primary or secondary pr
109. ternal 15 m Cable S S external 20 m 5 8 RS 6000 SP SP Switch Service Guide Chapter 5 Parts catalog 5 9 Multi switch frame F C 2030 1 5 10 RS 6000 SP SP Switch Service Guide Table 5 5 Multi switch frame F C 2030 1 Assembly index 1 o oO NOOA DY ai oa pe Oo Part number 54G3281 93G1065 0375867 84X4761 17H5026 54G2877 54G2947 1624776 26H7198 08J5557 04H9469 11J5189 11J5191 11J5193 11J5195 21L3932 Units OF GF B BO mw w Description SPS Switch Assembly Screw Phil Pan Hd M5 x 12 Shelf Assembly Nut Clip Screw Hex FI Hd Supradrive Screw SI Hex FI Hd M5 x 6 Panel Switch Blank Panel Blank 7 inch Screw Hex FI Hd M5 x 12 Extender Frame Chocks Rubber Duct Long Cable 48 V dc frame slots 2 and 4 not shown Cable 48 V dc frame slots 6 and 8 not shown Cable 48 V dc frame slots 10 and 12 not shown Cable 48 V dc frame slots 14 and 16 not shown Cable 48 V dc connector converter not shown Chapter 5 Parts catalog 5 11 5 12 _RS 6000 SP SP Switch Service Guide Notices This information was developed for products and services offered in the U S A IBM may not offer the products services or features discussed in this document in other countries Consult your local IBM representative for information on the products and services currently available in your area
110. the assembly 4 Put the circuit breaker in the On 1 position 5 Does the failure condition still exist e If yes go to e If no go to Sten 0600 0281 Step 0600 028 You have a PSFuseGood problem that went away when you removed power supply PS1 1 Put the circuit breaker in the Off 0 position Remove power cable J1 and supervisor cable J2 from the rear of the SP Switch Remove the front panel assembly Replace power supply PS1 Take the switch out of the service position refer to Dapy 1 12 _RS 6000 SP SP Switch Service Guide SP Switch environment MAP 0600 Step 0600 029 You have a PSFuseGood problem that did not go away when you removed power supply PS1 1 Put the circuit breaker in the Off 0 position 2 Remove power cable J1 and supervisor cable J2 from the rear of the SP Switch 3 Remove the front panel assembly 4 Replug power supply PS1 and remove power supply PS2 5 Reinstall the SPS front panel assembly being careful to align the guide pins on the P1 connector to the inner chassis 6 Replug the power cable J1 and supervisor cable J2 at the rear of the assembly 7 Put the circuit breaker in the On 1 position 8 Does the failure condition still exist e If yes go to ESten 0600 03111 If no go to FStep 0600 0304 Step 0600 030 You have a PSFuseGood problem that went away when you removed power supply PS2 1 Put the circuit breaker in the Off
111. this MAP 1 From a Perspectives window on the control workstation or by looking at the SP Switch check the Power green LED for this SP Switch 2 The definition of the Power green LED is as follows r Power Green Off No 48 V dc power available at the SP Switch Flashing Power available at the SP Switch but switch logic is Off On Power available at the SP Switch and logic is On 3 Is Power green LED Off e If yes go to Step 0610 0041 If no go to FStep 0610 0021 Step 0610 002 You found the switch Power green LED was lit This indicates that the SP Switch is getting 48 V dc power 1 Is Power green LED flashing e If yes go to Step o610 0nal e If the Power LED is lit and it is not flashing a This indicates that you do not have a power supply problem b Verify that you have the proper SP Switch then restart determination MAP 0590 on page c If this is the proper SP Switch call the next level of support Step 0610 003 The Power LED is lit and it is flashing 1 This indicates that the SP Switch is getting power 2 Power up switch logic from the virtual front panel on the control workstation 3 Does Power green LED light and stay lit e If yes the switch logic is getting power and refer to e If the Power LED does not stay lit the SP Switch is not responding to the command a Go to Frame supervisor not responding MAP 0110 in Step 0610 004 From e
112. tion This is a Class A product In a domestic environment this product may cause radio interference in which case the user may be required to take adequate measures United Kingdom telecommunications safety requirements Notice to customers This apparatus is approved under approval number NS G 1234 J 100003 for indirect connection to public telecommunications systems in the United Kingdom Industry Canada compliance statement This Class A digital apparatus meets the requirements of the Canadian Interference Causing Equipment Regulations Cet appareil num rique de la classe A respecte toutes les exigences du Reglement sur le mat riel brouilleur du Canada A 2 RS 6000 SP SP Switch Service Guide For installations in Japan SNzElt FRUNERESERFEBE FH EZ VCC DEI BOC DIRAA ase CT Se ee CBRE in SOSEIIKAENEUNGNRER O k DBR The following is a summary of the VCCI Japanese statement in the box above This is a Class A product based on the standard of the Voluntary Control Council for Interference by Information Technology Equipment VCCI If this equipment is used in a domestic environment radio disturbance may arise When such trouble occurs the user may be required to take corrective actions Electromagnetic interference EMI statement Taiwan Sores ie ce HARRY aN En TES Eee Pe AB iF ARE Se SEE EEM T GAS SRE Ree ee BHA The following is a summary of the EMI Taiwan statement above Warning This is a
113. tions safety requirements Industry Canada compliance statement For installations in Japan Electromagnetic interference EMI statement Taiwan Radio protection for Germany Index iv RS 6000 SP SP Switch Service Guide 3 6 3 6 3 7 3 8 3 8 3 9 3 9 3 9 3 10 3 10 3 12 4 1 42 42 4 3 4 3 4 4 4 4 4 5 4 5 4 6 4 6 4 7 5 1 9 2 5 6 5 8 5 10 A 1 A 2 A 2 A 2 A 2 AS A 3 A X 1 Figures 1 1 SP Switch high level diagram 1 2 SP Switch wrap plugs 1 3 SP Switch inner chassis and front chassis cables 2 1 Front view of frame locations i 2 2 Front view of multi switch frame locations 2 3 Front view of 49 inch frame locations 2 4 Rear view of frame locations 2 5 SPS SPS 8 assembly fs i 2 6 RS 6000 SP connector details as s seen at receiving ends not at cable ends 2 7 Frame cabling routing path in rear of RS 6000 SP frame 1 93 m frame 2 8 Frame cabling routing path in rear of RS 6000 SP frame 2 01 m frame 2 9 Frame cable routing paths in rear of RS 6000 SP multi switch frame F C 2030 1 1 931 m frame 2 10 Frame cable routing paths in rear of RS 6000 sp multi switch frame F C 2030 1 2 O11 m frame eRe oe oe Ro ite w 2 a 2 Bde aoe 4 1 Handling an anti static device 4 2 Removing the SP Switch fans 4 3 Removing the SP Switch f
114. uctive when used on a running system After using css snap to collect diagnostic information it s best to run usr Ipp ssp css rc switch in order to reset reload the switch adapter and eliminate residual effects of these utilities This procedure should be used only under the direction of the IBM Support Center Note css snap is located in the usr lpp ssp css directory Under normal circumstances it will collect the following cable_miswire cable_miswire old core fault service daemon dump file css snap log css_dump out daemon stderr daemon stdout dtbx trace dtbx failed trace errpt out most recent errpt a and errpt entries flt fs_daemon_print file fs_dump out netstat out current netstat I css0 and netstat m out top rc switch log regs out router log scan_out log scan_save log tb_dump out vdidl out worm trace Chapter 3 Service procedures 3 3 The files ending in out are produced by running the appropriate command to dump internal in memory trace information or dump data to a file The completed output file will be found in var adm SPlogs css css snap date time tar Z css snap avoids flooding var by following these rules e If less than 10 of var is free css snap exits e If the CSS portion of var is more than 30 of the total space in var css snap erases old snap files until the CSS share sinks below 30 If successful css snap proceeds If not it exits css snap is calle
115. ue service Table 1 2 Environmental messages for switches Condition Action P48OK Go to shutdownP5 Go to shutdownP12 shutdownN5 fanfail Go to shutdownTemp Go to PS1Fail Go to PS2Fail P33High P33Low voltP33Range PSFuseGood Chapter 1 Maintenance Analysis Procedures MAPS 1 5 SP Switch environment MAP 0600 Step 0600 004 Perspectives indicated a shutdown condition and Table 1 2 on page 1 5 directed you to this step 1 One or more of the following conditions exist e Voltage out of range 5 V shutdownP5 e Voltage out of range 12 V shutdownP 12 e Voltage out of range 5 V shutdownN5 2 Have the customer remove the SP Switch from the active configuration and power off the SP Switch Put the switch into service position Refer to 4 Check the following hardware items e Cable conditions at switch supervisor card SOO SP J102 e Cable conditions at power supply SOO PC P4 and S00 PC P6 e Cable conditions of wires especially the inner chassis cable 5 Leave the cable disconnected at the switch power card 6 Does the switch supervisor control cable appear to be okay e If yes go to Step 0600 0031 e lf the switch supervisor control cable S00 SP P102 appears to have a problem go to EStepl d Refer to Priority 4 and replace the cable
116. valent e The activities in Performing the Customer 50 60 Hz Power Receptacle Safety Check have been performed on all customer power source outlets and cable connectors SPSFDO15 DANGER Ensure that the customer s branch distribution circuit breakers customer power source circuit breakers to the ac power outlets are off and tagged with DO NOT OPERATE tags S229 0237 or national language equivalent SPSFDO16 DANGER Both the SEPBU power chassis and the PDU 48 V dc power chassis are field replaceable units FRUs which contain NO serviceable parts they are labeled as such Do not attempt to isolate or repair these components since doing so may result in severe injury or even death SPSFDO1 7 Caution notices CAUTION The weight of the PDU assembly 48 V dc power chassis and the SEPBU power chassis is greater than 18 Kg 40 Ibs Be careful when removing or installing Remove all 48 V dc power supplies from the power chassis before removing or installing the power chassis SPSFC001 CAUTION The unit weight exceeds 18 Kg 40 Ibs and requires two service personnel to lift SPSFC002 Safety and environmental notices xi CAUTION The covers are to be closed at all times except for service by trained service personnel SPSFC003 CAUTION When the unit is being serviced the covers should not be left off or opened while the machine is running unattended SPSFCO004 CAUTION
117. var adm SPlogs css out top file indicated a switch problem and all connection appeared to be okay Note Be aware that in large systems connections for SP Switches not in the multi switch frame are usually scattered throughout the out top file 1 If this is a multi frame system check the indicated jacks against the wiring charts for multi frame switch systems 2 Is this a multi frame system with problems reported on all switch data cables from one or more switches e If yes a The problem may be caused by clocking or grounding between frames e If no go to Step 0620 023 Some but not all switch data cables appear to be having problems 1 Depending on whether the problem is a wrapped port or a switch to switch connection perform one of the following steps e wrapped port wrap plug installed Remove the existing wrap plug Install a different wrap plug on the jacks indicated e switch to switch connection Disconnect the switch data cable from both ports indicated Install wrap plugs on the jacks indicated Chapter 1 Maintenance Analysis Procedures MAPs 1 29 SP Switch function MAP 0620 Note When unplugging the two ends of the suspect cable check var adm SPlogs css out top to verify only two 2 ports were lost If four 4 ports were lost then two cables were swapped across that switch to switch connection Connect the cables correctly and check var adm SPlogs css out top for any other errors 2 If this is a switch to
118. w 1 The Hardware Perspective should open with a node pane displayed If it does not or if you would like to open an additional node pane a Click the Add Pane icon on the tool bar e The Add Pane dialog box opens From the Pane Type pull down select Nodes Select your choice of adding the pane to the current window or to a new window If desired enter a new pane title Click OK to open the pane and close the dialog box 9205 3 4 RS 6000 SP SP Switch Service Guide In the Node pane click the icon of the node you want to verify Click the Notebook icon on the tool bar e When the Notebook window opens make certain that the Node Status tab is selected The Node failure attribute displays the status of the node supervisor e No displayed in a green box indicates that the node supervisor has not failed and the supervisor is responding e Yes displayed in a red box indicates that the node supervisor has failed and it is not responding Note Clicking Help in the Notebook window s lower right corner displays attribute descriptions Frame supervisor verification From the Hardware Perspectives window 1 Unless you have saved display settings the Hardware Perspective does not open with a frame pane displayed To open a frame pane a Click the Add Pane icon on the tool bar e The Add Pane dialog box opens From the Pane Type pull down select Frames and Switches Select your choice
119. wer occurs during the download process the adapter can be left with no microcode If this happens repeat the microcode download If unsuccessful replace the adapter 7133 Disks can also be updated however the method varies depending upon which disks are attached If they are 4 5 GB or 9 1 GB Scorpion disks and the AIX version is either 4 1 5 or 4 2 1 then run dsh ssadload u to update the disks Other disks will be updated by a cfgmgr or reboot cycle Running diagnostics on a switch port Before running diagnostic tests on a switch to diagnose node to switch problems you must identify the port number For example Bulkhead jack 6 on switch 17 frame 1 is identified as E 1 S17 BH J6 To run the test from the control workstation for this example the syntax of the command is usr 1pp ssp bin spd wrap_test j EO1 S17 BH J6 3 12 RS 6000 SP SP Switch Service Guide Chapter 4 FRU removals and replacements Handling static sensitive devices Procedures for switch assemblies Removing the switch assembly Replacing the switch assembly Removing the switch fans Replacing the switch fans Removing the switch front chassis cable Replacing the switch front chassis cable Removing the switch supervisor card Replacing the switch supervisor card Removing the switch inner chassis cable Replacing the switch inner chassis cable Removing the switch power cards Replacing the switch power cards 4 4 2 42 4
120. wire Action Link status Link has been removed 1 Check this connection against the actual hardware cabling from network probable miswire 2 If the cabling does not match correct the cabling problem then go to step 5 3 Check the node s hostname or IP address and switch node number against the configuration using the command splstdata s pg standard node SDRGetObjects DependentAdapter dependent node 4 lf this occurs on all nodes of a frame check for a logical to physical frame number mismatch 5 Have the customer update the configuration or you can correct the cabling as appropriate 6 Zi for next problem or to verify fix Device status Destination not Description Possible hardware problem a9 reachable SPS Action Go to Link status Link has been removed from network not connected Step 0620 004 You were directed here by an Error Condition listed in able 1 7 on page 1 19 which lists functional problems reported in var adm SPlogs css out top file This file lists all switch data connections with comment lines indicating the various types 1 Check the entire var adm SPlogs css out top file for errors before proceeding to the prioritized table 2 Use the following prioritized table to service problems reported in the var adm SPlogs css out top file The messages are from comment lines preceding the error Notes a Be aware that wrap plugs or terminators used in the following s
121. witch fans replacing 4 3 switch front chassis cable removing 4 3 switch front chassis cable replacing 4 4 switch function MAP flowcharted 1 17 switch inner chassis cable removing 4 5 switch inner chassis cable replacing 4 6 switch inner chassis cable SPS 1 8 switch partitioning 3 8 switch power card removing 4 6 switch power card replacing 4 7 switch power MAP flowcharted 1 13 switch resources removing 3 7 switch resources restoring 3 7 switch supervisor card removing 4 4 switch supervisor card replacing 4 5 switch supervisor self test 3 4 switch supervisor verification 3 5 T tools overview service 3 1 trademarks A 1 U unfencing nodes 3 8 updates on SP nodes installing firmware 3 10 V verification frame supervisor 3 5 node supervisor 3 4 switch supervisor 3 5 using Perspectives 3 4 verification and isolation procedures switch supervisor self test 3 4 W who should use book xv wrap plugs 1 4 Index X 3 X 4 RS 6000 SP SP Switch Service Guide Reader s comments We d like to hear from you RS 6000 SP SP Switch Service Guide Publication No GA22 7443 04 Overall how satisfied are you with the information in this book Very Satisfied Satisfied Neutral Overall satisfaction OD How satisfied are you that the information in this book is Very Satisfied Satisfied Neutral Accurate Complete Easy to find Easy to understand Well organized Appli
122. witch into service position refer to What problem type is given in the condition e If the condition states that you have a P33 type problem P33Low P33High or voltP33Range go to i e lf the condition states that you have a PSxFail or PSxFuseGood problem go to na ole GNM Step 0600 019 The condition states that you have a P33 type problem P33Low P33High or voltP33Range 1 Check cable conditions at switch supervisor card SOO SC P5 2 Check cable conditions at switch power supply SOO PC P4 and S00 PC P6 3 Do the inner chassis cables appear to be okay e Ifthe inner chassis cables appear to be okay and the condition is P33low or voltP33Range go to e Ifthe inner chassis cables appear to be okay and the condition is P33High go to e If the inner chassis cables do not appear to be okay go to and replace the cables Step 0600 020 You have a P33low or voltP33Range condition and the inner chassis cables appear to be okay 1 Disconnect connectors SO0 PC P4 SOO PC P6 at the power cards and S0O0 PC P5 at the supervisor card 1 10 RS 6000 SP SP Switch Service Guide SP Switch environment MAP 0600 2 Remove the switch supervisor card refer to 3 Using a digital multimeter measure resistance at the planar connection for the supervisor card between pins 12A and 12B e The resistance should be in a range of 4 to 20 ohms 4 Is the measured resistance in the acceptable range e If yes If no go to
Download Pdf Manuals
Related Search
Related Contents
Beko CS232020S fridge-freezer OWNERS MANUAL Rainwater Harvesting and Water SUPPLY System FV3000 - FAST CORPORATION[株式会社ファースト] Magnese MA-204019 Manuel d`utilisation pédagogie - Formation informatique à Bordeaux 消費電力測定器の貸出しについて(PDF形式、137KB) Network Technologies SE-DVI-2 User's Manual Elipar™ FreeLight™ 2 do Manual Philips SensoTouch wet and dry electric shaver RQ1155 Copyright © All rights reserved.
Failed to retrieve file